PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS &...

17
PREMIS Implementation Fair Sa n Francisco, CA, October 7 20 09 1 Stanford Digital Reposito PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge Motifs San Mateo, CA

Transcript of PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS &...

Page 1: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

1

Stanford Digital Repository

PREMIS & Geospatial Resources

Nancy J. HoebelheinrichKnowledge MotifsSan Mateo, CA

Page 2: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

2

Outline

Brief descriptions:High Resolution orthoimagery (HRO)GIS dataset

Complexities using PREMIS for these Geospatial Resources

Further investigations

Page 3: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

3

High Resolution Orthoimagery (HRO): Monterey Bay Aquariam

Description: • Collection of various

files comprising a classic “shapefile”

• Additional files created by SW for rendering

Curatorial Decisions:• What is to be

archived?• For what purpose?• What is context for the

artifact?• How much of context

should be archived?

Page 4: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

4

Compiled GIS Dataset: Network of San Francisco Bay Area = “snapshot”

Description: • Compilation of data

“layers” • Only certain data points

featured from layers• Output:

• Image as tourist map• MS Access “personal

geodatabase”

Curatorial Decisions:• What is to be

archived?• For what purpose?• What is context for the

artifact?• How much of context

should be archived?

Page 5: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

5

Application of PREMIS to HRO & GIS Dataset: complexities

• Structural Aspects• Contextual Aspects:• Function / purpose of data subset created• Provenance: what’s important to record• Events to be recorded

Page 6: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

6

Application of PREMIS to GIS Dataset: Structure of artifact

Dependent upon archival unit

?map output ?compiled dataset

output as .mdb ?each layer component ?all of the above

how determine relationships – got MD?

Use of <object>relationship

Or, Use of METS Both

Page 7: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

7

Application of PREMIS to GIS Dataset: Context & Provenance with artifact

Intention: To make earth science data useful to the scientific community, *longterm*

Earth science data community emphasizing the importance of context & provenance

Page 8: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

8

Guidance from NASA & NOAA: from US Global Change Climate Report

“As necessary the processing software, instrument operations history and related science documentation

Information intended to support understanding and use of the data in the long term (i.e., after instrument operations end)

Standard linkages among the critical data and instrument information, including provenance information”

NOTE: Lineage or “provenance” data considered as the processing steps used to create scientific data product

Page 9: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

9

Guidance from NASA & NOAA: more specifically, e.g.,

Instrument/ sensor characteristics such as performance measurements calibration data & method

Processing algorithms & their scientific basis for creation of the product, for example

Bibliography of pertinent Technical Notes and articles reporting on research using the data set

Feedback from users of the dataset etc.,

Page 10: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

10

Context & Provenance in PREMIS – where? <object>

significantProperties using extension element?

objectCharacteritics using extension element

Use elements from domain specific MD standards, e.g, FGDC *see ref in Sources

North American Profile of ISO 19115:2003 – Geographic Information, Metadata

Data Documentation Initiative (DDI)

Page 11: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

11

PREMIS & Complex Geospatial Data For more detail on using domain specific metadata, see “An

Investigation into Archiving Geospatial data Formats “ prepared for NGDA Project, funded by NDIIPP (http://www.ngda.org/research.php) Approaches of FGDC, PREMIS, and Center for International Earth Science

Information Network (CIESIN)‘s Geospatial Electronic Record (GER) model on basis of:

Environment/ computer platform Semantic underpinnings domain specific terminology provenance data quality appropriate use

Page 12: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

12

Context & Provenance in PREMIS – where?

What influenced or impacted the creation of the dataset to understand the data

Addition of metadata to the resource, e.g., USGS which added FGDC MD to HRO from

Monterey Bay Water Resource

<event>

Example Event to be documented: Merge c:\temp\states1;c:\temp \states2; c:\temp\

USA

Page 13: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

13

Context & Provenance in PREMIS – where?

?Agency adding descriptive MD to HRO, e.g., USGS’ addition of descriptive info from the Monterey Water District

<agent>

Page 14: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

14

Implementation issues

Heavy emphasis upon receiving documentation, metadata, and reference materials at time of creation and/or acquisition from: governmental agency sponsoring research scientists creating

In order to do this practicably, need to be able to determine factors for creation / archiving the artifact. -- include link to taxonomy in the PREMIS schema in lieu of TYPE semantic unit

How? Chance of being built into tools eventually, e.g., GeoMedia, ESRI

ArcGIS tools In process of building domain concensus of importance of archiving

Page 15: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

15

Further investigations

ESIP Federation – building a MD testbed for preservation MD (NOAA & NASA, etc.,)

American Geophysical Union position statement & follow-up

NDIIPP geospatial projects including NGDA and collaborative states

Discussions of “significant properties / characteristics” Dappert, Farquhar article

Page 16: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

16

Sources

US Global Change Research Program: www.knmi.nl/.../DSWG_Aura2009_Moses_ArchivingAuraData2009.ppt

“An Investigation into Archiving Geospatial data Formats “ prepared for NGDA Project, funded by NDIIPP (http://www.ngda.org/research.php)

NAP – ISO 19113:2003 http://www.fgdc.gov/standards/projects/incits-l1-standards-projects/NAP-Metadata

ESIP Federationhttp://www.esipfed.org/ • Preserving the Context of Science Data. Greg Janée and James Frew (2008). Eos,

Transactions, American Geophysical Union 89(53), Fall Meeting Supplement, abstract U13D-06.

• Preserving Geospatial Data: The National Geospatial Digital Archive's Approach. Greg Janée (2009). Archiving 2009: Final Program and Proceedings (Arlington, Virginia; May

4-7, 2009): 25-29. • Significance is in the Eye of the Stakeholder, Angela Dappert, Adam Farquhar, The

British Library (2009)

Page 17: PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.

PREMIS Implementation Fair San Francisco, CA, October 7 2009

17

Questions? / comments?

Nancy J. Hoebelheinrich

[email protected]