PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS &...
-
Upload
alison-small -
Category
Documents
-
view
212 -
download
0
Transcript of PREMIS Implementation Fair San Francisco, CA, October 7 2009 1 Stanford Digital Repository PREMIS &...
PREMIS Implementation Fair San Francisco, CA, October 7 2009
1
Stanford Digital Repository
PREMIS & Geospatial Resources
Nancy J. HoebelheinrichKnowledge MotifsSan Mateo, CA
PREMIS Implementation Fair San Francisco, CA, October 7 2009
2
Outline
Brief descriptions:High Resolution orthoimagery (HRO)GIS dataset
Complexities using PREMIS for these Geospatial Resources
Further investigations
PREMIS Implementation Fair San Francisco, CA, October 7 2009
3
High Resolution Orthoimagery (HRO): Monterey Bay Aquariam
Description: • Collection of various
files comprising a classic “shapefile”
• Additional files created by SW for rendering
Curatorial Decisions:• What is to be
archived?• For what purpose?• What is context for the
artifact?• How much of context
should be archived?
PREMIS Implementation Fair San Francisco, CA, October 7 2009
4
Compiled GIS Dataset: Network of San Francisco Bay Area = “snapshot”
Description: • Compilation of data
“layers” • Only certain data points
featured from layers• Output:
• Image as tourist map• MS Access “personal
geodatabase”
Curatorial Decisions:• What is to be
archived?• For what purpose?• What is context for the
artifact?• How much of context
should be archived?
PREMIS Implementation Fair San Francisco, CA, October 7 2009
5
Application of PREMIS to HRO & GIS Dataset: complexities
• Structural Aspects• Contextual Aspects:• Function / purpose of data subset created• Provenance: what’s important to record• Events to be recorded
PREMIS Implementation Fair San Francisco, CA, October 7 2009
6
Application of PREMIS to GIS Dataset: Structure of artifact
Dependent upon archival unit
?map output ?compiled dataset
output as .mdb ?each layer component ?all of the above
how determine relationships – got MD?
Use of <object>relationship
Or, Use of METS Both
PREMIS Implementation Fair San Francisco, CA, October 7 2009
7
Application of PREMIS to GIS Dataset: Context & Provenance with artifact
Intention: To make earth science data useful to the scientific community, *longterm*
Earth science data community emphasizing the importance of context & provenance
PREMIS Implementation Fair San Francisco, CA, October 7 2009
8
Guidance from NASA & NOAA: from US Global Change Climate Report
“As necessary the processing software, instrument operations history and related science documentation
Information intended to support understanding and use of the data in the long term (i.e., after instrument operations end)
Standard linkages among the critical data and instrument information, including provenance information”
NOTE: Lineage or “provenance” data considered as the processing steps used to create scientific data product
PREMIS Implementation Fair San Francisco, CA, October 7 2009
9
Guidance from NASA & NOAA: more specifically, e.g.,
Instrument/ sensor characteristics such as performance measurements calibration data & method
Processing algorithms & their scientific basis for creation of the product, for example
Bibliography of pertinent Technical Notes and articles reporting on research using the data set
Feedback from users of the dataset etc.,
PREMIS Implementation Fair San Francisco, CA, October 7 2009
10
Context & Provenance in PREMIS – where? <object>
significantProperties using extension element?
objectCharacteritics using extension element
Use elements from domain specific MD standards, e.g, FGDC *see ref in Sources
North American Profile of ISO 19115:2003 – Geographic Information, Metadata
Data Documentation Initiative (DDI)
PREMIS Implementation Fair San Francisco, CA, October 7 2009
11
PREMIS & Complex Geospatial Data For more detail on using domain specific metadata, see “An
Investigation into Archiving Geospatial data Formats “ prepared for NGDA Project, funded by NDIIPP (http://www.ngda.org/research.php) Approaches of FGDC, PREMIS, and Center for International Earth Science
Information Network (CIESIN)‘s Geospatial Electronic Record (GER) model on basis of:
Environment/ computer platform Semantic underpinnings domain specific terminology provenance data quality appropriate use
PREMIS Implementation Fair San Francisco, CA, October 7 2009
12
Context & Provenance in PREMIS – where?
What influenced or impacted the creation of the dataset to understand the data
Addition of metadata to the resource, e.g., USGS which added FGDC MD to HRO from
Monterey Bay Water Resource
<event>
Example Event to be documented: Merge c:\temp\states1;c:\temp \states2; c:\temp\
USA
PREMIS Implementation Fair San Francisco, CA, October 7 2009
13
Context & Provenance in PREMIS – where?
?Agency adding descriptive MD to HRO, e.g., USGS’ addition of descriptive info from the Monterey Water District
<agent>
PREMIS Implementation Fair San Francisco, CA, October 7 2009
14
Implementation issues
Heavy emphasis upon receiving documentation, metadata, and reference materials at time of creation and/or acquisition from: governmental agency sponsoring research scientists creating
In order to do this practicably, need to be able to determine factors for creation / archiving the artifact. -- include link to taxonomy in the PREMIS schema in lieu of TYPE semantic unit
How? Chance of being built into tools eventually, e.g., GeoMedia, ESRI
ArcGIS tools In process of building domain concensus of importance of archiving
PREMIS Implementation Fair San Francisco, CA, October 7 2009
15
Further investigations
ESIP Federation – building a MD testbed for preservation MD (NOAA & NASA, etc.,)
American Geophysical Union position statement & follow-up
NDIIPP geospatial projects including NGDA and collaborative states
Discussions of “significant properties / characteristics” Dappert, Farquhar article
PREMIS Implementation Fair San Francisco, CA, October 7 2009
16
Sources
US Global Change Research Program: www.knmi.nl/.../DSWG_Aura2009_Moses_ArchivingAuraData2009.ppt
“An Investigation into Archiving Geospatial data Formats “ prepared for NGDA Project, funded by NDIIPP (http://www.ngda.org/research.php)
NAP – ISO 19113:2003 http://www.fgdc.gov/standards/projects/incits-l1-standards-projects/NAP-Metadata
ESIP Federationhttp://www.esipfed.org/ • Preserving the Context of Science Data. Greg Janée and James Frew (2008). Eos,
Transactions, American Geophysical Union 89(53), Fall Meeting Supplement, abstract U13D-06.
• Preserving Geospatial Data: The National Geospatial Digital Archive's Approach. Greg Janée (2009). Archiving 2009: Final Program and Proceedings (Arlington, Virginia; May
4-7, 2009): 25-29. • Significance is in the Eye of the Stakeholder, Angela Dappert, Adam Farquhar, The
British Library (2009)
PREMIS Implementation Fair San Francisco, CA, October 7 2009
17
Questions? / comments?
Nancy J. Hoebelheinrich