Preservation Metadata: Theory and Evolving Practice
Jenn RileyMetadata LibrarianIndiana University Digital Library Program
Here’s (some of) what we’re facing
11/17/2010
Image from Indiana University Media Preservation Initiative http://research.iu.edu/resources/media_preservation/slideshow/index.html
Image by Paul Downey – CC by-nc-sa 2.0 http://www.flickr.com/photos/psd/211603602/
Image by Seth Anderson – CC by-sa 2.0
http://www.flickr.com/photos/swanksalot/2704017177/
Indiana Library Federation 2010 - Jenn Riley 3
Core issues
11/17/2010
What to preserve?Media? (some digital, some analog)Bits?Intellectual content?
How to preserve it?Migration?
Emulation?
How does preservation relate to access?
Not surprisingly, we’re learning that “it depends.”
Indiana Library Federation 2010 - Jenn Riley 4
Digital preservation is…
Not a solved problem, but also not an unknown one
An active research area Dependent on preservation
metadata, but… More about policy than technology
11/17/2010
Indiana Library Federation 2010 - Jenn Riley 5
Central framework: OAIS
OAIS = Open Archival Information System
“Reference Model” for an “organization of people and systems” to facilitate long term preservation of content
ISO 14721:2003 OAIS-compliant repositories meet
preservation responsibilities as laid out in the reference model11/17/2010
Indiana Library Federation 2010 - Jenn Riley 6
High level picture of the OAIS model
11/17/2010
Figure from OAIS Reference Model, p. 4-1 http://public.ccsds.org/publications/archive/650x0b1.pdf
Indiana Library Federation 2010 - Jenn Riley 7
Central standard: PREMIS
PREservation Metadata Implementation Strategies
Development began in 2003 under leadership of OCLC and (then) RLG
Data dictionary released 2005; revised 2008
LC is current maintenance agency
11/17/2010
Digital Library Brown Bag Series 82/7/2007
How PREMIS defines preservation metadata “The information a repository uses to
support the digital preservation process” Metadata that supports
viability renderability understandability authenticity identity
Mandatory elements represent “the minimum amount for [a] second repository to accept custody of [a] digital object and assume responsibility for its long-term preservation”
Indiana Library Federation 2010 - Jenn Riley 9
PREMIS data model
11/17/2010
Figure from PREMIS 2.0 report, p. 5, http://www.loc.gov/standards/premis/v2/premis-report-2-
0.pdf
Indiana Library Federation 2010 - Jenn Riley 10
Objects and intellectual entities
Intellectual entities are the things we’re interested in And can be groups of things
Objects are specific things a repository manages
11/17/2010
FileBitstream
Bitstream
FileFileFileFileFileFile
Representation(of an intellectual entity)
Indiana Library Federation 2010 - Jenn Riley 1111/17/2010
Indiana Library Federation 2010 - Jenn Riley 12
But it’s not just about PREMIS
11/17/2010
Figure from Priscilla Caplan, “Understanding PREMIS” p. 5, http://www.loc.gov/standards/premis/understanding-
premis.pdf
Indiana Library Federation 2010 - Jenn Riley 13
Image technical metadata
11/17/2010
NISO Metadata for Images in XMLhttp://www.loc.gov/standards/mix/
Indiana Library Federation 2010 - Jenn Riley 14
Text technical metadata
11/17/2010
Technical Metadata for Text (TextMD)http://www.loc.gov/standards/textMD/
Indiana Library Federation 2010 - Jenn Riley 15
Audio technical metadata
11/17/2010
Audio Engineering Society Core Audio (AES-X098B)http://www.aes.org/standards/meetings/project-status.cfm
Indiana Library Federation 2010 - Jenn Riley 16
Video technical metadata
11/17/2010
LC A/V Prototyping ProjectVideo Metadata Schemahttp://www.loc.gov/rr/mopic/avprot/metsmenu2.html
Indiana Library Federation 2010 - Jenn Riley 17
Preservation metadata seen in some other standard formats
11/17/2010
Indiana Library Federation 2010 - Jenn Riley 18
Wait, we have to type all of this in?
Nope. All this stuff represents a new approach to metadata creation.
Software that creates and manages digital file also creates technical, process history, preservation metadata about it Maybe also by asking the user for some specific
input Management tools package together metadata
and content for transfer and repository ingest But to implement your preservation policies,
you need to understand what metadata is being created, how it’s stored, and when.
11/17/2010
Indiana Library Federation 2010 - Jenn Riley 19
Some digital preservation initiatives to watch (1)
National Digital Information Infrastructure and Preservation Program (NDIIPP) <http://www.digitalpreservation.gov/>
PREMIS in METS <http://pim.fcla.edu/> DAITSS Digital Preservation Repository
Software, with a dark archive focus <http://daitss.fcla.edu/>
Trustworthy Repositories Audit & Certification (TRAC) <http://www.crl.edu/sites/default/files/attachments/pages/trac_0.pdf>
11/17/2010
Indiana Library Federation 2010 - Jenn Riley 20
Some digital preservation initiatives to watch (2)
Digital Repository Audit Method Based on Risk Assessment (DRAMBORA) <http://www.repositoryaudit.eu/>
Lots of Content Keeps Stuff Safe (LOCKSS) Initiative <http://lockss.stanford.edu/>
Preserving Virtual Worlds project (including video games!) <http://pvw.illinois.edu/pvw/>
Data Preservation Alliance for the Social Sciences (Data-PASS) <http://www.icpsr.umich.edu/icpsrweb/DATAPASS/>
11/17/2010
Indiana Library Federation 2010 - Jenn Riley 21
For more information
[email protected] These presentation slides:
http://www.dlib.indiana.edu/~jenlrile/presentations/ilf2010/ilf2010.pptx
OAIS reference model:http://public.ccsds.org/publications/archive/650x0b1.pdf
PREMIS data dictionary:http://www.loc.gov/standards/premis/
Thank you!11/17/2010
Top Related