METS in the OCLC Digital Archive
OCLC Online Computer Library Center
METS in the OCLC Digital Archive
Taylor Surface
Director, Digital Content Management Services
October 27, 2003
Agenda
- OCLC’s Digital Archive
- Our METS implementation
- Extension schemas
- Description, vocabularies, requirements
OCLC Digital Archive Tools
Web Archiving
- Item-by-item archiving of web pages and web documents
- HTML and PDF and associated files
- DIP uses METS; SIP is constructed on the fly

Batch Ingest
- Collection-based archiving of resources that the library has saved onto a server, disc, or tape
- Primarily TIFFs
- SIP uses METS; DIP not implemented at this time
Implications for OCLC’s METS Implementation
- Different profiles needed for batch ingest and web tool
- Batch ingest currently accepts nonhierarchical objects only
METS in Batch Ingest
- Downloadable Submission Builder application creates SIP
- Submission Builder creates METS document based on user’s tab-delimited metadata file and manifest file (list of filenames)
- Manifest file, also part of SIP, is encoded in METS and has links to object-level METS file
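As a rough illustration of the Submission Builder step described above, the following Python sketch builds one METS document from one row of a tab-delimited metadata file plus a list of filenames. It is not OCLC's implementation: the column names (`id`, `title`, `date`, `language`), the un-namespaced descriptive elements, the `OTHERMDTYPE` label, and the sample filenames are all assumptions made for illustration. Only the METS element names and namespace are taken from the METS schema itself.

```python
import csv
import io
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def q(tag):
    """Return a METS element name in ElementTree's {namespace}tag form."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(record, filenames):
    """Build one METS document for one non-hierarchical object."""
    root = ET.Element(q("mets"), {"OBJID": record["id"]})
    ET.SubElement(root, q("metsHdr"))

    # Descriptive section: field names are hypothetical, not oclc_dm.xsd's.
    dmd = ET.SubElement(root, q("dmdSec"), {"ID": "DMD1"})
    wrap = ET.SubElement(
        dmd, q("mdWrap"), {"MDTYPE": "OTHER", "OTHERMDTYPE": "OCLC-DM"}
    )
    xml_data = ET.SubElement(wrap, q("xmlData"))
    for field in ("title", "date", "language"):
        ET.SubElement(xml_data, field).text = record[field]

    # File section and a flat structural map: one <file> and <fptr> per file.
    file_grp = ET.SubElement(ET.SubElement(root, q("fileSec")), q("fileGrp"))
    struct_map = ET.SubElement(root, q("structMap"))
    div = ET.SubElement(struct_map, q("div"), {"DMDID": "DMD1"})
    for n, name in enumerate(filenames, start=1):
        file_id = f"FILE{n}"
        file_el = ET.SubElement(file_grp, q("file"), {"ID": file_id})
        ET.SubElement(
            file_el, q("FLocat"), {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": name}
        )
        ET.SubElement(div, q("fptr"), {"FILEID": file_id})
    return root


# Example input standing in for the user's tab-delimited metadata file.
metadata_tsv = "id\ttitle\tdate\tlanguage\nobj001\tSample map\t2003-10-27\teng\n"
record = next(csv.DictReader(io.StringIO(metadata_tsv), delimiter="\t"))
mets_root = build_mets(record, ["map_001.tif", "map_002.tif"])
mets_xml = ET.tostring(mets_root, encoding="unicode")
```

The sketch omits the administrative metadata section and the METS-encoded manifest; both would be needed for a real SIP.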
METS in Batch Ingest (SIP)
- METS document (one per object) sent to OCLC as part of SIP, along with content objects for batch ingest
- Objects are ingested and preservation metadata records are generated automatically based on the information in SIP
Submission Builder Requirements
- Windows 2000, NT4, or XP
- Intel Pentium III, 864 MHz or higher
- At least 256 MB RAM
- 8.5 MB disk space
- Internet connection active during SIP creation (validates against METS at LC web site)
Submission Builder
METS in Web Archiving Tools (DIP)
- The dissemination of content objects ingested on an object-by-object basis results in a METS document.
- Hierarchical as well as non-hierarchical objects are encoded in METS for use as a DIP from OCLC Digital Archive.
Development Plans
- METS-based batch dissemination for both batch ingest and web tools
- Acceptance of hierarchical objects in batch ingest
- Keeping profiles updated as tools change
METS Extension Schemas
- Header: No extension
- Descriptive Metadata Section: OCLC descriptive schema http://digitalarchive.oclc.org/schemas/oclc_dm.xsd
- File Section: No extension
- Structural Map Section: No extension
- Behavior Section: No extension
More Extension Schemas
Administrative Metadata Section:
- MIX schema http://www.loc.gov/standards/mix/mix.xsd
- textMD schema http://dlib.nyu.edu/METS/textmd.xsd
- OCLC provenance schema http://digitalarchive.oclc.org/schemas/oclc_prov.xsd
Rules of Description, Controlled Vocabularies
Date: Must be in W3C-DTF format
Language: Must be in ISO 639-2 format
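The two rules above lend themselves to a quick pre-submission check. The following Python sketch is one possible way to do it, not part of any OCLC tool. The regular expression covers the W3C-DTF granularities (year, year-month, full date, and date with time plus a time zone designator) but does not verify that a day exists in its month. The language check only tests for a three-letter lowercase code found in a small sample set; `SAMPLE_CODES` is an illustrative subset, and a real check would load the full ISO 639-2 code list.

```python
import re

# W3C-DTF granularities: YYYY, YYYY-MM, YYYY-MM-DD, and date plus time
# with a mandatory time zone designator (Z or +hh:mm / -hh:mm).
W3CDTF_RE = re.compile(
    r"\d{4}"
    r"(-(0[1-9]|1[0-2])"
    r"(-(0[1-9]|[12]\d|3[01])"
    r"(T([01]\d|2[0-3]):[0-5]\d(:[0-5]\d(\.\d+)?)?"
    r"(Z|[+-]([01]\d|2[0-3]):[0-5]\d))?"
    r")?)?",
    re.ASCII,
)

# ISO 639-2 codes are three lowercase letters.
ISO639_2_SHAPE = re.compile(r"[a-z]{3}")

# Illustrative subset only (assumption): a real check needs the full list.
SAMPLE_CODES = {"eng", "fre", "ger", "spa"}


def is_w3cdtf(value):
    """Return True if value has the shape of a W3C-DTF date or date-time."""
    return W3CDTF_RE.fullmatch(value) is not None


def is_known_language(value, known_codes=SAMPLE_CODES):
    """Return True if value looks like an ISO 639-2 code and is in known_codes."""
    return ISO639_2_SHAPE.fullmatch(value) is not None and value in known_codes
```

For example, `is_w3cdtf("2003-10-27")` is accepted while `is_w3cdtf("10/27/2003")` is rejected, and `is_known_language("en")` fails because two-letter codes belong to ISO 639-1, not ISO 639-2.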
Some of Our Structural Requirements
Every METS document must have:
- <metsHdr>
- Descriptive section: METS document for each object contains one <dmdSec>; metadata conforms to oclc_dm schema
- Administrative section: MIX used for image technical metadata; textMD used for text; section also contains provenance information using oclc_prov.xsd OCLC extension schema
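The structural requirements listed above can be approximated with a simple presence check. The Python sketch below is an assumption-laden illustration rather than OCLC's validator: it only confirms that a document has a `<metsHdr>`, exactly one `<dmdSec>`, and at least one `<amdSec>` (the METS element for the administrative section). It does not validate the embedded metadata against the OCLC, MIX, or textMD schemas, and the function name `check_structure` and the sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET

METS = "{http://www.loc.gov/METS/}"


def check_structure(xml_text):
    """Return a list of structural problems; an empty list means none found."""
    root = ET.fromstring(xml_text)
    problems = []
    if root.tag != METS + "mets":
        problems.append("root element is not <mets>")
    if root.find(METS + "metsHdr") is None:
        problems.append("missing <metsHdr>")
    dmd_count = len(root.findall(METS + "dmdSec"))
    if dmd_count != 1:
        problems.append(f"expected one <dmdSec>, found {dmd_count}")
    if root.find(METS + "amdSec") is None:
        problems.append("missing <amdSec>")
    return problems


# Two made-up documents: one that passes and one that fails every check.
good_doc = (
    '<mets xmlns="http://www.loc.gov/METS/">'
    '<metsHdr/><dmdSec ID="DMD1"/><amdSec/>'
    "<fileSec/><structMap><div/></structMap></mets>"
)
bad_doc = (
    '<mets xmlns="http://www.loc.gov/METS/">'
    '<dmdSec ID="A"/><dmdSec ID="B"/></mets>'
)
good_problems = check_structure(good_doc)
bad_problems = check_structure(bad_doc)
```

Returning a list of messages instead of raising on the first failure lets a submitter see every problem in one pass.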
Technical Requirements
Any version of these formats:
- HTML (including .css and .js)
- PDF
- TXT
- TIF
- JPG
- GIF
- BMP
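A manifest of filenames could be screened against this list before a SIP is built. The Python sketch below is only an approximation: it judges files by extension, which is not the same as identifying their actual format, and the variant extensions `.htm`, `.tiff`, and `.jpeg` are assumptions that do not appear in the list above.

```python
from pathlib import PurePath

# Extensions for the formats listed above; .htm, .tiff, and .jpeg are
# assumed variants, not taken from the original list.
ACCEPTED_EXTENSIONS = {
    ".html", ".htm", ".css", ".js",
    ".pdf", ".txt",
    ".tif", ".tiff", ".jpg", ".jpeg", ".gif", ".bmp",
}


def unsupported_files(filenames):
    """Return the filenames whose extension is not in the accepted set."""
    return [
        name
        for name in filenames
        if PurePath(name).suffix.lower() not in ACCEPTED_EXTENSIONS
    ]


# Hypothetical manifest entries.
rejected = unsupported_files(["a.TIF", "b.pdf", "c.docx", "notes"])
```

Here `rejected` contains `c.docx` and `notes`; the comparison is case-insensitive, so `a.TIF` passes.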
Resources
- Digital Archive web site: http://www.oclc.org/digitalarchive/default.htm
- Navigate to Support, then Documentation, for “Batch Ingest Guide” and “Learning to Use Web Archiving Tools”; each is a comprehensive guide to that part of the system
Questions?