PBCore, METS, PREMIS, MODS, METSRights...oh my!

20
PBCore, METS, PREMIS, MODS, METSRights... oh my! Kara van Malssen Senior Research Fellow, NYU Preserving Digital Public Television AMIA 2008

description

Presentation given at the Association of Moving Image Archivists Conference, November 14, 2009 in Savannah, GA. Part of the panel PBCore: What is it good for?

Transcript of PBCore, METS, PREMIS, MODS, METSRights...oh my!

Page 1: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PBCore, METS, PREMIS, MODS, METSRights... oh my!Kara van MalssenSenior Research Fellow, NYUPreserving Digital Public Television

AMIA 2008

Page 2: PBCore, METS, PREMIS, MODS, METSRights...oh my!

A little bit about the Preserving Digital Public Television Project

• Identify at-risk born digital public television content

• Build an OAIS-compliant prototype repository

• Explore and apply standards• Create selection guidelines

• Research sustainability models, copyright encumbrances

GOALS:

Page 3: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PBS

Library of Congress

NYU

WNET WGBH

SIP site

Repository

Project Partners

Page 4: PBCore, METS, PREMIS, MODS, METSRights...oh my!

Producing Stations

WGBH

WNET

Station

B

Station

A

Station

C

PBS

Satellite

NYU PDPTV Prototype

Repository

Transmitting

Stations

WNET

Station

A

Station

C

Station

E

Station

G

Station

I

WGBH

Station

B

Station

D

Station

F

Station

H

Station

J

Submission Workflow

Page 5: PBCore, METS, PREMIS, MODS, METSRights...oh my!

• Create a prototype repository for long term retention

• Aggregate content from partner stations + PBS for sample programs

• Populate records with metadata that already exists (in station databases, files, scheduling systems, etc)

• Transform data and package content, while preserving relationships between items

NYU Goals:

Page 6: PBCore, METS, PREMIS, MODS, METSRights...oh my!

Important Vocabulary•The Repository: NYU

prototype preservation repository

•OAIS: Open Archival Information System

•SIP: Submission Information Package

•AIP: Archival Information Package

OAIS Terms!

Page 7: PBCore, METS, PREMIS, MODS, METSRights...oh my!

Applying standards• Normalize disparate metadata

• XML based

• One uniform scheme

• Easier to manage over the long term

• Rules, vocabularies, schemas help maintain consistency

Page 8: PBCore, METS, PREMIS, MODS, METSRights...oh my!

Production Master (mov)

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

SD Broadcast

Master (mpeg)

Production Master (mxf)

SIP Class 1: WNET National

Broadcast (Nature)

SIP Class 2: WGBH National

Broadcasts

SIP Class 3: WNET Local Broadcast

(New York Voices)

SIP Class 4: Religion and Ethics

PODS PROTRACK

TEAMSINMAGICDATABASE EXPORTS

ADDITIONAL ITEMS Scripts, etc

PODSPODS

PODS

Scripts, etcPRO

TRACK

PROTRACK

INMAGICINMAGIC

TEAMS

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mxf)

Production Master (mxf)

SD Broadcast

Master (mpeg)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mov)

Production Master (mov)

SD Broadcast

Master (mov/aiff/

m2v)

Challenge of managing

diverse

SIPs:

Page 9: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PDPTV metadata modelMETS: Metadata Encoding and Transmission StandardStructural and administrative

PBCore: Public Broadcasting Metadata DictionaryDescriptive and technical

PREMIS: Preservation Metadata Implementation StrategyTechnical preservation metadata

Page 10: PBCore, METS, PREMIS, MODS, METSRights...oh my!

METS: Metadata Encoding and Transmission Standard

• Provides a structure to bundle all content (essence + metadata) in one AIP

• Identifies types of metadata, but not the terms to define them (with a few exceptions)

METS

dmdSec

amdSec

techMD rightsMD sourceMD digiprovMD

fileSec

structMap

behaviorSec

Page 11: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PBCore: What is it good for?

• Descriptive metadata elements that are specific to public broadcasting

• Controlled vocabularies with broadcast terms

• Easy to map to from legacy station databases

• Granular technical metadata (PBCore 1.2)

➡ Accurately represents the file specific metadata➡ Can be auto populated using technical metadata

extraction tools & sytlesheets

Page 12: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PREMIS: Preservation Metadata Implementation Strategy

Intellectual Entity

Object

Rights

Agents

Events

Object Entity:•Creating application info

•Playback environment (hardware and software

Page 13: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PBCore

PREMIS

Issue of Redundancy between standards

METS

Agents

Checksums

Structure

File Size

HardwareSoftware

RightsRelationships

File Format

TitleCreatorDescription

Page 14: PBCore, METS, PREMIS, MODS, METSRights...oh my!

PBCore

PREMIS

Putting it all togetherMETS

Agents

Checksums

Structure

File Size

HardwareSoftware

RightsRelationships

File Format

TitleCreatorDescription

METSRights!

MODS

Descriptive elements only map to MODS

Page 15: PBCore, METS, PREMIS, MODS, METSRights...oh my!

METS

dmdSec

amdSec

techMD rightsMD sourceMD digiprovMD

fileSec

structMap

behaviorSec

Page 16: PBCore, METS, PREMIS, MODS, METSRights...oh my!

METS

amdSecfileSec

structMap

dmdSec techMD rightsMD

Page 17: PBCore, METS, PREMIS, MODS, METSRights...oh my!

1. Content submitted, verified

2. METS automatically generated (checksums into METS attributes)

3. Source database exports automatically converted to PBCore

4. Technical metadata extracted from files using MediaInfo, converted to PBCore

5. MODS created from completed PBCore

6. Rights metadata (METSRights), preservation metadata (PREMIS) created

7. AIP complete

AIP creation simplified

Page 18: PBCore, METS, PREMIS, MODS, METSRights...oh my!

AIPs:AIP Class 1: Nationally distributed content (Nature)

ESSENCE FILE

TYPES

METADATA

ADDITIONAL ITEMS Scripts, etc

METS PBCorePREMIS METS

RightsMODS

METS

PBCore PREMISMETS Rights

MODS

AIP Class 4: Religion and Ethics

METS

PBCore PREMISMETS Rights

MODSScripts,

etc

Production Master (mov)

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

SD Broadcast

Master (mpeg)

Production Master (mxf)

Original database exports

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mxf)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mov)

Original database exports

Original database exports

Page 19: PBCore, METS, PREMIS, MODS, METSRights...oh my!

Production Master (mov)

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

SD Broadcast

Master (mpeg)

Production Master (mxf)

SIP Class 1: WNET National

Broadcast (Nature)

SIP Class 2: WGBH National

Broadcasts

SIP Class 3: WNET Local Broadcast

(New York Voices)

SIP Class 4: Religion and Ethics

PODS PROTRACK

TEAMSINMAGICDATABASE EXPORTS

ADDITIONAL ITEMS Scripts, etc

PODSPODS

PODS

Scripts, etcPRO

TRACK

PROTRACK

INMAGICINMAGIC

TEAMS

HD Broadcast

Master (mov/data)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mxf)

Production Master (mxf)

SD Broadcast

Master (mpeg)

SD Broadcast

Master (mov/aiff/

m2v)

Production Master (mov)

Production Master (mov)

SD Broadcast

Master (mov/aiff/

m2v)

Challenge of managing

diverse

SIPs: