Beyond the Record : OCLC & the Future of MARC

Ted Fons

Director, WorldCat Global Metadata Network

CCS Forum, ALA - Chicago

July 11, 2009

Beyond the Record: OCLC and the Future of MARC

• The OCLC Context

• OCLC’s Role in RDA

• Beyond MARC

• Beyond the Record

The OCLC Context

• A membership organization

• Diverse membership

69,826 libraries in 112 countries

The OCLC Cooperative

[Map: member libraries by region. Regional counts shown on the map (region labels not recoverable): 1,355; 55,284; 882; 5,639; 4,253; 1,015; 320; 1,080]

OCLC’s Role with RDA

OCLC & RDA

• Committee contribution:

  • Ex-officio membership in the ALA Committee on Cataloging: Description and Access

  • MARC Advisory Committee

• Staff participation:

  • Joint Steering Committee's two RDA Examples Groups

  • RDA/MARC Working Group

• Representation on: ALA ALCTS RDA Implementation Task Force

• Various program sessions

OCLC & RDA

• OCLC internal activities:

  • Discussions with the three U.S. national libraries to plan for the testing/evaluation period (late 2009)

  • Planning for MARC21 format changes to support the testing/evaluation period

  • OCLC Contract Services staff have been selected to participate in the testing/evaluation period

Beyond MARC21

With thanks to Jean Godby of OCLC Research

The Crosswalk Web Service at OCLC

• Enables OCLC to translate from one metadata format to another.

  • A “metadata format” is a triple that consists of a metadata schema, a structural encoding, and a character encoding.

  • Supported standards are bibliographic, but the software can handle other types of data.

• Can be called from any product or service that processes metadata.

• A version with a slightly different interface resides on the OCLC Enterprise Bus.
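The "triple" definition above can be made concrete with a small data structure. The following is a minimal sketch, not the Crosswalk Web Service's actual data model: the class name, field names, and the sample values (including the character encodings chosen for each format) are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetadataFormat:
    """Hypothetical model of a metadata format as a (schema, structure, charset) triple."""
    schema: str     # the metadata schema, e.g. "MARC 21"
    structure: str  # the structural encoding, e.g. "ISO 2709" or "XML"
    charset: str    # the character encoding, e.g. "MARC-8" or "UTF-8"


# Illustrative values only; the slides do not list the triples OCLC registers.
marc_2709 = MetadataFormat("MARC 21", "ISO 2709", "MARC-8")
marc_xml = MetadataFormat("MARC 21", "XML", "UTF-8")
dc_xml = MetadataFormat("DC Terms", "XML", "UTF-8")


def same_schema(a: MetadataFormat, b: MetadataFormat) -> bool:
    """True when two formats differ at most in structural or character encoding."""
    return a.schema == b.schema
```

Under this view, MARC 21 in ISO 2709 and MARC 21 in XML are two distinct formats that share a schema, so converting between them is a change of encoding rather than a semantic translation.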

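[Figure: metadata formats shown around CDF, in two groups]

• First group: MARC 21-2709, OCLC MARC, OCLC CDF, MARC XML, DC XML, DC-Qualified, MODS, ONIX Books

• Second group: MARC 21-2709, DC XML, OAI-DC XML, OAI-PMH XML, ONIX Serials, MARC XML, MODS, ONIX Books, DC-Qualified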

Inputs and outputs

Data flow for a single translation

Example: MARC21 to Dublin Core via CDF

1. MARC input, in either encoding:

ISO 2709:

    522 $a northwest

MARC XML:

    <datafield tag="522">
      <subfield code="a">northwest</subfield>
    </datafield>

2. Convert to input structure:

    <record>
      <header>
        <schema name="marc21" namespace="uri:marc:21"/>
      </header>
      <field name="522">
        <field name="a">
          <value>northwest</value>
        </field>
      </field>
    </record>

3. Translate to DC Terms:

    <record>
      <header>
        <schema name="DC-Terms" namespace="uri:DC-Terms"/>
      </header>
      <field name="spatial">
        <value>northwest</value>
      </field>
    </record>

4. Convert to output structure (DC Terms output):

    <?xml version="1.0" encoding="UTF-8"?>
    <qualifieddc xmlns:dcterms="http://purl.org/dc/terms/">
      <dctermsset>
        <dcterms:spatial>northwest</dcterms:spatial>
      </dctermsset>
    </qualifieddc>
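The data flow on this slide (read the input encoding, translate in a neutral intermediate form, write the output encoding) can be sketched in a few lines of Python. This is an illustration under stated assumptions, not OCLC's implementation: the real service expresses translation logic in Seel, whose syntax is not shown in these slides. The function names, the dictionary-based intermediate form, and the one-entry mapping table are all hypothetical. The input fragment carries no XML namespace, as on the slide; real MARC XML records normally do.

```python
import xml.etree.ElementTree as ET

DCTERMS_NS = "http://purl.org/dc/terms/"

# Hypothetical one-entry crosswalk: (MARC tag, subfield code) -> DC Terms element.
MARC_TO_DCTERMS = {("522", "a"): "spatial"}


def marcxml_to_intermediate(marcxml: str) -> list:
    """Step 1: read a MARC XML datafield into a schema-neutral list of fields."""
    datafield = ET.fromstring(marcxml)
    tag = datafield.get("tag")
    return [
        {"field": tag, "subfield": sf.get("code"), "value": (sf.text or "").strip()}
        for sf in datafield.findall("subfield")
    ]


def translate(fields: list) -> list:
    """Step 2: map each MARC field to its DC Terms element; unmapped fields are dropped."""
    translated = []
    for f in fields:
        target = MARC_TO_DCTERMS.get((f["field"], f["subfield"]))
        if target is not None:
            translated.append({"field": target, "value": f["value"]})
    return translated


def intermediate_to_dcterms_xml(fields: list) -> str:
    """Step 3: serialize the translated fields as namespaced DC Terms XML."""
    ET.register_namespace("dcterms", DCTERMS_NS)
    root = ET.Element("qualifieddc")
    for f in fields:
        element = ET.SubElement(root, "{%s}%s" % (DCTERMS_NS, f["field"]))
        element.text = f["value"]
    return ET.tostring(root, encoding="unicode")


marc_input = '<datafield tag="522"><subfield code="a">northwest</subfield></datafield>'
dc_output = intermediate_to_dcterms_xml(translate(marcxml_to_intermediate(marc_input)))
print(dc_output)
```

Keeping the mapping table separate from the reading and writing steps reflects the design point of the slide: only step 2 knows about both schemas, so a new input or output encoding needs a new reader or writer but no change to the translation logic.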

In sum…

• The Crosswalk Web Service is engineered for reusability.

• It is abstract enough to handle any kind of metadata markup.

• It keeps a close connection between human-generated translation logic and executable code.

• It is flexible enough to handle many use cases.

Adoptions

The Crosswalk Web Service has been incorporated into:

• Connexion Client 2.0

• CONTENTdm Ingest

• Data Load Enhancement

• eSerials, eSweep

• NetLibrary

• Next Generation Cataloging

Adoption is being studied for components of:

• Digital Collection Gateway

• WorldCat Cataloging Partners

• NCIP (NISO Circulation Interchange Protocol)

It is being used in research projects:

• Art and natural history museum metadata (with RLG partners)

• ISO 8459 bibliographic message exchange (with Janifer Gatenby)

Future priorities

1. Develop a user interface that accepts translation logic and automatically generates Seel scripts.

2. Streamline and enhance some of the Seel language features.

3. Investigate ways to interoperate with the crosswalking software developed at OCLC Leiden.

4. Develop translations for non-bibliographic metadata.

For more information

1. Metadata translation at OCLC, pre-CWS

• A Survey of Metadata Translation Activity at OCLC

2. CWS documentation

• The Crosswalk Web Service Users’ Guide

• The Seel tutorial: Introduction; Seel in a Nutshell

3. Research reports

• Encoding Application Profiles in a Computational Model of the Crosswalk

• Toward element-level interoperability in bibliographic metadata

• A Repository of Metadata Crosswalks

• Two Paths to Interoperable Metadata

Beyond the Record

With thanks to Diane Vizine-Goetz of OCLC Research

WorldCat Identities

FRBR Entity Levels Revisited

[Figure: FRBR entity levels diagram. Recoverable labels: "The movie", "Original Version"]

Based on a graphic in Tillett, Barbara, "AACR2's Strategic Plan and IFLA Work towards an International Cataloguing Code" (2002)

OCLC FRBR Work-set Algorithm

Provides a FRBR-based view of the data

1. Records clustered into works using author and title fields from bibliographic and authority records

2. Author names and titles normalized to construct a work key

3. All records with the same key are grouped together in a work set or cluster
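The three steps above can be sketched as follows. This is a simplified illustration, not OCLC's algorithm: the slides do not give the normalization rules, and the real process also draws on authority records, which this sketch omits. The normalization used here (strip diacritics, lowercase, collapse punctuation) and the sample records are assumptions made up for the example.

```python
import re
import unicodedata
from collections import defaultdict


def normalize(text: str) -> str:
    """Assumed normalization: drop diacritics, lowercase, collapse non-alphanumerics."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_marks = "".join(c for c in decomposed if not unicodedata.combining(c))
    return re.sub(r"[^a-z0-9]+", " ", without_marks.lower()).strip()


def work_key(author: str, title: str) -> str:
    """Step 2: build a work key from the normalized author name and title."""
    return normalize(author) + "/" + normalize(title)


def cluster_into_work_sets(records: list) -> dict:
    """Steps 1 and 3: group record ids that share the same work key."""
    work_sets = defaultdict(list)
    for record in records:
        key = work_key(record["author"], record["title"])
        work_sets[key].append(record["id"])
    return dict(work_sets)


# Made-up sample records for illustration.
records = [
    {"id": "rec1", "author": "Melville, Herman", "title": "Moby Dick"},
    {"id": "rec2", "author": "MELVILLE, HERMAN.", "title": "Moby-Dick"},
    {"id": "rec3", "author": "Austen, Jane", "title": "Emma"},
]
work_sets = cluster_into_work_sets(records)
```

Here the two Melville records differ in case and punctuation but normalize to the same key, so they fall into one work set, while the Austen record forms its own.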

Share data elements across a FRBR Work Set

Work pages beta

Provides a rich context from cataloging data

Ted Fons

Director, WorldCat Global Metadata Network

fonst@oclc.org
