Smithsonian institution libraries

83
Smithsonian Institution Libraries “Metadata Mixing & Matching For Discovery” CLSC 877 Metadata and Project Management Suzanne C. Pilsk ~ Smithsonian Institution Libraries ~ 2010

description

 

Transcript of Smithsonian institution libraries

Page 1: Smithsonian institution libraries

Smithsonian Institution Libraries“Metadata Mixing & Matching For

Discovery”

CLSC 877

Metadata and Project Management

Suzanne C. Pilsk ~ Smithsonian Institution Libraries ~ 2010

Page 2: Smithsonian institution libraries

Facts and FiguresSmithsonian Institution Libraries

– Washington, D.C.• Anacostia Museum & Center for African American History and Culture

Library

• Anthropology Library

• Botany and Horticulture Library

• The Dibner Library of the History of Science and Technology

• Freer Gallery of Art and Arthur M. Sackler Gallery Library

• Hirshhorn Museum and Sculpture Garden Library

• Joseph F. Cullman 3rd Library of Natural History

Page 3: Smithsonian institution libraries

Facts and FiguresSmithsonian Institution Libraries

Washington, D.C. (continued)• Museum Studies & Reference Library

• National Air and Space Museum Library

• National Museum of American History Library

• National Museum of Natural History Library

• National Postal Museum Library

• National Zoological Park Library

• Smithsonian American Art Museum/National Portrait Gallery Library

• Warren M. Robbins Library, National Museum of African Art

Page 4: Smithsonian institution libraries

Facts and FiguresSmithsonian Institution Libraries

• Elsewhere– Suitland, Md.

• Museum Support Center Library

• National Museum of the American Indian Library

– Edgewater, Md.

• Smithsonian Environmental Research Center Library

– New York City

• Cooper-Hewitt, National Design Museum Library

– Republic of Panama

• Smithsonian Tropical Research Institute Library

Page 5: Smithsonian institution libraries

Facts and FiguresSmithsonian Institution Libraries

• African Art

• African American History and Culture

• Anthropology

• American Art

• American History

• Asian and Middle Eastern Art

• Aviation history and Space Flight

• Design and Decorative Arts

• Environmental Management and Ecology

• History of Science and Technology

• Latino History and Culture

• Materials Research

• Modern and Contemporary Art

• Museology

• Native American History and Culture

• Natural History

• Postal History

• Tropical Biology

• Trade Literature

• World’s Fair Ephemera

Page 6: Smithsonian institution libraries

What’s So Special?

Public Museum

Smithsonian Institution is the largest museum complex in the world …

“The Nation’s Attic”

Page 7: Smithsonian institution libraries

“Increase and Diffusion of Knowledge”

Unlock the Mysteries of the Universe

Understanding and Sustaining a Biodiverse Planet

Valuing World Cultures

Understanding the American Experience

Page 8: Smithsonian institution libraries

SIL Mission(Smithsonian Directive 500)

As the largest and most diverse museum library in the world, SIL leads the Smithsonian in taking advantage of the opportunities of the digital society. SIL provides authoritative information and creates innovative services and programs for Smithsonian Institution researchers, scholars and curators, as well as the general public, to further their quest for knowledge. Through paper preservation and digital technologies, SIL ensures broad and enduring access to the Libraries’ collections for all users.

Page 9: Smithsonian institution libraries

SIL’s Strategic Plan “Focus on Service”

• GOAL 1: COLLABORATING ACROSS BOUNDARIES– SIL creates a compelling environment for connecting, collaborating and

exploring across disciplines and information boundaries

• GOAL 2: DISCOVERING INFORMATION– SIL enhances and eases the discovery of information in our collections

for SI scholars, researchers, scientists, and the larger world of learners

• GOAL 3: CONNECTING WITH USERS– SIL understands and meets user needs, serving users where they live

and work

• GOAL 4: BUILDING EXPERTISE– SIL builds expertise on information discovery, navigation and

management

• GOAL 5: ENABLING OUR MISSION– SIL ensures its success through increased financial strength, effective

administrative support, and organizational excellence

Page 10: Smithsonian institution libraries

Facts and FiguresSmithsonian Institution Libraries

Total volumes

> 1.7 million

50,000 are rare books

10,000 manuscripts

Trade Catalogs

> 500, 000 items

> 30,000 companies

dating from the 1800s

Page 11: Smithsonian institution libraries

Facts and Figures

• 102 Smithsonian Libraries Staff

• 17 Souls in Cataloging Services (with contractors)

Page 12: Smithsonian institution libraries

• Traditional Library

• Traditional Services

Page 13: Smithsonian institution libraries

Integrated Library System

Smithsonian Institution Research Information System (SIRIS)

– MARC

– AACR2r

– ISBD

– LC Classification

– LC Subject Headings

Page 14: Smithsonian institution libraries

Traditional Cataloging

• Monographs

• Serials

• Videos

• Microfilm/fiche

• Sound Recordings

• CD/DVDs

• Electronic Resources

Page 15: Smithsonian institution libraries

Traditional Cataloging

• OCLC

• Program for Cooperative Cataloging

– NACO

– SACO

– BIBCO

Page 16: Smithsonian institution libraries

SI Libraries Serves

• Curators

• Researchers

• Post-Docs

• Museum Administrators

• Public

Page 17: Smithsonian institution libraries
Page 18: Smithsonian institution libraries

How to make THIS into 0’s and 1’s

Page 19: Smithsonian institution libraries

Virtual Library defined in theOnline Dictionary for

Library and Information Science

A "library without walls" in which the collections do not exist … [in] tangible form at a physical location but are electronically accessible in digital format via computer networks. The term digital library is more appropriate because virtual (borrowed from "virtual reality") suggests that the experience of using such a library is not the same as the "real" thing when in fact the experience of reading or viewing a document on a computer screen may be qualitatively different from reading the same publication in print, but the information content is the same regardless of format.~ http://lu.com/odlis/odlis_v.cfm

Page 20: Smithsonian institution libraries

Digital Library defined in theOnline Dictionary for

Library and Information Science

A library in which a significant proportion of the resources are available in machine-readable format … . The digital content may be locally held or accessed remotely via computer networks. … In libraries, the process of digitization began with the catalog, moved to periodical indexes and abstracting services, then to periodicals and large reference works, and finally to book publishing.~ http://lu.com/odlis/odlis_v.cfm

Page 21: Smithsonian institution libraries

Traditional Digital Library

• Electronic Journals & Databases

• Digital Editions

• Online Exhibitions

• Online Catalog

• Digital Reference

Page 22: Smithsonian institution libraries
Page 23: Smithsonian institution libraries

Will they find it?

If you digitize it …

Page 24: Smithsonian institution libraries

Search Gone BAD!

Page 25: Smithsonian institution libraries

IFLA’s Functional Requirements for Bibliographic Data

To Find

To Identify

To Select

To Obtain

To USE

Page 26: Smithsonian institution libraries

Metadata

Page 27: Smithsonian institution libraries

Metadata – failure to serve

Page 28: Smithsonian institution libraries
Page 29: Smithsonian institution libraries

Metadata: MARC

MARC

110 Oscar Mayer & Co.

650 Frankfurters

Page 30: Smithsonian institution libraries

Metadata

Dublin Core

Creator:

Oscar Mayer & Co.

Subject:

Frankfurters

Page 31: Smithsonian institution libraries

02761nam 2200469 4500001000700000005001700007008004100024010002300065019001300088035001400101035002300115040006100138049002700199050001500226100004200241245019300283260008300476300001700559504033500576505015400911590010901065590009601174650002601270945002101296945007301317945003101390945004801421945004801469945004701517945007901564945004401643945004601687945004801733945007601781945004401857945005101901945005101952945007102003945009002074945009602164945003102260-459797-20050131154400.0-731129m19021933enk b 000 0 lat c- ­aagr03000069 //r582- ­a14018362- ­aABY6485LB- ­a(OCoLC)ocm00751549- ­aU.S. Dept. of Agr. Libr.­cRIU­dOCL­dCHS­dSER­dSMI­dWaOLN- ­aSMI$­aSMIM­aSMIE­aSMIB-00­aQL354­b.S5-1 ­aOscar Mayer & Co.-10­aPronto pup:­bhot dogs hamburgers/­ca Oscar Mayer and Company.- ­aNew Orleans, La. :­bBourbon Street Foods,­c2000.

Metadata: Real MARC – Still failure to serve

Page 32: Smithsonian institution libraries

Metadata: MARCXML

<?xml version="1.0" encoding="UTF-8" ?><collection xmlns="http://www.loc.gov/MARC21/slim"

xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd">

<record><leader>02761nam a2200469 4500</leader><controlfield tag="001">459797</controlfield><controlfield tag="005">20050131154400.0</controlfield><controlfield tag="008">731129m19021933enk b 000 0 lat

c</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">agr03000069 //r582</subfield></datafield>

Page 33: Smithsonian institution libraries
Page 34: Smithsonian institution libraries

- Specimen- Plate or other visual image- Taxonomic description

Page 35: Smithsonian institution libraries
Page 36: Smithsonian institution libraries

Beyond the Traditional

Taxonomic Literature Needs/Requests

• Beyond the Scan

• Beyond the Re-Keyed

• Marking up the data in metadata schemas

Page 37: Smithsonian institution libraries
Page 38: Smithsonian institution libraries
Page 39: Smithsonian institution libraries

MARC

LCSH/LCCS

ISBDFeed the cat

Pick up dry cleaning

Milk, eggs, lactaid

Make dentist appt.

AACR

Page 40: Smithsonian institution libraries

MARC

AACR

LCSH/LCCS

ISBD

Feed the cat

Pick up dry cleaning

MODSXML

Dublin Core

ONIX

METs

TEI

FRBR

Access

Hierarchical

Faceted

relatedItem

Milk, eggs, lactaid

Add hotdogs to grocery list

Dewey

XMP

RDA

Page 41: Smithsonian institution libraries

Discoverable

Interoperability

Open AccessFeed the cat

Pick up dry cleaning

Milk, eggs, lactaid

Make dentist appt.

Collaboration

Page 42: Smithsonian institution libraries
Page 43: Smithsonian institution libraries

Biodiversity Heritage Library (BHL)

Page 44: Smithsonian institution libraries
Page 45: Smithsonian institution libraries

EOL species need

CuratorRequest

“gap-fill”for other BHL library

Pull from stacks

Circ in ILS Preliminary metadata checkAnd physical check

Goin’ down the rows

serial?

“Bid”on title, select in picklist

The Stacks

Meta-datacheck

Preser-vationreview

Other library“bid” ?

Circ to cataloging for MARC editing

Update picklist if item record has been changedDuring cataloging touch-upCirc to scanner

Put on shipping cart, generate‘packinglist’ invoice

Evaluate titleNeed is…

pass

yes

pass

fail

fail

no

Carts delivered to scanner

Picklist DatabaseStores Select / reject / shipstate & suppliesitem metadatato IA

Select title in picklist,upload to monograph de-duper

Duplicate?yes

BibliographicData from SIRIS

noReject in picklist,Circ in HorizonReturn to stacks

noReject in picklist,return to stacks

Page 46: Smithsonian institution libraries

IA scanning processUnique IA id is assignedMetadata is gathered fromSIRIS and the picklist dbAnd associated with the scanJP2000s generated& transformedServed on archive.orgQA is done by IA on 10%

Books are returned, cart contents areverified against invoice

SIL does 20% QAChecking for metadata matchingWith item, scan quality etc

Updated in picklist as scannedCirc in HorizonPlace BHL sticker near barcodeReturn to Stacks

Pass QA?

BHL PortalPeriodically harvestsMarc.xml (bib) and itemRecords, along with JP2000 fromArchive.orgTo index and displayIn the portal

yesno

Update picklist to indicate rescan

Put on shipping cart, generate ‘packinglist’ Invoice, alert scanning center

Carts delivered to scanner

Download .csv from portal with SIL barcodes, Portal URLs

Send URLs to SIRISOffice for batch updates

Page 47: Smithsonian institution libraries

Ernest IngersollHand-book to the National Museum … Smithsonian Institution, 1886

Mass Scanning Workflow

BHL

•Bid Lists•Serials Management•Pick Lists•Packing Lists•Monographic Management•Local data flow•WonderFetchtm

•Return of data•Return of material•Billing

Page 48: Smithsonian institution libraries

1. Select Book ~Pull from Shelf

2. Review Physically and Metadata

3. Establish viability and create Wonderfetchtm

4. Send to IA scanning center

5. Book is scanned & QA

6. Page images loaded

7. Derivatives created

8. Book returned to library

9. Files harvested from IA portal to BHL

10. Taxonomic Intelligence Added

11. Available through BHL

BHL

Page 49: Smithsonian institution libraries

Monographic DeDuper

Page 50: Smithsonian institution libraries
Page 51: Smithsonian institution libraries
Page 52: Smithsonian institution libraries
Page 53: Smithsonian institution libraries
Page 54: Smithsonian institution libraries
Page 55: Smithsonian institution libraries
Page 56: Smithsonian institution libraries
Page 57: Smithsonian institution libraries

The BHL Portal is not a library catalog

Page 58: Smithsonian institution libraries
Page 59: Smithsonian institution libraries
Page 60: Smithsonian institution libraries
Page 61: Smithsonian institution libraries
Page 62: Smithsonian institution libraries
Page 63: Smithsonian institution libraries
Page 64: Smithsonian institution libraries
Page 65: Smithsonian institution libraries
Page 66: Smithsonian institution libraries

Circulation

Sort and Filter for Appropriate Material

Packing/ Receiving Lists

Identify at time of scan –

quality review

Ingested in to

Discovery Tools

Linked

Discovery

to Delivery

Receive NEW Data

Incorporate,

Re-Purpose, Re-Use

Hooks for Known and Unknown Re-Use

Descriptive Data

Chronology Issues

Holdings

(Badly done graphic)

Page 67: Smithsonian institution libraries
Page 68: Smithsonian institution libraries

Collections.SI.edu ~ SI Libraries

842,000 Records in ILS27,805 Trade literature

74,613 Art and Artists files4,000 SI Digital Repository

(SI Research Online)

Page 69: Smithsonian institution libraries
Page 70: Smithsonian institution libraries

Not in Collections.Si.Edu

Page 71: Smithsonian institution libraries

Collections.SI.edu ~ Freer + Sackler

53% of the ENTIRE

collection at www.asia.si.edu

& collections.si.edu

12,269 objects online

NOT: F/S G’s Study Collection – 10,872 objects only for study not for exhibit – will never go online

Page 72: Smithsonian institution libraries

Collections.SI.edu ~ NPM

12,000 Records

Collections.si.edu

16,000 Records in the ARAGO

214,000 Records in the database

6 Million objects

= 0.2% in Collections.si.edu

Page 73: Smithsonian institution libraries

Collections.SI.edu ~ NMNH

NMNH estimates 126 Million Specimens

Page 74: Smithsonian institution libraries
Page 75: Smithsonian institution libraries

Collections.SI.edu ~ NMNH

NMNH estimates 126 Million Specimens

5,400,000 Catalog Records in collection management system –

5,218,793 available on collections.nmnh.si.edu (181,207 records not available)

Page 76: Smithsonian institution libraries

Collections.SI.edu ~ NMNH

Coming soon:

IZ 992,000 (68,000 with media)

Bot 788,000 (1,300 with media)

Page 77: Smithsonian institution libraries

Collections.SI.edu ~ NMNH

NMNH estimates 126 Million Specimens

5,400,000 Catalog Records in collection management system – 5,218,793 available on

collections.nmnh.si.edu (181,207 records not available)

6 out of 10 units supplying data to collections.si.edu = 2,527,557 records

(153,418 have images)

Page 78: Smithsonian institution libraries

Collections.SI.edu

50% of the records are from 1 source

(NMNH and still growing 2,527,557 records

with 153,418 images)

4,600,000 Records

445,000 Images

40 Data sources

Page 79: Smithsonian institution libraries

SI Wide Estimations

• 136.9 MILLION objects

• 13 MILLION digital records

• 821,000 digital images

Page 80: Smithsonian institution libraries

“The worth and importance of the Institution is not to be estimated by what it accumulates within the walls of its building, but by what it sends forth to the world.”

—Joseph HenryThe Smithsonian Institution’s First Secretary

1852

Page 81: Smithsonian institution libraries
Page 82: Smithsonian institution libraries

Credits

Thanks to staff at

NMAI SIL

NMNH MBL/WHOI Library

NPM MoBot

Freer/Sackler NYBG

BHL

Page 83: Smithsonian institution libraries

Smithsonian Institution Libraries“Metadata Mixing & Matching For

Discovery”

CLSC 877

Metadata and Project Management

Suzanne C. Pilsk Smithsonian Institution Libraries

[email protected]