On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

59
On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011

Transcript of On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Page 1: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

On Libraries & Linked Data

Antoine Isaac

UB Utrecht, April 6, 2011

Page 2: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Who am I?

• Europeana• Web & Media Lab, Vrije Universiteit Amsterdam

• W3C Library Linked Data group• (2006-2009) W3C Semantic Web Deployment group

SKOS

[email protected]

Page 3: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

DemoFollowing one’s nose to subject heading lists as linked data• American LCSH

http://id.loc.gov/authorities/sh85145447#concept

• French RAMEAUhttp://stitch.cs.vu.nl/vocabularies/rameau/ark:/12148/cb11931913j

• German SWDhttp://d-nb.info/gnd/4064689-0

• Agrovochttp://aims.fao.org/aos/agrovoc/c_8309

• STWhttp://zbw.eu/stw/descriptor/14188-0

• Further on to DBPediahttp://dbpedia.org/resource/Water

Page 4: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Demo (fallback option)

Subject heading lists as SKOS linked data• American LCSH http://id.loc.gov• French RAMEAU: http://stitch.cs.vu.nl/rameau• German SWD: http://d-nb.info/gnd/ • mapped using manual links from the MACS project

http://macs.cenl.org

Starting from http://id.loc.gov/authorities/sh85014310#concept

Page 5: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 6: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 7: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 8: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 9: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 10: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Linked Data?

1. Use URIs as names for things2. Use HTTP URIs so that people can look up those names3. When someone looks up a URI, provide useful information

using standards (RDF, SPARQL)4. Include links to other URIs, so that they can discover more

things

Tim Berners-Lee, http://linkeddata.org/

Page 11: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

(Linked) Data Representation

• That subject heading data follows a link-intensive data modelUniform resource identifiers (URI)Resource Description Framework (RDF)

Page 12: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

(Linked) Data Representation

• Use more-or-less the same standard vocabularySimple Knowledge Organization System (SKOS)http://www.w3.org/2004/02/skos/ For representing thesauri, classifications, etc. on the

Semantic Web

Page 13: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

A SKOS graphanimalscats

UF domestic catsRT wildcatsBT animalsSN used only for domestic

catsdomestic cats

USE catswildcats

Page 14: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

SKOS mappings

SKOS provides conceptual links to bridge across different contexts

KOS 1:animalscatswildcats

KOS 2:animalhumanobject

Page 15: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Links in the data

Page 16: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Links in the data

Page 17: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Growing interest for linked data in the library community

Page 18: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Linked Library Cloud beginning 2008

[Ross Singer, Code4Lib2010]

http://code4lib.org/conference/2010/singer

Page 19: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Linked Library “sector” in 2010

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 20: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Libraries and LD, the perfect match?

• Libraries have been producing (meta)data for ages• Libraries (often) produce high-quality metadata

Page 21: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Libraries and LD, the perfect match?

• Library metadata was locked in record silos• But it maintain links to the outside world

• Bibliographic and web references• Shared vocabularies• Same books!

Page 22: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Libraries and LD, the perfect match?

LD is about• Citing object• Linking to them• Re-using data

Think of web-native union catalogues

Page 23: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Johan Stapel, Koninklijke Bibliotheek (now bibliotheek.nl)

A vision for the Dutch National Library

Page 24: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

A web of cultural heritage data?

?

Page 25: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

?

Page 26: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

The current portal

Page 27: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.
Page 28: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Towards semantic search: facets

Page 29: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Building a search engine on top of metadata is difficultIntrinsic quality problems: correctness, coverage

Especially when data is so heterogeneous100s of formatsFrom flat 5-fields records to 100-nodes XML treesLanguage issue!

We currently use a simple, flat interoperability formatQuick-win quickly showing its limits

Page 30: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

We can better use institutions’ original metadata

Accommodate their different practicesData structures and semantics

Access objects via a semantic layer of vocabularies for subjects, persons, places…

Semantic ThoughtLab: experimenting solutions

Page 31: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Towards semantics-enabled searchBuilding a "semantic layer" to help accessing content

Page 32: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Towards semantics-enabled search

• Enhance access to Europeana content by semantics– Query expansion, clustering of results

• Exploiting various types of relations– "located in", "lived in", "is more specific concept"…

• Semantics are already there, in metadata and "controlled vocabularies" used in metadata– Thesauri, classifications…

• Requires to make it properly machine-accessible

Page 33: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Europeana Data Model

Trying to evolve towards RDF and Linked Data• Representing objects, persons, places, etc. as

resources• Linking and re-using external sources• (Re-using) richer data modeling features

SKOS, CIDOC-CRM, OAI-ORE

• Enabling domain-specific data profiles• Separating original data from enrichments

http://version1.europeana.eu/web/europeana-project/technicaldocuments/

Page 34: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Prototype: Europeana Thought Lab

http://europeana.eu/portal/thought-lab.html

Page 35: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Clustering of results

Page 36: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Baseline: matching concepts' label

Controlled place name from a vocabulary at the Rijskmuseum

Metadata for the object

Page 37: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

A "more specific Egypte"?

Page 38: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

A "more specific Egypte"?Metadata for the object

Page 39: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

A place more specific than the Egypt one

Semantic information on the Giza place in the Rijskmuseum Vocabulary

Page 40: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Following other relations

Page 41: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Following other relations - creator

Metadata for the object

Controlled person name from a vocabulary at the Rijskmuseum

Page 42: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Following other relations - matchInformation on Gustave Le Gray from the Rijskmuseum Vocabulary

Matched to a "Gustave Le Gray" from another Vocabulary

Page 43: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Enabling bits & pieces

Exploiting semantic links in CH vocabulariesConcept “Giza” narrower than concept “Egypte”

Mapping/alignment between CH vocabulariesLouvre’s “Égypte” equivalent to Rijksmuseum’s “Egypte”

Enrichment of existing metadataThe string “Egypt” in a metadata record indicates the concept of

Egypt defined in Rijksmuseum thesaurus

Page 44: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Challenge #1: Linking

Page 45: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Challenge #1: Linking

Manual mapping of large vocabularies is labour-intensive• LCSH, RAMEAU and SWD mapped in the MACS project

http://macs.cenl.org • SWD and DDC mapped in the CRISS-CROSS project

http://linux2.fbi.fh-koeln.de/crisscross/

Automatic linking is not perfect but can help• STW, AGROVOC…• Some studies (and further pointers) for automatic library

thesaurus alignment in the STITCH projecthttp://stitch.cs.vu.nl

Page 46: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Challenge #1: Linking

• (Semi-)automatic techniques are necessary to– Connect objects to vocabularies (esp. for legacy data)– Connect objects themselves together

• Crowdsourcing?

• Making the way librarians create metadata evolve?

Page 47: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Linking strategy for libraries?

Page 48: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Linking strategy for libraries?

• Links to library-originated sources– VIAF, LCSH, DDC, UDC, Worldcat, PND…

• Links to resources from cultural environment– Museums, archives– Scientific communities: bibliographic data & research data– Publishers– Europeana and other aggregators

Page 49: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Semantic Annotation

Page 50: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Conclusion?

• Linked Data won’t not solve everything right now

• Just a set of techniques and a vision for better sharing, cross-linking and re-use data, fitting the web

• Which is not bad!

Page 51: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

If we stop here, thanks for your attention!

Any (more) questions?

Page 52: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Some references

Page 53: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

W3C Library LD Incubator

http://www.w3.org/2005/Incubator/lld• 1-year group• OCLC, LC, VU Amsterdam, DNB, etc.

• help increase global interoperability of library data on the Web

• bringing together people involved in Linked Data—in the library community and beyond

• building on existing initiatives and collaboration tracks for the future

Page 54: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Library LDUse Cases

• LLD use cases and case studies (work in progress)

• JISC cases for open bibliographic data http://obd.jisc.ac.uk

http://www.w3.org/2005/Incubator/lld/wiki/UseCases

Page 55: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Useful vocabularies to express data• Dublin Core• SKOS• BIBO• OAI-ORE• FOAF• MADSIn progress• RDA vocabularies• FRBR@IFLACf. Linked Open VocabulariesNote: vocabularies can be combined and articulated together

dublincore.org/www.w3.org/2004/02/skos/

bibliontology.com/www.openarchives.org/ore/

www.foaf-project.org/www.loc.gov/standards/mads/rdf/

metadataregistry.org/rdabrowse.htm

labs.mondeca.com/dataset/lov/

Page 56: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Datasets

• Controlled vocabularies (thesauri, etc.)LCSH, DDC, Agrovoc, VIAF, GND

• Bibliographic dataNat. Libraries of Hungary, Sweden

• Trying to keep track of some on CKANhttp://ckan.net/group/lld

Page 57: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

In the Netherlands

• DEN, Bibliotheek.nl, KB, Vrije Universiteit Amsterdam, Beeld en Geluid, UvA Library

• Amsterdam Museum as Linked Data http://semanticweb.cs.vu.nl/lod/am/

• Dutch Culture Link http://sites.google.com/site/dclod11/

• Dublin Core 2011http://dcevents.dublincore.org/index.php/IntConf/dc-2011

Page 58: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

Pictures• http://www.europeana.eu/portal/record/

03903/8C5C6AEFF6B50DCCEDF6A23A99DD3A2D66AEB2CC.html• http://www.europeana.eu/portal/record/03912/

E9666896A50FDDE5F7F15A17C11219A7FBCBBC50.html(Europeana links give access to resources on original sites)

Page 59: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011.

First Demo pointers

• American LCSH http://id.loc.gov• French RAMEAU: http://stitch.cs.vu.nl/rameau• German SWD: http://d-nb.info/gnd/ • Agrovoc: http://aims.fao.org/ • STW: http://zbw.eu/stw/ • DBPedia: http://dbpedia.org/