An introduction to open linked data for librarians Gordon Dunsire National Library of Finland,...
-
Upload
andrea-copeland -
Category
Documents
-
view
222 -
download
2
Transcript of An introduction to open linked data for librarians Gordon Dunsire National Library of Finland,...
![Page 1: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/1.jpg)
An introduction to open linked data for librarians
Gordon DunsireNational Library of Finland, Helsinki
11 December 2012
![Page 2: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/2.jpg)
Overview
Evolution of library linked dataResource Description Framework and the
Semantic WebIdentity managementSchema, mapping, interoperabilityOpen publishingUniversal Bibliographic Control
![Page 3: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/3.jpg)
Lee, T. B.
Cataloguing has a future. - Audio disc (Spoken word). - Donated by the author.
1. Metadata
In the beginning ...
... the catalogue card
![Page 4: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/4.jpg)
Author:
Title:
Content type:
Provenance:
Subject:
Lee, T. B.
Cataloguing has a future
Spoken word
Audio disc
Metadata
Donated by the author
Carrier type:
From flat-file record ...
... to relational record
Name:Biography:
...
Name authority
Term:Definition:
...
Subject authority
Bibliographic description
![Page 5: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/5.jpg)
Author:
Title:
Content type:
Provenance:
Subject:
Lee, T. B.
Cataloguing has a future
Spoken word
Audio disc
MetadataDonated by the author
Carrier type:
From flat-file description ...
... to FRBR record
Name:Biography:
...
Name authority
Term:Definition:
...
Subject authority
Bibliographic description
Item
Manifestation
Author:
Content type:
Subject:
Spoken word
Expression
Work
![Page 6: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/6.jpg)
Lee, T. B.
Metadata
From FRBR record ...
... to extinction!
Name:
Name authority
Term:
Subject authority
Item
Manifestation
Expression
Work
Provenance: Donated by the author
Subject:Author:
Title: Cataloguing has a future
Content type: Spoken word
Audio discCarrier type:Term:
RDA content type
Term:
RDA carrier type
Donor:
Title:
Amazon/Publisher
![Page 7: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/7.jpg)
Where is the record?
Implicit, not explicitEverywhere and nowhere
A “semantic” Web will allow machines to create the record just-in-timeWe will not have to maintain records just-in-case
The user will have control over the presentationI want to see an archive or library or museum or Amazon
or Google or Flickr or ? displayAnd by avoiding duplication, we can all get on with
describing new stuff ...
![Page 8: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/8.jpg)
Semantic Web
“provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries.““a Web of data” – W3C Semantic Web FAQ
Uses machine-readable metadataFaster! 24/7/365! Global!
Needs a standard machine-processable formatResource Description Framework (RDF)
![Page 9: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/9.jpg)
RDF
Resource Description FrameworkData format that supports simple, single metadata
statements known as triplesEach statement is in 3 parts
Based on description logicsubject-predicate-object statements
Also specifies relationships between thingsthing-relationship-thing statements
Can be used for navigating between, or integrating, information from multiple sources
![Page 10: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/10.jpg)
RDF triple
The title of this book is “Cataloguing”Subject of the statement = Subject: This bookNature of the statement = Predicate: (has) titleValue of the statement = Object: “Cataloguing”
This book – has title – “Cataloguing”subject – predicate – object
This presentation – has author – Gordon Dunsire
This seminar – has event place – Helsinki
![Page 11: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/11.jpg)
Identifiers
Need unambiguous way of identifying each part of the triple for efficient machine-processingHuman labels (“This book”, “has title”) no good
Same thing, different labels; different things, same label
Uniform Resource Identifier (URI)Exploits the utility of the URL
Machine-readable, regular syntax, unambiguous, global
![Page 12: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/12.jpg)
Uniform Resource Identifier
Can be any unique combination of numbers and lettersNo intrinsic meaning; it’s just an identifying label
Can look like a URLhttp://iflastandards.info/ns/isbd/elements/P1004But does not lead to a Web page (in principle ...)
RDF requires the subject and predicate of a triple to be URIsObject can be a URI, or a literal string (“Cataloguing”)
![Page 13: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/13.jpg)
RDF graph
URI:1 URI:2ThingThingRelationship
Subject ObjectPredicate
Property URI
“Literal”URI:3Property URI
![Page 14: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/14.jpg)
Linked RDF triples
“Literal”URI:3URI:B
URI:1URI:A
URI:2
URI:2
URI:CURI:3
URI:5 “stuff”“blah”
Cluster of triples
with same subject= record
Chain of triples
= linked data
![Page 15: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/15.jpg)
ex:Work1
ex:Expression1
ex:Manifestation1
ex:Item1
naf:Person1
saf:Subject1
rdacon:1013
rdacar:1004
pub:Title1
“metadata”
“spoken word”
“audio disc”
“Cataloguing has a future”
“Lee, T. B.”author name
donor
contentType
carrierType
title
term
term
term
subject
RDF graph ofcatalogue card
sameAs
![Page 16: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/16.jpg)
The hyperdimensional (Tardis) card
Lee, T. B.
Cataloguing has a future. - Audio disc (Spoken word). - Donated by the author.
1. Metadata
Audio shop
Lee MuseumSpoken word archive
W3C Library
“TARDIS four port USB hub, for office-bound Time Lords:Open a time vortex on your desk” – Pocket-lint
![Page 17: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/17.jpg)
Task: To publish local structured metadata as global linked data in the Semantic Web
So that users inside the local environment can benefit from data/information from
outside
And users outside the local environment can benefit from data/information from
inside
![Page 18: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/18.jpg)
Identifying library metadata
Assign URIs to things of interestThree types of thing
The things described in cataloguesBooks, digital resources, “works”, “manifestations”, etc.
The controlled terms used to describe themVocabularies, subject headings, classifications, etc.
The attributes of things, and relationships between thingsMetadata schema, record formats, etc.
![Page 19: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/19.jpg)
Assigning URIs
Must be unique at a global Web levelTypically in two parts:1: unique Web name
Root, base, domain, namespaceCommon to all URIs in “namespace”
E.g. http://iflastandards.info/ns/isbd/elements/Can be abbreviated in data representations
2: unique identifier in local contextLocal part
e.g. P1004
![Page 20: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/20.jpg)
Identifying library things
URI local part can be based on record numberOne record to one resource
Granularity of resource identity !!!National bibliography numbers good for
national contextsBut different libraries have copies of same
thingReflected in duplicate records!
![Page 21: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/21.jpg)
De-duplicating library things
Match local records to national recordsUsual problems and partial solutionsNeed to include authority record things as
well as bibliographic record thingsE.g. Persons, places, topics, etc.
Some local things identified by national URIs Some local things identified by local URIs
![Page 22: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/22.jpg)
Identifying library terms
Terms used for record attributesE.g. carrier and content types
URIs often assigned in Simple Knowledge Organization System (SKOS) value vocabularies
Same issues of de-duplication between local and national < international data
Additional multi-lingual and multi-cultural issues at Web scale
![Page 23: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/23.jpg)
URI:1 URI:2 is same as
URI:3 URI:4has exact match
Linking URIs
has close matchhas narrower term
Good for things(hard boundaries)
etc.
Good for terms(fuzzy boundaries)
![Page 24: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/24.jpg)
Who creates the links?
Not machines!That is, not directly …
Librarians?Other professionals?End-users?Machines!
Statistical analysis of associations (large numbers)Is less than 100 percent “accuracy” acceptable?
![Page 25: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/25.jpg)
Identifying library schema
Schema represented as RDF element setEntities/tables RDF classesAttributes/fields, relationships RDF properties
= predicatesEach class and property has own URI (from
namespace)E.g. dct:BibliographicResourceE.g. rda:carrierType
![Page 26: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/26.jpg)
Standard library element sets
Dublin Core Functional Requirements for Bibliographic Records
(FRBR) + Authority Date (FRAD), Subject Authority Date (FRSAD)
International Standard Bibliographic Description (ISBD)
Resource Description and Access (RDA) UNIMARC [2013/14] ?MARC 21 [BIBFRAME] ???
![Page 27: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/27.jpg)
Assigning element set URIs
Two scenarios:Library schema represented in RDF?
Use element set (class and property) URIsOne-to-one semantic preservation no loss of
information
Library schema not represented in RDF?Create element set for local schema Re-use URIs from other element sets …
![Page 28: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/28.jpg)
Dublin CoreBibliographic Ontology ISBD Etc.
Mapping from MARC 21 to multiple linked data element sets
![Page 29: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/29.jpg)
Mapping from local schema (MARC 21) to linked data (global) schema can be “lossy”
Some information may be lost, because the local attribute must have the same or narrower meaning as the global property
to maintain semantic coherency
Uniform title DC title Uniform title RDA manifestation title
![Page 30: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/30.jpg)
To avoid losing local information in the global Semantic Web, we should
represent the local schema as an RDF element set
British National Bibliography needs an element set for MARC 21
But MARC 21 has “messy” semantics, mixed up with syntax of tags, indicators,
and subfields
![Page 31: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/31.jpg)
>14000 properties
Not every tag, yet!
![Page 32: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/32.jpg)
Something less complicated than MARC 21:
![Page 33: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/33.jpg)
![Page 34: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/34.jpg)
Advantages of local RDF element set
Published linked data loses no informationOther communities can see the semantics and
structure of the local data schemaWhere the linked data comes from
Other communities can re-use the schemaFor their own local dataTo map from their own local schema (lossy!)
Element set can still be mapped to other elementsBibliographic Ontology, Dublin Core, ISBD, etc.
Have your cake, and eat it!
![Page 35: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/35.jpg)
Semantic reasoning: the sub-property ladder“sub-property of” is an RDF property
which links two other properties Ontological triple:
Property1 sub-property of Property2 Semantic rule:
If P1 sub-property of P2;And data triple: Resource P1 “stuff”Then data triple: Resource P2 “stuff”
![Page 36: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/36.jpg)
Ontology Data triples
dod:hasShortTitle Resource hasShortTitle “Tank”
Resource variantTitle “Tank”rda:variantTitle
dct:title Resource title “Tank”
Sub-property ladder
rdfs:subPropertyOf
rdfs:subPropertyOf
![Page 37: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/37.jpg)
Have your cake and eat it!
[You] Publish your local schema in RDF[You] Publish your local data triples using local
schema[Anyone] Publish mappings from local schema
to other, more global schema[Anyone] Publish mapped global data triples
using “reasoner” software
![Page 38: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/38.jpg)
Shrinking the silo
RDF dataset
RDF element set
RDF ontology
Data(RDBMS)
Schema(RDBMS)
Mappings(XML/XSLT)
Local silo Open Global Semantic Web
![Page 39: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/39.jpg)
Universal Bibliographic Control
Top-down approach has failedNo longer a core activity of IFLA
Not “one ring to rule them all”No one-size-fits-all global standardsNo matter how “core”, dumb, or encompassing
Virtual International Authority File (VIAF) uses bottom-up approachLocal data; global “focus” or cluster
![Page 40: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/40.jpg)
Ontological mapping & interoperability
Sub-property ladder is a powerful tool for interoperability
But every ladder rung (ontological link) must “dumb-up” and lose conflicting semanticsProperty definition broadens with super-property
Property domain/range must super-class with super-propertyOr super-property domain/range is blank
= owl:Thing
Need “unconstrained” properties
![Page 41: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/41.jpg)
rdfs:subPropertyOf
unc:“has note on use or
audience”
isbd:“has note on use or
audience”unc:“Intended audience”
rda:“Intended audience”
m21:“Target audience”
frbrer:“has intended
audience”
dct:“audience”
rdfs:subPropertyOf
m21:“Target audience of
…”
rdfs:subPropertyOf
![Page 42: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/42.jpg)
“Commons” properties
Unconstrained by domain or rangeBroad definitionCommon to bibliographic schemaConsensus?Who creates and maintains?
![Page 43: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/43.jpg)
dcterms:“extent”
commons:“extent”
rda:“extent”
rda:“extent of text”
isbd:“has specific material
designation and extent”
marc21:“Physical description”
rda:“duration”
rda:“duration (Expression)”
frbrer:“has extent of the
expression”bibo:
“numPages”
rda:“extent of text
(Manifestation)”
![Page 44: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/44.jpg)
Managing mappings
Duplicate mapsOntologies
Partial mapsFrom single mappings up
Semantic collisionsInconsistencies between maps
Map namesNamed graphs
![Page 45: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/45.jpg)
Bibliographic granularity
Aggregate resources: Collections, etc.Resource vs Work/Expression/Manifestion/Item
Or Work/Instance? [BIBFRAME] Aggregated statements: e.g. Publication
statementComponents: place, publisher, dateRepeatable: need to cluster components
Every granular level needs identification!Beware of blank nodes …
![Page 46: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/46.jpg)
Materials specified
“2001-2005”
ex:1
“Edinburgh :”
“Mudhut Publishing”
Name of publisher, distributor, etc.
Place of publication, distribution, etc.
Publicationstatement:
1Publication,Distribution, etc. (Imprint)
Materials specified
“2006”
“Edinburgh :”
“Castle Press”
Name of publisher, distributor, etc.
Place of publication, distribution, etc.
Publicationstatement:
2
Publication,Distribution, etc. (Imprint)
![Page 47: An introduction to open linked data for librarians Gordon Dunsire National Library of Finland, Helsinki 11 December 2012.](https://reader035.fdocuments.us/reader035/viewer/2022062314/56649e2d5503460f94b1d879/html5/thumbnails/47.jpg)
Bottom line: trust a librarian?
Provenance is importantAnyone can say Anything about Any thing (AAA)No intrinsic test of truth – only inconsistency
“Who said that?”Competing data from many different sources: social
networks, publishers and sellers, governments, propagandists, etc.
Library data generally of higher qualityEthos of trust, neutrality, etc.
Can we keep it that way?