Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

24
World Library and Information Congress: 77th IFLA General Conference and Assembly. Semantic Web Special Interest Group. Puerto Rico. August 17 Linked Data at the BNE. Departamento de Proceso Técnico Elena Escolano Rodríguez – Jefa Servicio de Coordinación y Normalización Daniel Vila Suero – Universidad Politécnica de Madrid

description

Presentada en "World Library and Information Congress: 77th IFLA General Conference and Assembly. Semantic Web Special Interest Group. 17 de agosto. Puerto Rico

Transcript of Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Page 1: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

World Library and Information Congress: 77th IFLA General Conference and Assembly. Semantic Web Special Interest Group. Puerto Rico. August 17

Linked Data at the BNE. Departamento de Proceso Técnico

Elena Escolano Rodríguez – Jefa Servicio de Coordinación y NormalizaciónDaniel Vila Suero – Universidad Politécnica de Madrid

Page 2: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE 2

Indice

01 Project background02 BNE Standards and Ontology Selection03 Data source set: Cervantes and surrounding data04 RDF Modelling05 URI Design06 Process Overview07 Main activities08 Process overview09 Process steps10 Main activities11 Some results

Page 3: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Project background

Cooperation project between BNE and national and regional libraries of Spain

GOAL: Create a Unified Authority System (similar to VIAF approach)

MAIN ISSUE: Multilinguality

Various approaches were analysed and tested, but were notsuccesful

Page 4: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Project background

Around January 2011 the joint project “Preliminarystudy of Linked Data” between the BNE and OEG starts

Page 5: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

BNE Standards and Ontology selection

Page 6: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Data source set: Cervantes and surrounding data

MARC data selection involved 3 phases:

Phase 1: Authority records: Cervantes + each record that containedCervantes as author (author-title, author-title-lang, etc.) –> 550 records

Bibliographic records: Associated to selected set ofauthoritities 8552 records

Phase 2:Authority records: Records associated to selected set ofbibliographic records in Phase 1 7351 records

Phase 3:Authority records: Authority records related within anyfield with selected set in Phase 2. (Mainly themas andworks and expressions from authors of phase 2) 53000 records

6

Page 7: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

RDF Modelling

7

frbr:Manifestation

frbr:Person

frbr:Work

Frbr:Expression

frsad:Thema

ISBD Elements

MARC AUTHORITY

RECORD

FRBR

FRAD

MARC BIBLIOGRAPHIC

RECORD

Page 8: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

URI Design

Followed Cool URIs and Linked Data patterns (Natural keys)

A-Box:http://cultura.linkeddata.es/BNE/resource/<Class>/<ID>

T-Box:Opaque URIs

Multilingual labels (EN, ES, HR..)

Available at metadataregistry.org

Base URI: http://iflastandards.info/ns/

Not published, will be published soon not dereferenceable

Page 9: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Process Overview

OBJECTIVES: 1. Find a systematic and repeteable transformation methodology

2. Design and implementation of lifecycle supporting tools(mapping, cleaning, transformation, linkage).

3. Proove applicability of IFLA RDF/OWL models

Lifecycle Process : Iterative and incremental

Joint effort from two different worlds: Libraries and SemanticWeb (Linked Data)

9

Page 10: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Main activities

1. Data analysis:Understand records’ organization and structure(Authorities and bibliographic)

Development of analytical tools (fields and subfieldscombination reports, etc.)

2. Mapping MARC21 to chosen ModelsVery complex process

ASSET: Tool for mapping templates generation, withanalytical data from input records

3. Data transformation to RDF:Ad-hoc transformation tool

10

Page 11: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Process overview

11

Page 12: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

PROCESS STEPS

MAPPINGDEVELOPMENT

INSTANCESINTERLINKING

INSTANCESANOTATION

INSTANCESGENERATION

RECORDSANALYSIS

PUBBY

4STORECSV

1

2 3

45

USER INTERFACE

6

Page 13: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Main activities

4. Cultura.linkeddata.es domain for CH resources

5. RDF publishing:Virtuoso Server

Pubby

6. Linkage to other datasets:Phase 1: VIAF y other libraries (BL, DNB, Libris Sweden)

Phase 2: DBPEDIA, Geo, etc.

13

Page 14: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Some results

14

Page 15: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

WORK is in Progresshttp://cultura.linkeddata.es/visualizer/

Page 16: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 17: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 18: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 19: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 20: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 21: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 22: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 23: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE

Page 24: Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero

Linked Data at the BNE 24

Elena Escolano RodríguezDepartamento de Proceso Té[email protected]

Pº de Recoletos 20 -22 28071 Madrid EspañaT +34 915 807 800

www.bne.es