Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Data processing for digital libraries: the experience of the BnF with Europeana Sounds project. Anila Angjeli, Bertrand Caron, Emmanuelle Bermes. Satellite meeting "Data in libraries: the big picture" – Chicago, 10 August 2016

Transcript of Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Page 1: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Data processing for digital libraries

the experience of the BnF with Europeana Sounds project

Anila Angjeli, Bertrand Caron, Emmanuelle Bermes

Satellite meeting "Data in libraries: the big picture" – Chicago, 10 August 2016

Page 2: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Europeana Sounds: improving access to Europe’s digital audio archives

Page 3: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project


7 National Libraries

5 Archive & Research Centres

2 other Public Bodies

4 Non-profit Organisations

3 Universities

3 Companies

24 organisations from 12 countries

Page 4: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project


Improving access and user experience

Page 5: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project


The BnF use case

Page 6: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

BnF sound recordings collection

Page 7: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Digital object (disk 3)

Digital object (disk 2)

Digital object (disk 1)

Bibliographic record

Contents note

Analytic sub-record

Analog item

Authority record

Collection record

InterXMarc file (XML)

Digital item

Table of contents (XML)

Information from authority record

Information from collection record

Table of contents (XML)

Table of contents (XML)

Descriptive record in DC

BnF input data: cultural heritage objects and digital representations

Page 8: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Metadata workflow: iterative processing

Content selection

Metadata curation

Import in MINT

Mapping to EDM

Preview on MINT

Transformation

Transformation reports

Feedback to metadata creators

Review of mapping

Metadata extraction

Analysis of the dataset

MINT environment

Publication on Europeana portal

Page 9: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Europeana Data Model (EDM): core classes

Reuses classes and properties from existing ontologies (in addition to EDM specific ones)

Page 10: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Privileging the user experience

<ore:Aggregation rdf:about="http://…/data/sounds/Aggregation_ark:/12148/cb385163820">
  <edm:aggregatedCHO rdf:resource="http://…/ark:/12148/cb385163820"/>
  <edm:isShownAt rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113"/>
  <edm:isShownBy rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f1.audio"/>
  <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f2.media"/>
  <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f1.highres"/>
  <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f2.highres"/>
</ore:Aggregation>

Object in institutional context

Web resources: digital components of the CHO

Direct link to (part of the) content, on Europeana

Original Cultural Heritage Object
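
As an illustration of how the aggregation on this slide ties one cultural heritage object to its web resources, here is a minimal Python sketch that parses the fragment and groups the links by EDM property. The wrapping `rdf:RDF` element, the `ore` namespace URI and the function name `links_by_property` are assumptions added so the fragment can be parsed; the elided host (`http://…/`) is kept exactly as on the slide.

```python
import xml.etree.ElementTree as ET

# Namespace URIs: rdf and edm appear elsewhere in the deck; the ore URI is
# the standard OAI-ORE terms namespace (assumed, not shown on the slide).
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "edm": "http://www.europeana.eu/schemas/edm/",
    "ore": "http://www.openarchives.org/ore/terms/",
}

# The slide's aggregation, wrapped in an rdf:RDF root (assumption) so that
# the prefixes are declared. Elided hosts are left as in the source.
AGGREGATION = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:edm="http://www.europeana.eu/schemas/edm/"
         xmlns:ore="http://www.openarchives.org/ore/terms/">
  <ore:Aggregation rdf:about="http://…/data/sounds/Aggregation_ark:/12148/cb385163820">
    <edm:aggregatedCHO rdf:resource="http://…/ark:/12148/cb385163820"/>
    <edm:isShownAt rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113"/>
    <edm:isShownBy rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f1.audio"/>
    <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f2.media"/>
    <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f1.highres"/>
    <edm:hasView rdf:resource="http://gallica.bnf.fr/ark:/12148/bpt6k1279113/f2.highres"/>
  </ore:Aggregation>
</rdf:RDF>
"""


def links_by_property(xml_text):
    """Return {local property name: [resource URIs]} for the first aggregation."""
    root = ET.fromstring(xml_text.strip())
    aggregation = root.find("ore:Aggregation", NS)
    resource_attr = "{%s}resource" % NS["rdf"]
    links = {}
    for child in aggregation:
        # Tags look like "{namespace}localName"; keep only the local name.
        local_name = child.tag.split("}", 1)[1]
        links.setdefault(local_name, []).append(child.get(resource_attr))
    return links


links = links_by_property(AGGREGATION)
for prop, uris in links.items():
    print(prop, len(uris))
```

Grouping by property makes it easy to check that a record has exactly one `edm:isShownBy` and `edm:isShownAt` link and to count the additional `edm:hasView` resources.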

Page 11: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Generating "Contextual classes"

Page 12: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Mapping enrichment

Import in MINT

Partial mapping into EDM within MINT

Download of XSLT

Enrichment of XSLT outside MINT

Upload of enriched XSLT

Enriched mapping

Transformation

MINT environment

Page 13: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Highly structured metadata format

245 1. $a La |Marseillaise $f Rouget de l'Isle, comp.

$c La Brabançonne $f F. Van Campenhout, comp.

<datafield tag="245" ind1="1" ind2=" "><subfield code="a" Barre="3">La Marseillaise</subfield><subfield code="f">Rouget de l’Isle, comp.</subfield><subfield code="c">La Brabançonne</subfield><subfield code="f">F. Van Campenhout, comp.</subfield>

</datafield>

La Marseillaise ; . La Brabançonne : . , / Rouget de l'Isle, comp.F. Van Campenhout, comp. ;

Source format INTERMARC

Expressed in XML (InterXMarc)

Transformation - round 1

Page 14: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

<datafield tag="245" ind1="1" ind2=" "><subfield code="a" Barre="3">La Marseillaise</subfield><subfield code="f">Rouget de l’Isle, comp.</subfield><subfield code="c">La Brabançonne</subfield><subfield code="f">F. Van Campenhout, comp.</subfield>

</datafield>

InterXMarc

Transformation - round 2

<dc:title>La Marseillaise / Rouget de l'Isle, comp.</dc:title><dc:title>La Brabançonne / F. Van Campenhout, comp.</dc:title>

La Marseillaise / Rouget de l'Isle, comp. and La Brabançonne / F. Van Campenhout, comp.
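
The round 2 result can be sketched in Python as follows. This is not the XSLT used in the project, only an illustration of the pairing logic it implies: each title subfield is combined with the statement of responsibility that follows it. Treating `$a` and `$c` as title subfields and `$f` as the responsibility subfield is an assumption read off this one example, and the names `titles_from_245`, `TITLE_CODES` and `RESPONSIBILITY_CODE` are hypothetical.

```python
import xml.etree.ElementTree as ET

# The InterXMarc field from the slide, on one line.
FIELD_245 = (
    '<datafield tag="245" ind1="1" ind2=" ">'
    '<subfield code="a" Barre="3">La Marseillaise</subfield>'
    '<subfield code="f">Rouget de l’Isle, comp.</subfield>'
    '<subfield code="c">La Brabançonne</subfield>'
    '<subfield code="f">F. Van Campenhout, comp.</subfield>'
    '</datafield>'
)

# Assumption based on this example only: $a and $c each start a new title,
# $f carries the statement of responsibility for the title before it.
TITLE_CODES = {"a", "c"}
RESPONSIBILITY_CODE = "f"


def titles_from_245(xml_text):
    """Return one 'title / responsibility' string per title subfield."""
    field = ET.fromstring(xml_text)
    titles = []
    current = None
    for subfield in field.findall("subfield"):
        code = subfield.get("code")
        text = (subfield.text or "").strip()
        if code in TITLE_CODES:
            # A new title begins: store the previous one, if any.
            if current is not None:
                titles.append(current)
            current = text
        elif code == RESPONSIBILITY_CODE and current is not None:
            current = current + " / " + text
    if current is not None:
        titles.append(current)
    return titles


titles = titles_from_245(FIELD_245)
for title in titles:
    print("<dc:title>" + title + "</dc:title>")
```

Walking the subfields in document order, rather than collecting all titles and then all responsibilities, is what avoids the scrambled round 1 string.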

Page 15: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

SPAR

<rdf:RDF
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
  xmlns:dcterms="http://purl.org/dc/terms/"
  xmlns:dc="http://purl.org/dc/elements/1.1/"
  xmlns:ns4="http://www.europeana.eu/schemas/edm/">

<rdf:Description rdf:about="http://data.bnf.fr/ark:/12148/cb30373095k">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/ProvidedCHO" />
  <dcterms:extent>In-8° , 523 p., fig. et pl., portrait de Dumas</dcterms:extent>
  <dcterms:issued>1846</dcterms:issued>
  <dc:title>Les Trois mousquetaires, par M. Alexandre Dumas</dc:title>
  <dc:publisher>Paris : J.-B. Fellens et L.-P. Dufour , 1846</dc:publisher>
</rdf:Description>

<rdf:Description rdf:about="ark:/12148/bpt6k61336787/f1.version0.release0">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/WebResource" />
</rdf:Description>

<rdf:Description rdf:about="ark:/12148/bpt6k61336787/f10.version0.release0">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/WebResource" />
</rdf:Description>

Building EDM with SPARQL cross-querying different repositories
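
The slide does not show the SPARQL queries themselves, so the sketch below only inspects the kind of RDF/XML output shown here: it groups the described resources by `rdf:type`, which is a quick way to check that a result contains one `edm:ProvidedCHO` and its `edm:WebResource` descriptions. The sample is abridged from the slide, the closing `</rdf:RDF>` tag is added so it parses, and the function name `resources_by_type` is hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
EDM = "http://www.europeana.eu/schemas/edm/"

# Abridged from the slide; the closing rdf:RDF tag is added for parsing.
SAMPLE = """<rdf:RDF
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:dcterms="http://purl.org/dc/terms/"
  xmlns:dc="http://purl.org/dc/elements/1.1/">
<rdf:Description rdf:about="http://data.bnf.fr/ark:/12148/cb30373095k">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/ProvidedCHO" />
  <dcterms:issued>1846</dcterms:issued>
  <dc:title>Les Trois mousquetaires, par M. Alexandre Dumas</dc:title>
</rdf:Description>
<rdf:Description rdf:about="ark:/12148/bpt6k61336787/f1.version0.release0">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/WebResource" />
</rdf:Description>
<rdf:Description rdf:about="ark:/12148/bpt6k61336787/f10.version0.release0">
  <rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/WebResource" />
</rdf:Description>
</rdf:RDF>"""


def resources_by_type(rdf_xml):
    """Map each rdf:type URI to the list of rdf:about identifiers carrying it."""
    root = ET.fromstring(rdf_xml)
    grouped = defaultdict(list)
    for description in root.findall("{%s}Description" % RDF):
        about = description.get("{%s}about" % RDF)
        for type_element in description.findall("{%s}type" % RDF):
            grouped[type_element.get("{%s}resource" % RDF)].append(about)
    return dict(grouped)


grouped = resources_by_type(SAMPLE)
for type_uri, identifiers in grouped.items():
    print(type_uri, len(identifiers))
```

This uses only the standard library and treats the result as plain XML; a real pipeline would more likely load it into an RDF library before validating it against EDM.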

Page 16: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Outcome: new skills, new work methods, interactions

Metadata librarian WITH advanced technology skills

Metadata librarian WITHOUT advanced technology skills

Sound recordings curator

Page 17: Data processing for digital libraries: the experience of the BnF with Europeana Sounds project

Thank you!