ER Final Conference - Managing a European Project: The Example of Europeana Regia
TEI metadata as source to Europeana Regia – practical example and future challenges
Stefanie Gehrke, 05.10.2013

Content
• Motivation
• Reference transformation
• Technical details
• TEI as a source
• Semantic approach
• Conclusion

Motivation
Europeana Regia (funded by the European Commission, ICT PSP)
• 30-month effort, 2010-2012
• 5 participating libraries
• Over 1000 digitised manuscripts
• Arranged in 3 virtual collections
• 1 virtual exhibition
• Available through Europeana and the TEL portal

Motivation
Challenges with respect to metadata ingestion
• Multiple sources with different depth/quality
• Varying technical expertise
Goals for metadata ingestion
• Harmonized display in Europeana and the TEL portal
• High quality of the information given

Motivation
Objectives:
• Project-specific standards for the ESE metadata fields
• Optional support for the participating libraries
• A reference transformation to allow quality assurance

Reference transformation
What is meant by reference transformation?
• A transformation that includes all rules
• A transformation that is not part of the production environment's collection of transformations
• A tool for quality assurance:
  • Explicit for the output metadata
  • Implicit for the input metadata

Reference transformation
How to build a reference transformation?
• Single output module for all input formats
• Clearly structured
• A lot of documentation / comments
• Implemented in an easily accessible form as an XSL transformation
• Usable not only by experts
• Easy to maintain

Reference transformation – Mapping
…

Technical details
Standard XSL code:
• one purpose
• one format / variation
• recursive

Technical details
Standard XSL code: small differences → many new transformations

Technical details
Reference transform
• Modularity
[Diagram labels: xsl:include, modules, localization]

Technical details
Reference transform
• Modularity: adding a format = 1 line + 1 module
• Consistency

Technical details
Differentiating templates by input format: templates are overloaded via the mode attribute
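
The modular layout described on the technical slides (one input module per format, a single shared output module, selection by input format) can also be sketched outside XSLT. The following is a minimal Python illustration under stated assumptions, not the project's XSL code: the function names, the `READERS` table, the flat `<record>` input format and the location of the title inside `msItem` are all hypothetical; only the TEI and Dublin Core namespace URIs are real.

```python
import xml.etree.ElementTree as ET

T = "{http://www.tei-c.org/ns/1.0}"              # TEI namespace in ElementTree notation
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)               # serialise Dublin Core with the dc: prefix


def read_tei(root):
    """Input module for TEI: first msItem title (assumed location)."""
    title = root.findtext(f".//{T}msItem/{T}title", default="")
    return {"title": " ".join(title.split())}


def read_flat(root):
    """Input module for a hypothetical flat format with a <title> child."""
    title = root.findtext("title", default="")
    return {"title": " ".join(title.split())}


# Adding a format = one new reader function plus one line in this table.
READERS = {
    T + "TEI": read_tei,
    "record": read_flat,
}


def write_output(fields):
    """Single output module shared by every input format."""
    record = ET.Element("record")
    ET.SubElement(record, "{%s}title" % DC_NS).text = fields["title"]
    return ET.tostring(record, encoding="unicode")


def transform(xml_text):
    root = ET.fromstring(xml_text)
    reader = READERS[root.tag]   # dispatch on the input format, comparable to an XSLT mode
    return write_output(reader(root))
```

Because every reader returns the same intermediate dictionary, the output side stays identical for all sources, which is the consistency argument made on the slides.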

TEI as source
Ambiguities
• dc:creator = author (attention: unique list!)
• dc:format = page dimensions
• dcterms:extent = number of pages (with or without msPart)
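
The three mapping decisions on the ambiguities slide can be illustrated with a short Python sketch. It is an assumption-laden example rather than the project's mapping: the TEI encoding used here (`author` elements, `dimensions` with `height`/`width`, and a `measure` with `unit="leaf"` and `quantity`) is one plausible encoding, and the function names and output strings are hypothetical.

```python
import xml.etree.ElementTree as ET

T = "{http://www.tei-c.org/ns/1.0}"  # TEI namespace in ElementTree notation


def unique(values):
    """Drop duplicates while keeping first-seen order."""
    return list(dict.fromkeys(values))


def leaf_count(node):
    """Leaf count from extent/measure below node (assumed encoding), else 0."""
    measure = node.find(f".//{T}extent/{T}measure[@unit='leaf']")
    return int(measure.get("quantity")) if measure is not None else 0


def map_fields(tei_xml):
    root = ET.fromstring(tei_xml)

    # dc:creator: every author once, even if named in several items or parts.
    authors = unique(" ".join("".join(a.itertext()).split())
                     for a in root.iter(T + "author"))

    # dc:format: page dimensions, taken from the first dimensions element.
    dims = root.find(f".//{T}dimensions")
    page_format = None
    if dims is not None:
        height = dims.findtext(T + "height")
        width = dims.findtext(T + "width")
        page_format = f"{height} x {width} {dims.get('unit', '')}".strip()

    # dcterms:extent: sum over msPart elements if present, else the whole manuscript.
    parts = list(root.iter(T + "msPart"))
    total = sum(leaf_count(p) for p in parts) if parts else leaf_count(root)

    return {"dc:creator": authors,
            "dc:format": page_format,
            "dcterms:extent": f"{total} leaves"}
```

The sketch assumes that msPart elements are not nested; a nested composite manuscript would need a recursive count.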

TEI as source
Multiple TEI input formats are possible, but for Europeana Regia
• a single export file format was chosen
• ENRICH compliance was preferred
TEI advantages:
• Highly structured tagging
• Multi-language support
• Encouraged use of identifiers / authority data

Results
Homogeneous display for different sources

2012: Europeana towards a semantic approach
ESE (Europeana Semantic Elements) data model
• describes one single Cultural Heritage Object (CHO)
• has a well-defined but limited number of elements
• puts the emphasis on retrieval of the CHO
The next step is the semantic EDM (Europeana Data Model), which in 2012 adds the relations between CHOs, and between a CHO and concepts, agents and places.

Semantic approach - TEI as source to EDM - Observations
What is already there?
• Use of authority data and identifiers (essential!)
  • Otherwise automatic routines turn, for example, the French monastery Wissembourg into a Romanian town
• Multilingual data structures
What is still missing?
• Inter-CHO links

Semantic approach - TEI as source to EDM
The transformation works well for the intra-CHO data:

Other semantic approaches than EDM
While EDM represents a kind of “coarse grain” semantics
• Linking CHOs, agents, locations, concepts, …
• Generalization of ESE
there is also some kind of “fine grain” semantics
• Semantic description of the inner structure of a CHO and its digital representation, e.g. Shared Canvas (IIIF)
And of course holistic approaches like CIDOC-CRM, FRBRoo …

Semantic approach - TEI as source to Shared Canvas
What is already there?
• Some “fine grain” content, especially academic annotations, transcripts, chapters, …
• Use of authority data and identifiers
• Multilingual data structures
• Links to images (digital facsimile)
What is missing?
• Explicit information about the digital facsimile/images, such as technical description, format (“jp2”), viewing direction (“right-to-left”), viewing hint (“paged”)

Semantic approach - Shared Canvas
Transformation modules can be re-used for other semantic formats, e.g. a Shared Canvas manifest serialized as JSON-LD.
[Figure labels: re-used modules; SC - Manifest]
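
To make the manifest idea concrete, here is a minimal Python sketch that assembles a Shared Canvas style manifest and serializes it as JSON-LD. It is an illustration under stated assumptions, not the output of the project's transformation: the key names follow the later IIIF Presentation API 2 context (which postdates this talk), and `build_manifest`, its parameters and all `example.org` URIs are hypothetical. It includes the facsimile details listed as missing in TEI: format, viewing direction and viewing hint.

```python
import json


def build_manifest(base_uri, label, pages,
                   viewing_direction="left-to-right", viewing_hint="paged"):
    """Build a manifest dict; pages is a list of (label, image_uri, width, height)."""
    canvases = []
    for index, (page_label, image_uri, width, height) in enumerate(pages, start=1):
        canvas_id = f"{base_uri}/canvas/{index}"
        canvases.append({
            "@id": canvas_id,
            "@type": "sc:Canvas",
            "label": page_label,
            "width": width,
            "height": height,
            # One painting annotation links the image to its canvas.
            "images": [{
                "@type": "oa:Annotation",
                "motivation": "sc:painting",
                "resource": {"@id": image_uri,
                             "@type": "dctypes:Image",
                             "format": "image/jp2"},
                "on": canvas_id,
            }],
        })
    return {
        "@context": "http://iiif.io/api/presentation/2/context.json",
        "@id": f"{base_uri}/manifest",
        "@type": "sc:Manifest",
        "label": label,
        "viewingDirection": viewing_direction,
        "viewingHint": viewing_hint,
        "sequences": [{"@type": "sc:Sequence", "canvases": canvases}],
    }


if __name__ == "__main__":
    # Hypothetical manuscript with a single page image.
    manifest = build_manifest(
        "https://example.org/iiif/ms-1",
        "Example manuscript",
        [("f. 1r", "https://example.org/images/ms-1/f1r.jp2", 2000, 3000)],
    )
    print(json.dumps(manifest, indent=2, ensure_ascii=False))
```

The label and the per-page data would come from the same input modules that feed the ESE/EDM output; only the output module changes.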

Semantic approach - Shared Canvas
Biblissima: Manifest for BnF MSS français 1728

Semantic approach - TEI as source to semantic metadata
Possible solutions for the missing encoding possibilities:
• TEI as an embedded format within, e.g., METS/MODS or Shared Canvas
or
• Extensions / new subsets of TEI regarding
  • inter-document relations (collection, volume of a series, …)
  • “technical” aspects of digital facsimiles (image files, …)

Semantic approach - TEI as source to semantic metadata
SIG Ontologies, TEI MM 2013, proposal (e.g.): “Extend the scope of relation to the object elements and to event and add the type attribute.” …

Conclusion
TEI has turned out to be an excellent source format for the Europeana Regia project. Future semantic approaches would need TEI to adapt to inter-CHO relations.

Invitation to SourceForge: http://sourceforge.net/projects/eregia2eseedm/

Questions?

biblissima (2013-2016) and TEI
• Miroir des Classiques
• Inventaires du Mont Saint-Michel
• reliures.bnf.fr
• e-ktobe
• Bibliothèques Virtuelles Humanistes
• glossae.net
• Telma
• Traditio Hollandrini
• Sanderus Electronicus
• sermones.net
• Catalogues régionaux des Incunables Informatisés (CRII)