CLARINO WP2 National Registry and Long- Term Archiving Freddy Wetjen and Oddrun Pauline Ohren...
-
Upload
avice-lyons -
Category
Documents
-
view
215 -
download
0
Transcript of CLARINO WP2 National Registry and Long- Term Archiving Freddy Wetjen and Oddrun Pauline Ohren...
CLARINOWP2 National Registry and Long-Term Archiving
Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway
Bergen, 12. September 2013
National Registry of metadata• Goal– Joint metadata registry of resources
in all Clarino centres• Harvest data from all CLARINO centres• Exchange data with other national
CLARIN centres
• Status – current situation• On-going and planned activities
National Registry of metadataStatus (1)• Metadata registry version 1 is running
– Search/browse, editing and management, but no harvesting facilities
– Infrastructure:• META-SHARE infrastructure 3.0
– http://metashare.nb.no/, proxied by the managing node http://metashare.tilde.com/
– Metadata complying META-SHARE metadata format 3.0– No harvesting facilities
– Metadata content:• 71 resources
– Usage:• 11.9.2013: 37 of the resources downloaded 1-17 times
– Norwegian Wordnet (Bokmål) at the top– Topmost downloading locations: Norway, Germany, Greece,
Sweden
National Registry of metadataStatus (2)• Decision made: Migrate to CMDI
(CLARIN platform)–Uncertain future for META-SHARE• 2 ys guaranteed life span
–Need for more adaptability and expressivity in metadata model
– Increased involvement with the CLARIN community
National Registry of metadataPlanned activities
• Build a basic CMDI infrastructure– Repository, editor, search service, PID
scheme, harvesting • Convert metadata from META-SHARE to
CMDI – Use META-SHARE profile as specified in
Component Registry• Extend/adapt metadata model according to
need– In collaboration with the other CLARINO centres
CMDI Metadata framework
SearchService
Joint MetadataRepository
TextLab EDD
Relation Registry
ISOcatConcept Registry
Other trusted concept
Registries
CLARINComponent
Registry
Bergen Centre
LAP
META-SHARE components, a.o
<xxxx><yyyy><zz><xxxx>
Other centre…
Componenteditor
Metadataeditor
Adaptation of Broeder, D. A Data Category Registry- and Component-based Metadata Framework. LREC 2010.
«My profile»
Definitions of concepts used in metadata components
Metadata modeler
Metadata creator
Språk-banken
User
Infrastr
ucture
provided by CLARIN
centrally
National Registry of metadata; Services
Repository
CMDI
MetadataEditor
(Arbil..?)
Metadata creator
OAI/PMH harvesting
SearchServices
WeblichtVLOFCS?
«Our profiles»
Clarin common infrastructure
Data Repository
Metadataeditor
-Resoures DataDelivery
client
Processing and adaptation for long term storage (Checksum,pid,metadata etc.)
NB long term storage (preservation)
Long term archiving
Time perspective
• Metadata registry version 2 : Primo 2014– Basic CMDI infrastructure
• existing metadata converted from META-SHARE
• OAI/PMH endpoint, but no harvesting from other centres
• Metadata registry version 3: Mid 2015– Extended/adapted metadata model– Harvesting from other CLARINO centres
• Long term archiving: Mid 2014 with both data and metadata.
CLARINOWP2 National Registry and Long-Term Archiving
Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway
Bergen, 12. September 2013