EurolibPlenarymeeting

download EurolibPlenarymeeting

of 61

Transcript of EurolibPlenarymeeting

  • 8/12/2019 EurolibPlenarymeeting

    1/61

    News from the Publications Office

    Norbert HohnPublications OfficeEurolib Plenary Meeting, Lisbon, 19-20 May 2011

  • 8/12/2019 EurolibPlenarymeeting

    2/61

    News about

    Virtua OPac EUBookshop

    Cellar Eurovoc Metadata Registry (MDR)

  • 8/12/2019 EurolibPlenarymeeting

    3/61

    News from the cataloguing service

  • 8/12/2019 EurolibPlenarymeeting

    4/61

    virtua : an off-the-shelf cataloguing tool for OP

    Agenda

    Project background Going into production Challenges and benefits What next?

  • 8/12/2019 EurolibPlenarymeeting

    5/61

    virtua : project background

    What were we looking for?

    An off-the-shelf ILMS cataloguing module and OPAC module To enable cataloguing to be done in-house

    A web OPAC To allow external users to search and download OP

    bibliographical records (replacement for LIBCO)

  • 8/12/2019 EurolibPlenarymeeting

    6/61

    virtua: project background

    25/02/2009 Launch of Call for Tender AO 10021 for an integratedlibrary management system

    13/07/2009 Award of contract to VTLS Europe, S.L. (virtua)

    01/09/2009 Kick-off meeting

    27/04/2010 Initial projected start date

    14/12/2010 Start of production

  • 8/12/2019 EurolibPlenarymeeting

    7/61

    virtua: project background

    What caused the delays?

    Requirement to communicate with several proprietarysystems

    Complex migration scenario

    Two additional projects required in order to go live: Codes Punctuation

  • 8/12/2019 EurolibPlenarymeeting

    8/61

    virtua: codes projectExample notice from virtua with codes

    041 0 $a eng $1 EN044 $c eu084 $a M11 $2 LU-LuOPE245 1 0 $a CEMP, the creation of European

    management practice : $b final report.260 $a {LUXB} : $b OPL, $c 2004.300 $a III, 127 NPAG : $b NFIG, NTAB ; $c A4 $d

    BR .440 $a EUR_SER_C ; $v 20968, $x 1018-5593504 $a UA_BIB : NPAG 90-97.540 $a REPRO1.650 7 $a 003656 . $2 EUROVOC

    710 2 $a CEU. $b RTD.773 1 8 $t EUR_SER_C $q 2004, NPER 20968

    910 $a GR920 $a 702

    Codes and their translations

    M11 = Theme (Social Sciences Research) {LUXB} = Luxembourg OPL = Publications Office NPAG = p. NFIG = ill.

    NTAB = tab. BR = softcover UA_BIB = Bibl. REPRO1 = Reproduction is authorised

    provided the source is acknowledged 003656 = Community research policy CEU = European Commission

    RTD = Directorate-General for Research EUR_SER_C = EUR. EU socio-economic

    research NPER = No GR = Free 702 = Specialised

  • 8/12/2019 EurolibPlenarymeeting

    9/61

    virtua: punctuation projectExample notice from virtua with automatically added punctuation

    245 1 0 $a CEMP, the creation of European management practice : $bfinal report .260 $a {LUXB} : $b OPL, $c 2004 .300 $a III, 127 NPAG : $b NFIG, NTAB; $c A4 $d BR.440 $a EUR_SER_C; $v 20968 , $x 1018-5593

    499 $a Project SOE1-CT97-1072504 $a UA_BIB: NPAG 90-97.

    540 $a REPRO1.650 7 $a 003656 . $2 EUROVOC

    700 1 $a Engwall, Lars , $e ED.710 2 $a CEU. $b RTD.773 1 8 $t EUR_SER_C $q 2004 , NPER 20968

  • 8/12/2019 EurolibPlenarymeeting

    10/61

    virtua: going into production

    Migration of >200 000 records from PROCATX (OPs database forlegal and general publications metadata) to virtua

    Re-import of these records from virtua to PROCATX(synchronisation of both systems)

    Parallel running with external cataloguing contractor until18.01.2011

  • 8/12/2019 EurolibPlenarymeeting

    11/61

    virtua: challenges and benefits

    Challenges:Learning a new systemCreating bibliographical records (as opposed to controllingthem)Adapting our workflowsIndexation of records using EUROVOC

    Benefits:Reduced time delays for cataloguing publications (3 daysreduced to 24 hours)Automated validation checks to ensure quality and consistencyAutonomy, enabling rapid intervention in records whenrequestedAnd not least, increased team spirit

  • 8/12/2019 EurolibPlenarymeeting

    12/61

    virtua: what next?

    Opening of OPac to current LIBCO users March 2011Possibility for users to export notices in MARC, CSV andEndnote

    Deep-linking to EUbookshop of all records held in Virtua(MARC21 field 856)

    Production of prepublication records

    Automatic activation of DOIs via an export from virtuaReduction of delays from moment publication is on EUB toactivation of DOI

  • 8/12/2019 EurolibPlenarymeeting

    13/61

    OPac - the OP online public access catalogue

    Out of the box OPAC of Virtua (Chamo)http://opac.publications.europa.eu/ Interface does not require a specific login etc. but we dont publicise it andgive the address only to 'approved' users.Lets users discover materials quickly, using familiar search methods such asQuick Search and faceted result links.Refining a search is as easy as picking a facet from a list or typing additionalterms in the search box and letting OPac add them to the original searchstring.Advanced search give users the advantage of applying multiple filterssimultaneously.

    Users are able to export references to EU general publications in the formatmore specifically designed for the library world (e.g. MARC21) as well as inEndNote or CSV format.

    http://opac.publications.europa.eu/http://opac.publications.europa.eu/
  • 8/12/2019 EurolibPlenarymeeting

    14/61

    OPac - the OP online public access catalogueThe tabs of the menu bar

    Login For administrators only.

    Heading To make searches by author, subject, title, and PUB_ID/workflow (catalogue number).

    Cart To store all records selected by user and to export them.

    Clear session Resets all searches done during the current session, cleans the cart and returns user tofirst page.

  • 8/12/2019 EurolibPlenarymeeting

    15/61

    Caveat: one peculiarity of the OPac service.

    OP uses a system of codes in records (e.g. 260 $a {LUXB} :) in order toproduce each record in the language of the publication catalogued. Although the facets display the translated values of these codes for the end-

    user, the MARC records themselves are displayed on the screen still coded.However, when adding the records to the cart and then downloading them (byselecting 'Export records to MARC'), the codes are automatically translated andyou will receive decoded notices in the resulting file (e.g. 260 $a Luxembourg :)for import into your system.

    Feedback As this is a new service, we welcome any feedback from our users, including

    ways in which we can improve it. If you need any further help or would like topropose any improvements, please contact our team using the followingaddress:[email protected]

    mailto:[email protected]:[email protected]:[email protected]:[email protected]:[email protected]:[email protected]
  • 8/12/2019 EurolibPlenarymeeting

    16/61

    Deep-linking to EUbookshop of all records held in Virtua(MARC21 field 856)

    Purpose:

    Only records since February 2011 systematically have a deep-link to EUBookshop.By adding a deep-link to the bibliographical records for General Publications anyoneusing these records will automatically be able to redirect their end-users to thePublication Details page on EUB where the user can order or download thepublication they are looking for, even if the library/information centre displaying therecord does not hold a copy of the publication themselves.

    Actions: Retrospectively adding a deep-link to field 856 in each bibliographical record to allexisting records (250.000). Multiple assignment possible, i.e. in addition to DOI (linkto resolver)Updating the import workflows into virtua so that all new records are given this deep-link by default.

    Proposed date of putting into production: June 2011Customers:

    Current LIBCO clientsPilot project with the British Library, which would result in some 50, 000 records beingmade available in the UK through a syndication of libraries

  • 8/12/2019 EurolibPlenarymeeting

    17/61

  • 8/12/2019 EurolibPlenarymeeting

    18/61

    News from the EUBookshop

  • 8/12/2019 EurolibPlenarymeeting

    19/61

    Metadata added to the publication detail page target audience and Eurovoc descriptors. These terms keywords are browsable.

  • 8/12/2019 EurolibPlenarymeeting

    20/61

    New "discover" section - a menu through which users can access thematic collections of publications that cannotbe easily retrieved by site search or browsing. The compilation is often informed by frequently searched terms,such as map or comics

  • 8/12/2019 EurolibPlenarymeeting

    21/61

    A "just published" section - recently published titles

  • 8/12/2019 EurolibPlenarymeeting

    22/61

    News from the CELLAR

    Common Access to EU Information

  • 8/12/2019 EurolibPlenarymeeting

    23/61

  • 8/12/2019 EurolibPlenarymeeting

    24/61

    24/7

    Production

    Citizens/Professionals

    O f f i c i a l P u

    b l i c a

    t i o n s

    T e n

    d e r

    i n g

    D o c u m e n

    t s

    G e n e r a

    l P u

    b l i c a

    t i o n s

    C O R D I S

    Authors

    Common access to EU information

    EUR-Lex TEDDissemination -Specialized

    portals

    EUBookshop CORDIS

    Present : silos = independent solutions

  • 8/12/2019 EurolibPlenarymeeting

    25/61

    25/7

    Dissemination

    Production

    Citizens/Professionals

    O f f i c i a l P u

    b l i c a

    t i o n s

    T e n

    d e r

    i n g

    D o c u m e n

    t s

    G e n e r a

    l P u

    b l i c a

    t i o n s

    C O R D I S

    Authors

    Common portalSpecialized portals

    Future : harmonized architecture = common & shared solutions

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    26/61

    Common access to EU information

    Target architecture

  • 8/12/2019 EurolibPlenarymeeting

    27/61

    Data flows:

    Dissemination layer

    Data layer

    Definition layer

    Tenderingdocuments

    Generalpublications

    Externalsources(Court of

    Justice )CORDIS

    Validation

    Reference

    Publishing

    Post-Production

    Production

    ArchiveLong term

    preservation

    Officialpublications

    CELLAR Functional architecture 1/3

    Reception, technical validation and storageof content and metadata.

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    28/61

    Data flows:

    Dissemination layer

    Data layer

    Definition layer

    Tenderingdocuments

    Generalpublications

    Externalsources(Court of

    Justice )CORDIS

    Validation

    Reference

    Publishing

    Post-Production

    Production

    ArchiveLong term

    preservation

    Officialpublications

    CELLAR Functional architecture 2/3

    Repository models (CCR and CMR),business rules (for uploading, archiving anddissemination),transformation rules, EuroVoc dissemination,authority tables including translations.

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    29/61

    Data flows:

    Dissemination layer

    Data layer

    Definition layer

    Tenderingdocuments

    Generalpublications

    Externalsources(Court of

    Justice )CORDIS

    Validation

    Reference

    Publishing

    Post-Production

    Production

    ArchiveLong term

    preservation

    Officialpublications

    CELLAR Functional architecture 3/3

    Access to and provision of content and metadata in therequested format and/or presentation.

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    30/61

    Data flows:

    Dissemination layer

    Data layer

    Definition layer

    Tenderingdocuments

    Generalpublications

    Externalsources(Court of

    Justice )CORDIS

    Validation

    Reference

    Publishing

    Post-Production

    Production

    ArchiveLong term

    preservation

    Officialpublications

    METS

    FRBR

    METS

    CELLAR Based on standards

    OAISReference

    model

    XML

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    31/61

    Data flows:

    Dissemination layer

    Data layer

    Definition layer

    Tenderingdocuments

    Generalpublications

    Externalsources(Court of

    Justice )CORDIS

    Validation

    Reference

    Publishing

    Post-Production

    Production

    ArchiveLong term

    preservation

    Officialpublications

    RDFSKOS

    SPARQLendpoint

    CELLAR Web 3.0,semantic technology

    OWL

  • 8/12/2019 EurolibPlenarymeeting

    32/61

    Complete collection of EU legal documents includingTreatiesOfficial JournalCase-law

    Preparatory actsConsolidated acts

    General publications

    Research reports

    Merger taskforce decisions

    Digital archive of the EU

    Content

  • 8/12/2019 EurolibPlenarymeeting

    33/61

    CELLAR A service enablerOn-line access Provide on-line access through the Internet portals of the

    Publications Office.

    Automated access Provide suitable interfaces for access by automatedagents.

    External indexing Enable indexing by Internet search engines.

    Notification Provide configurable notification services (RSS- feeds).

    Downloading Support sporadic and regular downloading of resources(subscription). Regular downloading should beconfigurable.

    Strategic formats PDF, in particular PDF/A-1a and PDF/A-1b; XML; TIFF

    Specific formats Provide formats, which are not natively available in theCELLAR (LegisWrite, ONIX notices), i.e. transformationservices.

    Deep linking Enable external referencing of resources and guaranteepersistence of links over time.

    Common access to EU information

  • 8/12/2019 EurolibPlenarymeeting

    34/61

    2010/2011 development (ongoing)

    2011 data migration and upload (ongoing)

    2012 online (planned)

    Common access to EU information

    CELLAR ROADMAP

  • 8/12/2019 EurolibPlenarymeeting

    35/61

    News from Eurovoc

  • 8/12/2019 EurolibPlenarymeeting

    36/61

    EuroVoc Next releases 4.4

    Next release in Summer 2011 (EuroVoc 4.4)

    Update linked to the new Lisbon Treaty EC EU European Community European Union

    You can contribute via the website

    Permanent URI and ID for thesaurus Terms and conceptsLOD (Linked Open Data)

    No deletion for conceptsobsolete (use instead)deprecated (move as Non Preferred Term of a new concept)

    E V TAE P j P

  • 8/12/2019 EurolibPlenarymeeting

    37/61

    EuroVoc TAE Project - Purpose

    TAE = Thesaurus Alignment EnvironmentInitiative of the Publications Office

    Mapping = matchingCreate semantic correspondences between concepts of two thesauri

    Objective: Map EuroVoc to

    ETT - European Vocational Training Thesaurus (Cedefop)GEMET - General Multilingual Environmental Thesaurus (European EnvironmentalAgency)Directory of European Legislation in force (EUR-Lex)EuroVoc 4.2Taxonomy EUB

    ETT

  • 8/12/2019 EurolibPlenarymeeting

    38/61

  • 8/12/2019 EurolibPlenarymeeting

    39/61

    EuroVoc TAE Project Examples for Automated alignements

    Types of correspondences generated by algorithms

    ExactMatch concept T1 = concept T2 T1 acid rain exact match T2 acid rainT1=Gemet T2=EuroVoc

    BroadMatch - concept T1 has a generic concept in T2 T1 animal genetics broad match T2 genetics

    NarrowMatch - concept T1 has a specific concept in T2 T1 mammal narrow match T2 wild mammal

  • 8/12/2019 EurolibPlenarymeeting

    40/61

    EuroVoc TAE Project Practical use (overview) Indexing

    Detailed and enriched indexing

    Automatic indexing and re-indexingDouble annotation

    Retrieving - Semantic extensionIntegration of results into search engines

    Facilitate users researches Did you mean.. ? Redefinition of the research : Extend or Narrow the search results

    Results stored in CELLARA unique storage and dissemination platform of the PO to accessEuropean law and publications

    SKOS web services and Sparql-end point for accessing and queryingthe mapping results

  • 8/12/2019 EurolibPlenarymeeting

    41/61

    EuroVoc TAE Project Practical use: Help to indexing

    Annotation of a document by indexing of a specialized thesaurus whaling is not represented in EuroVoc but GEMET contains whaling

    Example in EUR-Lex

    http://www.eionet.europa.eu/gemet/concept?cp=9305&langcode=en&ns=1http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=CELEX:52009IP0067:EN:NOThttp://www.eionet.europa.eu/gemet/concept?cp=9305&langcode=en&ns=1
  • 8/12/2019 EurolibPlenarymeeting

    42/61

    EuroVoc TAE Project Practical use: Help to indexing

    Correspondences (Gemet EuroVoc) proposes in TAEWhaling exactMatch whale AND hunting regulation Compound Mapping

    EuroVoc TAE Project Practical use: Help to information retrieval

  • 8/12/2019 EurolibPlenarymeeting

    43/61

    EuroVoc TAE Project Practical use: Help to information retrieval

    Search enginesDid you mean ?Automatic query expansion or restriction

    Search for Whale

    Did you mean ?

    Whaling Restrict the search results towards a more specific

    concept in the target thesaurus

    Whale or Marine mammal

    Expand the search results towards a more genericconcept in the source thesaurus

    EuroVoc Future actions

    http://eurovoc.europa.eu/6403http://eurovoc.europa.eu/6403
  • 8/12/2019 EurolibPlenarymeeting

    44/61

    EuroVoc Future actions

    MetaThesaurus Working Group

    Main purposeSet up a specialized, multilingual thesauri network around EuroVocMeeting foreseen in June 2011

    AdvantagesUse the same standards and formatsDelegate the maintenance of specific domainsShare candidates and translations

    Participants:

    EU Institutions, European agenciesInternational institutions (FAO, Unesco)Other multilingual thesauri ( EINIRAS)First approach made during the EuroVoc Conference (Luxembourg,November 2010)

    http://www.ireon-portal.de/http://www.ireon-portal.de/
  • 8/12/2019 EurolibPlenarymeeting

    45/61

    EuroVoc Refresher of its benefits

    Enterprise Content Categorization Develop from the scratch

    Time consuming to build a taxonomy or controlled vocabularyUse Starter metadata to speed -up the development

    Import external metadata, taxonomies or controlled vocabulary

    in your ECM system Avoiding duplicate efforts Minimize the cost of adding and managing metadata

    EuroVoc = a Building block of your ECM application

    A high-level controlled vocabulary Cost benefit : maintained by the Publications Office Offers different levels of specificity (TAE, thesauricollaboration network)

  • 8/12/2019 EurolibPlenarymeeting

    46/61

    EuroVoc within the OP Cellar

    In the repository will be stored:EuroVoc, the thesaurusThe mapping or alignment results

    On the Cellar service layer EuroVoc will be implemented as webservices and Sparql-Endpoint for e.g.

    Linked Open DataCrosswalk EuroVoc and Semantic web applicationsDereferencable URIExamples

    Search a term (expression or URI) and retrieves the alignments Search a term (expression or URI) and retrieves its relations

    (Broader Term, Specific Term, Related Terms) Search a Microthesaurus and retrieves all the terms

    E V Li i li

  • 8/12/2019 EurolibPlenarymeeting

    47/61

    EuroVoc Licensing policy Free of charge (4-Years)

    Email: [email protected] Information in the website under legal notice Login and Password to download the SKOS or XMLAlert once a new release is available

    405 licences (64 for 2010, 64 for 2009)

    Types of licenceIndexing

    Text mining and extraction, automatic indexing and categorization, Library Information System, Knowledge Management & ECM

    Translation (Albanian)

    Academic, project, research Semantic technologies Term matching

    E i

    mailto:[email protected]://eurovoc.europa.eu/drupal/?q=legalnotice&cl=enhttp://eurovoc.europa.eu/drupal/?q=legalnotice&cl=enmailto:[email protected]:[email protected]:[email protected]
  • 8/12/2019 EurolibPlenarymeeting

    48/61

    Eurovoc mappings

  • 8/12/2019 EurolibPlenarymeeting

    49/61

    Contact

    Ms Christine [email protected]

  • 8/12/2019 EurolibPlenarymeeting

    50/61

    News from the Metadata Registry

  • 8/12/2019 EurolibPlenarymeeting

    51/61

    What is the Metadata Registry (MDR)?

    A central reference point for the registration and maintenance ofmetadata definitions and related authority data used by

    The interinstitutional systems supporting the decision makingprocessThe production and dissemination systems of the Publications

    OfficeA framework for the harmonisation and standardisation of themetadata used in this context

    DocumentationOrganisation

    ProceduresProvide the reference metadata for reuse and validation purposesto internal and external clients/client systems in human andmachine-readable format

  • 8/12/2019 EurolibPlenarymeeting

    52/61

    Metadata Register Scope

    Core metadataLimited set of metadata, which needs to be adopted by everyinstitution to enable interoperability, in particular in thecontext of the decision making processCommon part of the Metadata register

    Management on interinstitutional level (IMMC)

    Specific metadataMetadata dedicated to the specific internal needs of eachinstitution

    Out-of-scope for the common part of the Metadata registerPrivate workspace inside the Metadata register could beprovided to facilitate management by the owner

  • 8/12/2019 EurolibPlenarymeeting

    53/61

    Metadata Registry Expected benefits

    Central reference location for metadata definitions and authoritydataReference source for consultation/validation purposesStimulates reuse of metadata and increase interoperabilityFramework for harmonization and standardizationPlatform for collaboration and knowledge exchange in metadatadomain on interinstitutional level

  • 8/12/2019 EurolibPlenarymeeting

    54/61

    Metadata Register - Architecture

    Back-end applicationMaintenance of metadata definitions and authority dataAccess limited to restricted number of expert usersBased on same tool as used for Eurovoc back-end (ITM)Possibility to create individual workspaces

    Registration workflow (JIRA)

    Metadata Registry website (front-end)Browse MDR content (read access)Detailed information about registered itemsPossibility to submit proposal for registration/feedback (e.g. byEurolib members)

  • 8/12/2019 EurolibPlenarymeeting

    55/61

    Metadata Register Workflow overview

  • 8/12/2019 EurolibPlenarymeeting

    56/61

    Metadata Registry - Organisation (proposal) 1/2

    Publications Office levelManagement of changes in MDR by Metadata Register Team(MRT)

    Interinstitutional levelProposals for registration by Interinstitutional Metadata

    Maintenance Committee (IMMC) (2 members per institution)Submission of relevant proposals by MRT to IMMC for approvalTechnical support/evaluation by MRT on requestManagement of changes in MDR by MRTSupervision by Interinstitutional Metadata Steering Committee

    (IMSC) composed of the supplants of the management boardof the Publications Office

  • 8/12/2019 EurolibPlenarymeeting

    57/61

    Metadata Registry Organisation 2/2

    Common Authority Tables (CAT) April 2011

  • 8/12/2019 EurolibPlenarymeeting

    58/61

    Common Authority Tables (CAT) April 2011Common Authority Tables Source

    Languages (ISO 639/1, 639/2B|T, 639/3) ISO

    Countries (ISO 3166/1- 2 and 3, 3166/3) ISO

    NTU (incl. NUTS and ISO 3166-2) ISO + UNO + Eurostat

    Currencies (ISO 4217) ISO

    Corporate Bodies Various

    Roles LC + EurLex + Prelex

    Places (locations, towns) UN-LOCODE

    Resource format (incl. dimensions) ONIX + IANA

    Resource type (categories of resources) Internal sources

    Target Audience ONIX

    Procedures PreLex

    Events PreLex

    Etc.

    stable versionin progress to be started

  • 8/12/2019 EurolibPlenarymeeting

    59/61

    Metadata Registry - Roadmap

    Project kick-off: 20/12/2010Phase 1: Implementation of back-end application (managementof ontology, authority tables, export)Target date: June 2011

    Phase 2: Implementation of front-end application

    Target date: August 2011

  • 8/12/2019 EurolibPlenarymeeting

    60/61

    MDR project contacts

    Metadata Registry team:Holger BAGOLACorinne FRAPPARTMadeleine KISSMartin SCHERBAUMWillem VAN GEMERTContact:[email protected]

  • 8/12/2019 EurolibPlenarymeeting

    61/61

    Thank you for your attention!

    We appreciate your questions andsuggestions.