Digital Libraries at EEN
CERN Open Source Collaborative tools: Digital Library Software
Tim Smith CERN/IT
CERN IT DepartmentCH-1211 Genve 23Switzerlandwww.cern.ch/itEEN [Jun 2014] - #Libraries
EEN [Jun 2014] - #A Visionary Perspective
Sharing Knowledge..
..to accelerate Science..to foster Collaboration..to enrich the World
EEN [Jun 2014] - #An early Open Access manifesto Founding fathers visionBiggest scientific apparatus3Preprint Culture
EEN [Jun 2014] - #Dissemination
EEN [Jun 2014] - #CERN Users around the World10,000 scientists and engineers, 98 countries
EEN [Jun 2014] - #Dawn of Internet Age
EEN [Jun 2014] - #SPIRES: first web site in the USAAnd the first DataBase on the web
EEN [Jun 2014] - #Accelerating ScienceScientific dialogue on repositories
Gentil-Beccot, Mele, Brooks arXiv:0906.5418
EEN [Jun 2014] - #Towards Digital Libraries1993:CERN Preprint Server serves HEP & CERN preprints1996:CERN Library Server provides access to Library Catalog2000:CERN Document Server includes multimedia, restricted notes2002:CDSWare SW is released open source2006:CDSWare becomes Invenio; start of I18N collaborations2010:Invenio 1.0 released and adopted world-wideEEN [Jun 2014] - #One Stop Shop> 1 million records
EEN [Jun 2014] - #Digital Library Services
CollectionAggregationConversionStampingWatermarkingCurationCataloguingOrganisationEnrichmentPreservationAccessIndexingRankingClusteringClassifyingEEN [Jun 2014] - #12Plot ExtractionCaption extraction and search
EEN [Jun 2014] - #Visualizing Patterns of Connection
EEN [Jun 2014] - #Open and Closed Data !
Workflows
Transformations
RestrictionsEEN [Jun 2014] - #Digital Age ServicesCollaboration Web2.0Comments, reviews, basketsImmediacyEmail alerts, RSS feedsIntensive tasksKeyword & reference extractionCitation analysisFull text indexing & rankingConversion services: multiple download formatsFlexible formatsRemove constraints of print versionsInternationalisationEEN [Jun 2014] - #Authors
EEN [Jun 2014] - #Authors
EEN [Jun 2014] - #Author Disambiguation
EEN [Jun 2014] - #The Invenio PlatformMature digital library platformArticles, books, notes, photos, videos, software, dataOAIS-inspired preservation practicesTypical use cases:Institutional document repositories, e.g. CERN, EPFL, GSIInternal collections, pre-publication workflows with approvalSubject-based information systems, e.g. INSPIRE, ILCPublic collections, worldwide data with citation analysisLarge libraries and library networks, e.g. ILO, RERO, FZ Co-developed by international collaboration
EEN [Jun 2014] - #Invenio @ M9
EEN [Jun 2014] - #Scientific dialogue 2.0EEN [Jun 2014] - #BlogForever - PreservationEC funded project, 20112013 (Invenio based)
Platform to harvest, manage, preserve and disseminate blog contentBlog posts, comments, embedded material (images, videos)Ensure authenticity, integrity, completeness, long-term usabilityOAIS AIPEEN [Jun 2014] - #Open Archival Information System
EEN [Jun 2014] - #Open Access alwaysDOI10.1103/PhysRevLett.105.161801Citation networksFormat
Transformation: PDF/AOAIS (ISO 14721:2012)Preservation meta data: provenance, context, usage
EEN [Jun 2014] - #Preservation25Data Intensive ScienceEEN [Jun 2014] - #Data Analysis and PreservationPapersTabular DataCorrelation Matrices
Internal NotesWikisPresentations
Quality monitoring dataFilter / selection algorithmsFormatters
Calibration DataConditions DataLog BooksResearchersT2s, T1s
Analysis CoordinatorsT1s
Production ManagersT0, T1s
WorkflowsContextual metadataSW: 10M LoCEEN [Jun 2014] - #Root 4M LoCATLAS: 2008 estimate of 5M LoC27Big Data in small pieces
Long tail of scienceBig facilitiesData Sizex (a small number)x (a large number)DedicatedBig Data Stores
EEN [Jun 2014] - #28
http://zenodo.org
EEN [Jun 2014] - #
Features
http://www.altmetric.comhttp://www.datacite.orghttp://www.openaire.euEEN [Jun 2014] - #Research Repository
EEN [Jun 2014] - #
Communities
Direct community uploadExportAccept/reject uploadsEEN [Jun 2014] - #Research Repository
EEN [Jun 2014] - #Reusability: Software Preservation
EEN [Jun 2014] - #Open Data as a Service
RESTAPIOAI-PMHAPI
OrchestrateEEN [Jun 2014] - #ConclusionsInformation is a valuable asset that is multiplied when it is shared
Mandates and policiesOpenness, preservation
Open DataDiscoverable, Accessible, Intelligible, Assessable, Useable
Digital Libraries make this possible !EEN [Jun 2014] - #EEN [Jun 2014] - #
Top Related