GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)
![Page 1: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/1.jpg)
Dag Endresen ([email protected])
Knowledge Systems Engineer, GBIF
New Orleans (Louisiana, USA), 20 October 2011
Biodiversity Information Standards (TDWG), Annual Meeting 2011, New Orleans
The GBIF KOS Work Program: Prioritized Requirements and Proposed Solutions
![Page 2: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/2.jpg)
Outline
• Element vocabularies and value vocabularies
• Vocabulary management tools
• Vocabularies exchange format (SKOS)
• Vocabulary registry (portal)
• New data types
![Page 3: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/3.jpg)
Standards
Biodiversity Information Standards (TDWG), the Dublin Core Metadata Initiative (DCMI), the Genomic Standards Consortium (GSC), etc. provide domain standards. We want to reuse, map and relate terms across these standards.
Why: Gain understanding across domains
![Page 4: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/4.jpg)
Element vocabulary (glossary)
Darwin Core (DwC), Dublin Core (DCMI), Ecological Metadata Language (EML), Gene Ontology (GO), TDWG Ontology, etc... provide definitions for conceptual terms. We want to reuse, map and relate terms from basic vocabularies with concept definitions.
Why: Reuse terms and share common definitions and a common understanding of biodiversity concepts.
![Page 5: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/5.jpg)
Vocabulary management tools
• GBIF Vocabularies: custom Scratchpad tool (Drupal)
• Semantic Wiki (SpeciesID, Key to Nature)
• Protégé (collaborative Protégé): SKOSEd plugin, Web-Protégé
• TopQuadrant EVN (commercial)
• PoolParty (commercial)
• ThManager (open source)
• ISOcat (CLARIN, linguistics)
• iQvoc (open source)
• TemaTres (open source, Spanish)
![Page 6: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/6.jpg)
GBIF Vocabularies
http://vocabularies.gbif.org
Collaborative development of community terminology, including biodiversity concept definitions and controlled value lists.
![Page 7: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/7.jpg)
Controlled Vocabularies
The “Vocabularies” are Value Vocabularies (authority files) of accepted values for terms where controlled values are already available - or appropriate to develop.
dwc:basisOfRecord: PreservedSpecimen, FossilSpecimen, LivingSpecimen, HumanObservation, MachineObservation, NomenclaturalChecklist, Occurrence, Taxon, Location
dc:Type: Collection, Dataset, Event, Image, InteractiveResource, Service, Software, Sound, Text, PhysicalObject, StillImage, MovingImage
gbif:nomenclatural_code: ICBN, ICZN, ICVCN, ICNB, ICNCP, BioCode
Why: standardize how biodiversity data is provided when controlled values are appropriate
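As a rough illustration of what a value vocabulary buys a data publisher, the sketch below checks a record against the accepted values listed on this slide. The dictionary layout and the function name `invalid_fields` are assumptions made for this example and are not part of any GBIF tool.

```python
# Hypothetical in-memory copy of two value vocabularies from the slide.
VALUE_VOCABULARIES = {
    "dwc:basisOfRecord": {
        "PreservedSpecimen", "FossilSpecimen", "LivingSpecimen",
        "HumanObservation", "MachineObservation", "NomenclaturalChecklist",
        "Occurrence", "Taxon", "Location",
    },
    "gbif:nomenclatural_code": {
        "ICBN", "ICZN", "ICVCN", "ICNB", "ICNCP", "BioCode",
    },
}


def invalid_fields(record):
    """Return the controlled terms in `record` whose value is not accepted."""
    problems = {}
    for term, value in record.items():
        accepted = VALUE_VOCABULARIES.get(term)
        # Terms without a value vocabulary are free text and are not checked.
        if accepted is not None and value not in accepted:
            problems[term] = value
    return problems
```

Calling `invalid_fields({"dwc:basisOfRecord": "specimen"})` would flag the value, because only the exact strings from the vocabulary are accepted in this sketch.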
![Page 8: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/8.jpg)
Controlled Vocabularies
“Extensions” are Element Vocabularies defining new terms organized as extensions to Core Types (dwc:Taxon and dwc:Occurrence).
• Audubon Core (multimedia/images)
• DwC-Germplasm (plant genetic resources)
• EOL Data Object (species profiles)
• GISIN Species Status (invasive species)
• … etc.
Why: Provide a mechanism for thematic communities to define their own specific terms.
![Page 9: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/9.jpg)
GBIF Vocabularies
Core types – could be more than dwc:Taxon and dwc:Occurrence
• habitat, spatial areas, lines, grid, places, images/multimedia, literature, people, institutes, collections, collection specimens, etc…?
“Extensions” = element/attribute vocabulary, definition of terms
• Separate the definition of terminology from application models
• Is “extensions” the appropriate label?
“Vocabularies” = value vocabulary, authority files
• external examples: countries, languages, …
• biodiversity domain: taxonRank, basisOfRecord, …
http://vocabularies.gbif.org
![Page 10: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/10.jpg)
GBIF Vocabularies
GBIF Vocabularies is hosted by the Scratchpads server in London
• Install the GBIF Vocabulary Service in Copenhagen?
• Further developments are needed.
• Package the Vocabulary Service as an open-source tool?
• Develop as Drupal modules, migrate to Drupal 7?
Element vocabularies are not always an “extension” of Darwin Core…?
• Add management interface with definitions for new core types?
• Rename “Extensions” to “Element Vocabularies” or “Attribute Vocabularies”?
• Rename “Vocabularies” to “Value Vocabularies” or “Authority files”…?
Export and import of vocabularies to and from other management systems (SKOS, RDF, OWL as vocabulary exchange format?)
• SKOS import and export features to be developed?
Improved human-readable interface
• Export to HTML/PDF format for human-readable documentation of a vocabulary?
![Page 11: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/11.jpg)
Vocabulary Registry/Portal
• GBIF Vocabulary Registry: Is the present registry sufficient?
• GBIF Vocabularies: Develop the Scratchpads solution further as a vocabulary registry?
• NCBO BioPortal alternative: Start using the NCBO BioPortal software
Why: Support the discovery of biodiversity terminology and standard vocabularies.
![Page 12: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/12.jpg)
GBIF Vocabulary Registry
The official versions of the “vocabularies” and “extensions” for deployment are available from the GBIF Registry (http://rs.gbif.org). GBIF infrastructure such as the Integrated Publishing Toolkit (IPT) and the Harvesting and Indexing Toolkit (HIT) reads them from there.
Separate service for discovery – different service from the GBIF Vocabulary site (management ≠ discovery).
![Page 13: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/13.jpg)
GBIF Vocabulary Registry
Promote SKOS as the preferred vocabulary (exchange) format? Gradually replace XML Schema for defining standards?
Why: Promote ease of vocabulary exchange, import and export.
http://rs.gbif.org
Simple Knowledge Organization System (SKOS)
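To make the exchange-format idea concrete, here is a minimal sketch of exporting a flat value vocabulary as a SKOS concept scheme in Turtle. It uses only string formatting; the function name `to_skos_turtle` and the `example.org` scheme URI are assumptions for illustration, and the sketch assumes labels are simple tokens that need no escaping.

```python
SKOS_NS = "http://www.w3.org/2004/02/skos/core#"


def to_skos_turtle(scheme_uri, scheme_label, values):
    """Serialize a flat value vocabulary as a SKOS concept scheme in Turtle."""
    lines = [
        f"@prefix skos: <{SKOS_NS}> .",
        "",
        f"<{scheme_uri}> a skos:ConceptScheme ;",
        f'    skos:prefLabel "{scheme_label}"@en .',
    ]
    for value in values:
        # One skos:Concept per accepted value, linked back to its scheme.
        lines += [
            "",
            f"<{scheme_uri}/{value}> a skos:Concept ;",
            f'    skos:prefLabel "{value}"@en ;',
            f"    skos:inScheme <{scheme_uri}> .",
        ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical scheme URI; the real vocabulary URIs are not given here.
    print(to_skos_turtle(
        "http://example.org/vocabulary/basisOfRecord",
        "Basis of record",
        ["PreservedSpecimen", "FossilSpecimen"],
    ))
```

A production export would more likely go through an RDF library so that escaping, language tags and alternative serializations are handled consistently.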
![Page 14: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/14.jpg)
GBIF Vocabulary Registry
Add human interface to explore SKOS documents at the GBIF Registry?
• OWLDoc (CO-ODE, static HTML)
• OWL Ontology Browser (CO-ODE, dynamic)
![Page 15: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/15.jpg)
Using the BioPortal Registry
GBIF KOS Task Group:
“GBIF should deploy an instance of the BioPortal platform for biodiversity ontologies as a complement to the GBIF Vocabularies Server.”
![Page 16: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/16.jpg)
Using the BioPortal Registry
Include Biodiversity Vocabularies in the NCBO BioPortal…?
• Will support the mapping of terms to the major Genomics Vocabularies.
Establish a “GBIF BioPortal” using the same BioPortal software?
• Will focus on Biodiversity Community identity and relevance.
![Page 17: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/17.jpg)
Workflow
Draft vocabulary
Review version
Published version
Approve?
… and other SKOS compliant vocabulary management tools.
-> Uptake by the GBIF infrastructure including the IPT and the data portal.
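The workflow stages can be read as a small state machine. The sketch below assumes the order Draft, Review, Published, with the “Approve?” decision taken at the review stage and a rejection sending the vocabulary back to draft; the slide does not spell out these transitions, so treat them as one possible reading.

```python
from enum import Enum


class Status(Enum):
    DRAFT = "Draft vocabulary"
    REVIEW = "Review version"
    PUBLISHED = "Published version"


def next_status(current, approved=None):
    """Advance a vocabulary one step through the assumed workflow."""
    if current is Status.DRAFT:
        return Status.REVIEW
    if current is Status.REVIEW:
        if approved is None:
            raise ValueError("a review needs an approve/reject decision")
        # Assumption: a rejected review version returns to draft.
        return Status.PUBLISHED if approved else Status.DRAFT
    # Published versions stay published in this sketch.
    return Status.PUBLISHED
```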
![Page 18: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/18.jpg)
GBIF Strategic Plan 2012-2016:
“In anticipation of the integration and serving of future data types, GBIF will work closely with partners to enable data integration and interoperability across phenotypic, genomic, taxonomic, geospatial and ecosystem domains.”
![Page 19: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/19.jpg)
GBIF Strategic Plan 2007-2011:
“Further activities as part of the Plan will include improving the Data Portal system and expanding the depth and range of data types”
“specimen, observation, descriptive, literature, name/concept, image, character, OGC, etc”
![Page 20: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/20.jpg)
New Core Types?
• dwc:Taxon
• dwc:Occurrence
• Audubon Core (images/multimedia)
• Invasive Species (invasive in region/country)
• New Spatial Objects (from point locations to include polygon, poly-line and grid objects)
• etc…
Is the general principle of extending Core Types also suitable for new data types?
![Page 21: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/21.jpg)
Data types
dwc:identificationID, dwc:dateIdentified, dwc:identifiedBy, dwc:taxonID, dwc:scientificNameID, dwc:scientificName, …
dwc:measurementID, dwc:measurementValue, dwc:measurementUnit, dwc:measurementDeterminedBy, …
dwc:taxonID, dwc:scientificNameID, dwc:scientificName, dwc:taxonConceptID, dwc:kingdom, dwc:family, dwc:genus, dwc:specificEpithet, …
dwc:occurrenceID, dwc:basisOfRecord, dwc:eventID, dwc:eventDate, dwc:locationID, dwc:decimalLongitude, dwc:decimalLatitude, dwc:taxonID, dwc:scientificNameID, dwc:scientificName, …
dc:identifier, dc:bibliographicCitation, dc:title, dc:creator, dc:date, dc:source, dc:language, dwc:taxonRemarks, …
dc = http://purl.org/dc/terms/
dwc = http://rs.tdwg.org/dwc/terms/
gbif = http://rs.gbif.org/terms/1.0/
etc…
dwc:vernacularName, dc:language, dc:temporal, dwc:locality, …
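The prefixed names on this slide are shorthand for full term URIs built from the three namespace declarations. A minimal sketch of that expansion follows; the function name `expand` is an assumption for this example.

```python
# Namespace prefixes as declared on the slide.
PREFIXES = {
    "dc": "http://purl.org/dc/terms/",
    "dwc": "http://rs.tdwg.org/dwc/terms/",
    "gbif": "http://rs.gbif.org/terms/1.0/",
}


def expand(term):
    """Expand a prefixed term such as 'dwc:scientificName' to its full URI."""
    prefix, sep, local_name = term.partition(":")
    if not sep or prefix not in PREFIXES:
        raise KeyError(f"unknown prefix in term: {term!r}")
    return PREFIXES[prefix] + local_name
```

For example, `expand("dwc:scientificName")` yields `http://rs.tdwg.org/dwc/terms/scientificName`.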
![Page 22: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/22.jpg)
Star schema
dc = http://purl.org/dc/terms/
dwc = http://rs.tdwg.org/dwc/terms/
gbif = http://rs.gbif.org/terms/1.0/
audubon = http://rs.tdwg.org/ac/terms/
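In a star schema, each extension row points back to exactly one core record through the core identifier, and extensions never reference each other. The sketch below assumes a Taxon core with a vernacular-name extension keyed on dwc:taxonID; the sample rows and the function name `attach_extension` are made up for illustration.

```python
from collections import defaultdict

# Hypothetical core records (the centre of the star).
core_taxa = [
    {"dwc:taxonID": "t1", "dwc:scientificName": "Puma concolor"},
    {"dwc:taxonID": "t2", "dwc:scientificName": "Quercus robur"},
]

# Hypothetical extension rows; each one refers to a core record by dwc:taxonID.
vernacular_rows = [
    {"dwc:taxonID": "t1", "dwc:vernacularName": "cougar", "dc:language": "en"},
    {"dwc:taxonID": "t1", "dwc:vernacularName": "puma", "dc:language": "es"},
]


def attach_extension(core, extension, key="dwc:taxonID"):
    """Group extension rows under the core record they point to."""
    by_core_id = defaultdict(list)
    for row in extension:
        by_core_id[row[key]].append(row)
    return [
        {**record, "extension": by_core_id.get(record[key], [])}
        for record in core
    ]
```

A core record may have zero, one or many extension rows, which is why the join returns a list per record.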
![Page 23: GBIF Vocabularies, at TDWG 2011 (20 Oct 2011)](https://reader036.fdocuments.us/reader036/viewer/2022062513/554e8227b4c90545698b53bc/html5/thumbnails/23.jpg)
Star schema (??)
dc = http://purl.org/dc/terms/
dwc = http://rs.tdwg.org/dwc/terms/
gbif = http://rs.gbif.org/terms/1.0/
audubon = http://rs.tdwg.org/ac/terms/
etc…