Semantically supporting data discovery, markup and aggregation in EMODnet

23
Semantically supporting data discovery, markup and aggregation in the European Marine Observation and Data Network (EMODnet) Roy Lowry & Adam Leadbetter British Oceanographic Data Centre [email protected] & alead.ac.uk

description

 

Transcript of Semantically supporting data discovery, markup and aggregation in EMODnet

Page 1: Semantically supporting data discovery, markup and aggregation in EMODnet

Semantically supporting data discovery, markup and

aggregation in the European Marine Observation and Data

Network (EMODnet)Roy Lowry & Adam LeadbetterBritish Oceanographic Data Centre

[email protected] & alead.ac.uk

Page 2: Semantically supporting data discovery, markup and aggregation in EMODnet

Why?• Creating information products for EMODnet

from SeaDataNet data• Increased discoverability of Parameter Usage

Vocabulary Codes• Increased interoperability with work of CSIRO

under the ODIP project

Page 3: Semantically supporting data discovery, markup and aggregation in EMODnet

A Little History

Page 4: Semantically supporting data discovery, markup and aggregation in EMODnet

A Little History

Page 5: Semantically supporting data discovery, markup and aggregation in EMODnet

A Little History

Page 6: Semantically supporting data discovery, markup and aggregation in EMODnet

The EMODnet Use Case• Semantic aggregation:

1. Deciding what the aggregated parameter is (and what it is to be called)

2. Deciding which “Parameter Usage Vocabulary” codes are to be included in the aggregated parameter

3. Providing this information to the aggregation software

4. Aggregating the data

Page 7: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

• Two fold:

•Develop semantic aggregation work from other projects

•Expose the underlying semantic model beneath the “BODC Parameter Usage Vocabulary (P01)”

Page 8: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Page 9: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

• RDF/XML driver file always accessible from the URL of the aggregation parameter

• Application software could either call the URL in real time or cache the knowledge base

• Governance simply(!) supplies agreed aggregation names and their mappings to P01

• BODC keeps governance informed of additions to P01 so mappings can be kept up to date.

http://vocab.nerc.ac.uk/collection/P35/current/

Page 10: Semantically supporting data discovery, markup and aggregation in EMODnet

NETMAR Aggregation / Validation

Latitude

Depth

Temperature

Salinity

Sea Water Density

Calculation

http://netmar.nersc.no

Page 11: Semantically supporting data discovery, markup and aggregation in EMODnet

NETMAR Aggregation / Validation

Latitude

Depth

Temperature

Wave period

Sea Water Density

Calculation

http://netmar.nersc.no

Page 12: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Concentration of tributyltin cation {tributylstannyl TBT+ CAS 36643-28-4} per unit dry weight of biota {Mytilus

galloprovincialis (ITIS: 79456: WoRMS 140481) [Subcomponent: flesh]}

http://vocab.nerc.ac.uk/collection/P01/current/MMUSDTBT/

Page 13: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Concentration of tributyltin cation {tributylstannyl TBT+ CAS 36643-28-4} per unit dry weight of biota {Mytilus

galloprovincialis (ITIS: 79456: WoRMS 140481) [Subcomponent: flesh]}

Page 14: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution{"measurement":"Concentration", "substance": {

"primaryName":"tributyltin cation","synonym":["tributylstannyl","TBT+"],"CAS":"36643-28-4"},

"measurementMatrixRelationship":"per unit dry weight of the", "matrix":"biota", "organism": {

"taxon":"Mytilus galloprovincialis","aphiaID":"140481","name":"unspecified ","gender":"unspecified","stage":"unspecified","part":"flesh","specifics":"unspecified"},

"technique":"unspecified", "definition":"Unavailable"}

Page 15: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

MDMAP014 ALKYSPTX PHXXPR01 TCO2C1TX

Measurement

Concentration Total alkalinity pH Concentratio

n

Substance carbon (total inorganic) {TCO2}

n/a n/a carbon (total inorganic) {TCO2}

Relationship

per unit mass of the

per unit volume of the

per unit volume of the

per unit mass of the

Matrix

water body [dissolved plus reactive particulate phase]

water body water body

water body [dissolved plus reactive particulate phase>0.2um]

Analysis n/a spectrophotometry

pH electrode

Page 16: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

P01 URI

MarineSpecies

S25 URI

Page 17: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Substance Or Taxon

Property Kind

MarineSpecies

P01 URI

S25 URI

Page 18: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Substance Or Taxon

Property Kind

P01 URI#organism

#substance

ChEBIMarineSpecies

Page 19: Semantically supporting data discovery, markup and aggregation in EMODnet

The NERC Vocabulary Server solution

Substance Or Taxon

Property Kind Matrix Technique

P01 URI#organism

#substance#matrix #technique

ChEBIMarineSpecies

https://github.com/adamml/semantic_model

Page 20: Semantically supporting data discovery, markup and aggregation in EMODnet

Benefits• Easily integrated into software

• Ocean Data View • SISSVoc • Drupal

• Fits the Linked Data model• Which we’ve been exploring with:• Biological & Chemical Data Management Office• Rolling Deck to Repository • and others

http://odv.awi.de/https://www.seegrid.csiro.au/wiki/Siss/SISSVoc

http://linked.bco-dmo.org/ http://linked.rvdata.us/

Page 21: Semantically supporting data discovery, markup and aggregation in EMODnet

Benefits

Page 22: Semantically supporting data discovery, markup and aggregation in EMODnet

Benefits

Journal of Ocean Technology 8(3):7-12https://github.com/adamml/LinkedOceanDataCloud

Page 23: Semantically supporting data discovery, markup and aggregation in EMODnet

Summary• NERC Vocabulary Server• Existing NVS uses allow for semantic

aggregation of data• But lacking ability to discover which concepts

can be marked up• This will be achieved by exposing the

underlying semantic model• Collaboration through Ocean Data Interoperability

Platform (ODIP)• Compatible with CSIRO work

[email protected] & [email protected]