Data and Data Management: Introduction to the BCO-DMO

28
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C. Groman

description

   Data and Data Management: Introduction to the BCO-DMO. Presented to Professor Keiichi Uchida November 16, 2009 Robert C. Groman. Conversation Overview. NSF now says: Your data or your funding Data: – facts, statistics, or items of information; and metadata - PowerPoint PPT Presentation

Transcript of    Data and Data Management: Introduction to the BCO-DMO

Page 1:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 1 of 28

    Data and Data Management:Introduction to the BCO-DMO

Presented to

Professor Keiichi Uchida

November 16, 2009

Robert C. Groman

Page 2:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 2 of 28

Conversation Overview

• NSF now says: Your data or your funding

• Data: – facts, statistics, or items of information; and metadata

• Accessing data: Data discovery, display and retrieval

• Data Interoperability

Page 3:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 3 of 28

Points to make

• Permanent archive of data

• Benefits of early open access to data (with minimum/no restrictions)

• Metadata are data and critical for data reuse

Page 4:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 4 of 28

Venn Diagram:Data and Metadata

All data and information (D)necessary to use the data. Data (d)

Metadata (m)D ≠ m + d

facts, statistics, or

items of information

Page 5:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 5 of 28

Probability Inversely Proportional to Time

Second order effects:

•Length of cruise

•Success of cruise

•Participants

•Immediate activity following the cruise

Page 6:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 6 of 28

Theorems†

• Theorem 1: The probability that all the necessary data and information are collected and preserved to allow another researcher to properly use your data is inversely proportional to time since the data were collected.

• Corollary: Unless data and information are collected and preserved during the experiment (e.g. cruise), subsequent researchers will have a difficult time using your data.

• Theorem 2: The longer the time since the data were collected the less likely the data will be considered “final”.

†Left to the reader as an exercise.

Page 7:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 7 of 28

Seeing Versus Using Someone’s Data

• Maybe you don’t want others to use your data. – Not done publishing papers based on the data– Graduate student is still analyzing the data– It’s not final yet

• Old policies and practices about data archiving• New policies about data publishing and data

archiving– Web accessible– NSF mandate

Page 8:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 8 of 28

Better, higher quality data

The more people look at the data the higher their quality.

Page 9:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 9 of 28

Ocean Observing → Sharing Data

• Northeast Coastal and Ocean Data Partnership (née Gulf of Maine Ocean Data Partnership)

– “… to promote and coordinate the sharing, linking, electronic dissemination, and use of data in the Gulf of Maine region. “

– “… linking databases that are created and individually maintained by Participants ….”– “… develops the web-based, visualization, and other information technologies needed for

the seamless exchange ….”– 24 member organizations consisting of research, educational, non-profit, commercial, and

local, state, and federal agencies.

• Ocean observing systems– Oceans.us: National Office for Integrated and Sustained Ocean Observations – NFRA: National Federation of Regional Associations– NERACOOS (Northeast Coastal Regional Observing System), MACOORA, ….– ORION: Ocean Research Interactive Observatory Networks– GOOS: Global Ocean Observing System

Page 10:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 10 of 28

Biological and Chemical Oceanography Data Management Office

BCO-DMO

• NSF funded 3 year project to provide short and medium term data management, including web based access, to all NSF funded projects from the biological and chemical oceanographic programs

• Large NSF projects are expected to have their own data management offices – a person

• Web site: http://www.bco-dmo.org/

Page 11:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 11 of 28

MapServer interface and interoperability enhancements

• Provides access to geo-referenced scientific data and metadata

• Presents distributed data sets in a unified way• Uses MapServer as the visualization application• Visualize data with graphics generated on-the-fly• Request custom subsets of data in a variety of

file formats – flat file, Matlab, netCDF, WFS, WMS.

• Compare data from different sources

Page 12:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 12 of 28

JGOFS/GLOBEC Data Management System

Page 13:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 13 of 28

http://www.bco-dmo.org/

Page 14:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 14 of 28

Cruise Tracks

Page 15:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 15 of 28

Select 5 Cruises

Page 16:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 16 of 28

Click on “Show Data” Button

Page 17:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 17 of 28

Select CD data in EN307

Page 18:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 18 of 28

Shows stations

Page 19:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 19 of 28

EN307 graph it options

Page 20:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 20 of 28

Depth versus salinity and versus temperature

Page 21:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 21 of 28

Select another cruise: AL9906

Page 22:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 22 of 28

Map it options for abundances

Page 23:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 23 of 28

Graph it option for AL9906

Page 24:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 24 of 28

AL9906 Nutrient/Phytoplankton Plot

Page 25:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 25 of 28

Interoperability features (for free)

Page 26:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 26 of 28

MapServer Supports Interoperability Features

• Open Geospatial Consortium standards– Web Mapping Service (WMS), and

– Show me the data

– Web Feature Service (WFS)– Get me the data

• Retains the functionality of the JGOFS/GLOBEC Data Management System– Download data as ASCII, CSV, Matlab, netCDF

• Will be adding Google Earth output file option

Page 27:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 27 of 28

Metadata Schema

Page 28:    Data and Data Management: Introduction to the  BCO-DMO

November 16, 2009

Page 28 of 28