Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru...

2
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1 , P. Dru Clark 1 , Jacob M. Perez 1 , Aaron D. Sweeney 1 , John Helly 1,2 , Karen I. Stocks 2 , and Donald W. Sutton 2 1 Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, USA 2 San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, USA The fully electronic IODP Site Survey Data Bank (SSDB; http://ssdb.iodp.org ) is primarily used to support Site Survey Panels as they evaluate each proposal for drilling, site by site. The SSDB has been in operation since 2005, and has now grown to hold more than 7000 digital objects, such as seismic sections, bottom photographs, bathymetric maps and background reports. By design, the SSDB allows for the secure holding of proprietary data when necessary. Since every object is quality- controlled, geo-referenced, indexed with standard metadata, and stored in a long-term searchable digital library, the Data Bank serves a wider purpose, from conception of proposal ideas, through site evaluation, operations, publication, and future research and education. With this investment in information technology, the system reliably preserves data acquired over many decades. Its flexible and extensible infrastructure makes it applicable to a range of data types and disciplines, allowing it to respond to changing IODP science priorities. To enable searches across all aspects of drilling, metadata are now being mapped for harvest in the next generation Scientific Earth Drilling Information Service (SEDIS) format. In addition, an “SSDB-in-a-Box” development will allow the entire collection to be transported to vessel or a remote office on a laptop. The SSDB shares technology with the SIOExplorer Digital Library of Scripps expeditions dating back to the 1950’s. The most recent related project is the “Rolling Deck to Repository” initiative that seeks to archive all standard underway shipboard data from US research vessels, with up to 500 expeditions per year conducted by 18 operating institutions (http://www.rvdata.us ). Hosted at SDSC machine room for high bandwidth 24/7 operations. System replicated at SIO and deep archived at 3 remote sites across the U.S. The SSDB also archives 33,381 legacy physical objects, including maps, reports, seismic sections, disks, and tapes. Complete Site Characterization: Proponents upload the data they need to support a proposal. Objects on proprietary hold are protected by username/password security. Persistence: Preserving data for the long term as a digital library with metadata, so they can always be discovered. The Site Survey Data Bank Quality Control: Every upload must meet IODP data and metadat requirements, including use of certified site names. Select from a community- accepted list of standard categories. Seafloor features from sidescan data Location map to relate activities, sites, and data sets Subsurface profile from interpreted seismic section Flexible design allows for new data types and domains as needed in the future. SSDB internal QC checklist QC map viewer checks spatial relationships. Data used with permission of Greg Mountain, Rutgers. Go to http://ssdb.iodp.org

Transcript of Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru...

Page 1: Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.

Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB

Stephen P. Miller1, P. Dru Clark1, Jacob M. Perez1, Aaron D. Sweeney1, John Helly1,2, Karen I. Stocks2, and Donald W. Sutton2

1Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, USA2San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, USA

The fully electronic IODP Site Survey Data Bank (SSDB; http://ssdb.iodp.org) is primarily used to support Site Survey Panels as they evaluate each proposal for drilling, site by site. The SSDB has been in operation since 2005, and has now grown to hold more than 7000 digital objects, such as seismic sections, bottom photographs, bathymetric maps and background reports. By design, the SSDB allows for the secure holding of proprietary data when necessary. Since every object is quality-controlled, geo-referenced, indexed with standard metadata, and stored in a long-term searchable digital library, the Data Bank serves a wider purpose, from conception of proposal ideas, through site evaluation, operations, publication, and future research and education. With this investment in information technology, the system reliably preserves data acquired over many decades. Its flexible and extensible infrastructure makes it applicable to a range of data types and disciplines, allowing it to respond to changing IODP science priorities.  To enable searches across all aspects of drilling, metadata are now being mapped for harvest in the next generation Scientific Earth Drilling Information Service (SEDIS) format. In addition, an “SSDB-in-a-Box” development will allow the entire collection to be transported to vessel or a remote office on a laptop. The SSDB shares technology with the SIOExplorer Digital Library of Scripps expeditions dating back to the 1950’s. The most recent related project is the “Rolling Deck to Repository” initiative that seeks to archive all standard underway shipboard data from US research vessels, with up to 500 expeditions per year conducted by 18 operating institutions (http://www.rvdata.us).

Hosted at SDSC machine room for high bandwidth 24/7 operations. System replicated at SIO and deep archived at 3 remote sites across the U.S.

The SSDB also archives 33,381 legacy physical objects, including maps, reports, seismic sections, disks, and tapes.

Complete Site Characterization: Proponents upload the data they need to support a proposal. Objects on proprietary hold are protected by username/password security.

Persistence: Preserving data for the long term as a digital library with metadata, so they can always be discovered.

The Site Survey Data Bank

Quality Control: Every upload must meet IODP data and metadatarequirements, including use of certified site names.

Select from a community-accepted list of standard categories.

Seafloor features from sidescan data

Location mapto relate activities, sites, and data sets

Subsurface profilefrom interpreted seismic section

Flexible design allows for new data types and domains as needed in the future.

SSDB internal QC checklist

QC map viewer checks spatial relationships.

Data used with permission of Greg Mountain, Rutgers. Go to http://ssdb.iodp.org

Page 2: Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.

The SSDB Team: The SSDB is developed by a team from the Scripps Institution of Oceanography and the San Diego Supercomputer Center, at the University of California, San Diego, under a contract with IODP-MI, funded by the NSF.

Stephen Miller, John Helly, Don Sutton, Jake Perez, Caryn Neiswender, Chris Massell Symons (top) Dru Clark, Aaron Sweeney, Karen Stocks, students Andrea Cardenas, Goldy Thach, Jenny Smith, Katie Foster (bottom)

Review Panels: Decision support system provides access to site-specific data for each proposal. INTViewer enables interactive display of seismic data across the web.

Seismic INTViewer

Mobile Support: SSDB-In-a-Box is a laptop for remote use at meetings or on vessels, independent of Internet connection.

Researchers: Furthering science through data discovery and download. Search by Proposal, Expedition, Leg, data type, format, lat/lon, date, etc.

Supporting IODP Science

Operations and Expeditions: Packaging the data needed to make on-site operational decisions.

Integrating into SEDIS: Entire SSDB Collection can be searched by SEDIS.

SSDB

Related Projects Sharing technology and resources

SIOExplorer Digital Library - 1000 SIO cruises since 1950’s http://SIOExplorer.ucsd.edu

Rolling Deck To Repository (R2R) - Archiving all US routine underway shipboard data, ~ 30 vessels, hundreds of cruises/year http://www.rvdata.us Marine Metadata Interoperability Project (MMI) – Community collaborations advancing marine data integration and re-usehttp://marinemetadata.org

Data used with permission of Peter Clift, University of Aberdeen.