Leinfelder Earth Grid Jam2008

EarthGrid: Sharing data across networks. Ben Leinfelder, National Center for Ecological Analysis and Synthesis, University of California Santa Barbara. JAM 2008, October 22nd, 2008.

description

The EarthGrid network gives researchers from around the world seamless and persistent access to shared environmental data that are valuable for synthesis and analysis. Researchers use a consistent system to locate and download datasets housed in disparate, loosely-coupled repositories that have committed to providing a standard communication interface. Evolution of the existing EarthGrid network is underway to increase the breadth and depth of available data.

Transcript of Leinfelder Earth Grid Jam2008

Page 1: Leinfelder Earth Grid Jam2008

EarthGrid: Sharing data across networks

Ben Leinfelder

National Center for Ecological Analysis and Synthesis, University of California Santa Barbara

JAM 2008, October 22nd, 2008

Page 2: Leinfelder Earth Grid Jam2008

“Provide access to disparate data on different networks.”

Page 3: Leinfelder Earth Grid Jam2008

Knowledge Network for Biocomplexity

Page 4: Leinfelder Earth Grid Jam2008

• Distributed data system

• Archive data and metadata

Metacat Data Repository

[Figure: three data tables (columns attr1 | attr2 | ...), each paired with an EML metadata document of the form <eml><dataset>...</dataset></eml>.]

Page 5: Leinfelder Earth Grid Jam2008

Global Metacat Deployments

Page 6: Leinfelder Earth Grid Jam2008

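[Figure: network diagram of Metacat deployments. Labels recovered from the diagram are listed below.]

South African Data Network

• Mozambique, Mapungubwe, Marakele, Kruger, SAEON, Grahamstown, Cape Town, San Parks, Wilderness, Cape Town U, Addo, Karoo, Tsitsikama, Phalabora

• Savannah Cluster, Marine Cluster

Knowledge Network for Biocomplexity (KNB)

• KNB 1, KNB II, PISCO, AND, ... (26), GCE LTER, NCEAS, ESA, OBFS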

Page 7: Leinfelder Earth Grid Jam2008

KNB Global Data Distribution

Page 8: Leinfelder Earth Grid Jam2008

Diverse Data Systems

• KNB Repository

– Experimental data, survey data, spatial raster and vector data

– Ecological Metadata Language (EML)

• KU DiGIR

– Museum specimen collection and taxonomic information

– Darwin Core

– http://www.specifysoftware.org/Informatics/informaticsdigir

Page 9: Leinfelder Earth Grid Jam2008

Synthetic Data Analysis

[Figure: data tables (columns attr1 | attr2 | ...) and EML metadata documents of the form <eml><dataset>...</dataset></eml>. Diagram labels: Identify species; Store specimen data; Publish specimen data; Morpho; Store observation data; Document data; Publish datasets.]

Page 10: Leinfelder Earth Grid Jam2008

EarthGrid Data Providers

• Light-weight interface to underlying systems

• Hide complexity

• Low threshold for implementation

[Figure: data providers a, b, and c connected to EarthGrid.]
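The provider idea on this slide can be illustrated with a small adapter sketch. This is not EarthGrid code: the `Record` type, both adapter classes, and the backend shapes are hypothetical, chosen only to show two differently structured systems exposing one light-weight `search` method. The `catalogNumber` and `scientificName` keys follow Darwin Core term names.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class Record:
    """Uniform result shape returned to every client (hypothetical)."""
    provider: str
    identifier: str
    title: str


class Provider(Protocol):
    """The one method a backend must offer to join the grid in this sketch."""
    def search(self, term: str) -> list[Record]: ...


class MetadataCatalogAdapter:
    """Wraps a backend that stores documents as {identifier: title} (assumed shape)."""

    def __init__(self, name: str, documents: dict[str, str]) -> None:
        self.name = name
        self._documents = documents

    def search(self, term: str) -> list[Record]:
        needle = term.lower()
        return [
            Record(self.name, doc_id, title)
            for doc_id, title in self._documents.items()
            if needle in title.lower()
        ]


class SpecimenTableAdapter:
    """Wraps a backend that stores rows keyed by Darwin Core-style terms (assumed shape)."""

    def __init__(self, name: str, rows: list[dict[str, str]]) -> None:
        self.name = name
        self._rows = rows

    def search(self, term: str) -> list[Record]:
        needle = term.lower()
        return [
            Record(self.name, row["catalogNumber"], row["scientificName"])
            for row in self._rows
            if needle in row["scientificName"].lower()
        ]
```

A client holding either adapter calls `search` the same way and never sees how the backend stores its data, which is the sense of "hide complexity" assumed here.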

Page 11: Leinfelder Earth Grid Jam2008

EarthGrid Data Consumers

• Standard communication protocol

• Common methods across systems

• Allows simplified data access by clients

• Exposes data to more software

[Figure: data consumers x, y, and z connected to EarthGrid.]
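From the consumer side, common methods mean a client can loop over providers without special cases. The sketch below is illustrative only: `federated_search` and the callable-based provider model are assumptions, not part of any EarthGrid client library. It also shows one plausible design choice, which is to report an unreachable provider instead of failing the whole query.

```python
from typing import Callable

# In this sketch a provider is any callable that takes a search term
# and returns a list of matching dataset titles.
SearchFn = Callable[[str], list[str]]


def federated_search(
    providers: dict[str, SearchFn], term: str
) -> tuple[dict[str, list[str]], dict[str, str]]:
    """Run the same search against every provider.

    Returns (results, errors): results maps provider name to its hits,
    errors maps provider name to the message of the exception it raised.
    """
    results: dict[str, list[str]] = {}
    errors: dict[str, str] = {}
    for name, search in providers.items():
        try:
            results[name] = search(term)
        except Exception as exc:  # one failing provider must not hide the others
            errors[name] = str(exc)
    return results, errors
```

Because every provider is called through the same signature, adding a new repository to the query is a one-line change to the `providers` mapping.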

Page 12: Leinfelder Earth Grid Jam2008

The Usual Suspects

✓Search

✓Authenticate

✓Read

✓Write

EarthGridProvider
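The four operations named on this slide can be sketched as an abstract interface plus a toy in-memory implementation. All names and signatures below are hypothetical; the slide names the operations but not their actual EarthGrid service definitions. The sketch assumes that writes require an authenticated session while search and read do not.

```python
import secrets
from abc import ABC, abstractmethod


class EarthGridProviderSketch(ABC):
    """Hypothetical interface covering search, authenticate, read, and write."""

    @abstractmethod
    def authenticate(self, user: str, password: str) -> str: ...

    @abstractmethod
    def search(self, term: str) -> list[str]: ...

    @abstractmethod
    def read(self, identifier: str) -> bytes: ...

    @abstractmethod
    def write(self, session: str, identifier: str, data: bytes) -> None: ...


class InMemoryProvider(EarthGridProviderSketch):
    """Toy provider; a real one would sit in front of a repository."""

    def __init__(self, users: dict[str, str]) -> None:
        self._users = users  # plaintext passwords: acceptable only in a sketch
        self._sessions: set[str] = set()
        self._objects: dict[str, bytes] = {}

    def authenticate(self, user: str, password: str) -> str:
        if self._users.get(user) != password:
            raise PermissionError("invalid credentials")
        token = secrets.token_hex(16)  # opaque session token
        self._sessions.add(token)
        return token

    def search(self, term: str) -> list[str]:
        # Matches on identifiers only, to keep the example short.
        return sorted(i for i in self._objects if term in i)

    def read(self, identifier: str) -> bytes:
        return self._objects[identifier]

    def write(self, session: str, identifier: str, data: bytes) -> None:
        if session not in self._sessions:
            raise PermissionError("write requires an authenticated session")
        self._objects[identifier] = data
```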

Page 13: Leinfelder Earth Grid Jam2008

<attribute id="att.5">
  <attributeName>avesr91</attributeName>
  <attributeLabel>Average Species Richness for 1991</attributeLabel>
  <attributeDefinition>The average species richness for the field in 1991</attributeDefinition>
  <storageType>float</storageType>
  <measurementScale>
    <ratio>
      <unit><standardUnit>dimensionless</standardUnit></unit>
      <precision>0.1</precision>
      <numericDomain id="nd.5">
        <numberType>real</numberType>
        <bounds>
          <minimum exclusive="true">0</minimum>
        </bounds>
      </numericDomain>
    </ratio>
  </measurementScale>
</attribute>
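As an illustration of how software might use such metadata, the sketch below reads the attribute with Python's standard `xml.etree.ElementTree` and checks a value against the declared bound. The helper names `describe_attribute` and `is_valid` are hypothetical, the embedded XML is an abbreviated copy of the attribute above, and the code handles only the ratio-scale case shown on the slide.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the slide's EML attribute (attributeDefinition omitted).
ATTRIBUTE_XML = """
<attribute id="att.5">
  <attributeName>avesr91</attributeName>
  <attributeLabel>Average Species Richness for 1991</attributeLabel>
  <storageType>float</storageType>
  <measurementScale>
    <ratio>
      <unit><standardUnit>dimensionless</standardUnit></unit>
      <precision>0.1</precision>
      <numericDomain id="nd.5">
        <numberType>real</numberType>
        <bounds><minimum exclusive="true">0</minimum></bounds>
      </numericDomain>
    </ratio>
  </measurementScale>
</attribute>
"""


def describe_attribute(xml_text: str) -> dict:
    """Pull the fields a client needs to interpret one data column."""
    root = ET.fromstring(xml_text.strip())
    minimum = root.find("measurementScale/ratio/numericDomain/bounds/minimum")
    return {
        "name": root.findtext("attributeName"),
        "storage_type": root.findtext("storageType"),
        "unit": root.findtext("measurementScale/ratio/unit/standardUnit"),
        "precision": float(root.findtext("measurementScale/ratio/precision")),
        "minimum": float(minimum.text),
        "minimum_exclusive": minimum.get("exclusive") == "true",
    }


def is_valid(value: float, info: dict) -> bool:
    """Check a value against the declared lower bound."""
    if info["minimum_exclusive"]:
        return value > info["minimum"]
    return value >= info["minimum"]
```

With this attribute, a richness value of 0 is rejected because the minimum is declared exclusive, while any positive value passes.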

KNB Software Suite

[Figure: KNB software suite diagram. Labels: Data Analysis; Data Storage; Data Management; Morpho; Metadata (<eml/>); "You are here".]

Page 14: Leinfelder Earth Grid Jam2008

EarthGrid Search in Kepler

Page 15: Leinfelder Earth Grid Jam2008

Species Distribution Modeling

Page 16: Leinfelder Earth Grid Jam2008

Distribution Predictions

[Figure panels: Current, 2020, 2050]

Page 17: Leinfelder Earth Grid Jam2008

DataNetONE (Observation Network for Earth)

• ‘New institution’ for data preservation

Page 18: Leinfelder Earth Grid Jam2008

DataNetONE (Observation Network for Earth)

• Scalable. Flexible. Sustainable.

[Figure: timeline labeled Current, ????, and 30+ years horizon]

Page 19: Leinfelder Earth Grid Jam2008

Acknowledgements

• This material is based upon work supported by:

– The National Science Foundation under Grant Numbers 9980154 (KDI), 0618501 (FIRST) and 0225676 (SEEK).

– The National Center for Ecological Analysis and Synthesis, a Center funded by NSF (Grant Number 0072909), the University of California, and the UC Santa Barbara campus.

– The Andrew W. Mellon Foundation.

• Resources

– http://www.nceas.ucsb.edu/ecoinfo

– http://seek.ecoinformatics.org

– http://knb.ecoinformatics.org

– http://lno.lternet.edu/projects/pasta

– http://sbc.lternet.edu