Distributed Biodiversity Information Databases A. Townsend Peterson.

Post on 05-Jan-2016

214 views 1 download

Tags:

Transcript of Distributed Biodiversity Information Databases A. Townsend Peterson.

Distributed Biodiversity Distributed Biodiversity Information DatabasesInformation Databases

A. Townsend PetersonA. Townsend Peterson

Paris MuseumParis Museum

British MuseumBritish Museum

Field MuseumField Museum

KU MuseumKU Museum

““World Museum”World Museum”

FishUniversity of Florida

FishTulane University

FishUniversity of Michigan

Fish“World Museum”

Making Simple Databases BETTERMaking Simple Databases BETTER

GeoreferencingGeoreferencing

Standardization of taxonomyStandardization of taxonomy

Error-detection and cleaningError-detection and cleaning

Integration with the ‘World Museum’Integration with the ‘World Museum’

Data Sharing - Issues of Data Sharing - Issues of ImportanceImportance

SecuritySecurity

OwnershipOwnership

ControlControl

Updates of informationUpdates of information

Funding and chargingFunding and charging

Safeguards ISafeguards I

Legal disclaimer:Legal disclaimer:– No for-profit uses without permission of data No for-profit uses without permission of data

owners (curators)owners (curators)– No repackaging and redistribution without No repackaging and redistribution without

permission of data owners (curators)permission of data owners (curators)– Data owners not responsible for data quality Data owners not responsible for data quality

or accuracyor accuracy– Negative data do not apply – absence of Negative data do not apply – absence of

records is not indicative of absence of records is not indicative of absence of species, etc.species, etc.

Safeguards IISafeguards IIData remain at the owner institution – no Data remain at the owner institution – no centralization involvedcentralization involvedNo hacking – data are isolated from the No hacking – data are isolated from the original/master datasetoriginal/master datasetInstitutions may restrict access to classes of data, Institutions may restrict access to classes of data, e.g.,e.g.,– Data from particular regions or taxa not servedData from particular regions or taxa not served– Particular fields for sensitive species not servedParticular fields for sensitive species not served– Data for particular collectors or time periods not Data for particular collectors or time periods not

servedserved– Etc.Etc.

AdvantagesAdvantages

Data ownership retained by institution that holds Data ownership retained by institution that holds primary voucher specimenprimary voucher specimenData are updated as often as wished … daily, if Data are updated as often as wished … daily, if preferred by owner institutionpreferred by owner institutionData quality improves over timeData quality improves over timeDetailed reporting of use of collections data to Detailed reporting of use of collections data to data owners (soon!)data owners (soon!)Free and open access to users worldwideFree and open access to users worldwideCommunity cooperation opens many doorsCommunity cooperation opens many doors

Construction of the NABIN NetworkConstruction of the NABIN Network

Data Sets in the NABIN Network

0

2

4

6

8

10

12

14

16

Jun-97 Sep-97 Jan-98 Apr-98 Jul-98 Nov-98

Date

Nu

mb

er

of

da

ta s

ets

Construction of the NABIN NetworkConstruction of the NABIN Network

Data Points in the NABIN Network0

200000

400000

600000

800000

1000000

1200000

1400000

Jun-97 Sep-97 Jan-98 Apr-98 Jul-98 Nov-98

Date

TSA Use ITSA Use I

Monthly Use of TSA

0

10000

20000

30000

40000

50000

60000

70000

Feb-99 May-99 Aug-99 Dec-99 Mar-00 Jun-00 Oct-00 Jan-01 Apr-01 Jul-01 Nov-01

Date

Nu

mb

er o

f h

its

Evolution of TechnologyEvolution of Technology

Dublin Core/Darwin CoreDublin Core/Darwin Core

Z39.50Z39.50

DiGIRDiGIR

TAPIRTAPIR

DiGIRDiGIR

Distributed Generic Distributed Generic Information Information Retrieval Retrieval

http://http://digir.sourceforge.netdigir.sourceforge.net//

Distributed Biodiversity Information NetworksDistributed Biodiversity Information Networks

REMIB http://www.conabio.gob.mx/REMIB http://www.conabio.gob.mx/SpeciesLink http://splink.cria.org.br/SpeciesLink http://splink.cria.org.br/MaNIS http://elib.cs.berkeley.edu/manis/ MaNIS http://elib.cs.berkeley.edu/manis/ HerpNET http://www.herpnet.org/ HerpNET http://www.herpnet.org/ ORNIS http://ornisnet.org/ ORNIS http://ornisnet.org/ AVH http://www.chah.gov.au/avh/ AVH http://www.chah.gov.au/avh/ GBIF http://www.gbif.net/ GBIF http://www.gbif.net/ ATREE http://www.ecoinfoindia.org ATREE http://www.ecoinfoindia.org

Primary Species’ Occurrence Data

Recordings, images, videos

Field notes, otherancillary information

Stomach contents, etc.

Scientific literature

Geospatial datadescribing locality

Parasites etc.

Stable isotope data

Gene sequence data

Remote-sensingdata showing

locality in space and time

☺☺

Taxonomic data