Dataset Citation and Identifiers

Dataset Citation and Identifiers. Joan Starr, California Digital Library, April 2012, @joan_starr. DOIs, ARKs, and EZID.

description

Presentation at National Center for Atmospheric Research workshop, Bridging Data Lifecycles: Tracking Data Use via Data Citations, April 2012

Transcript of Dataset Citation and Identifiers

Page 1: Dataset Citation and Identifiers

Dataset Citation and Identifiers

Joan Starr
California Digital Library

April 2012
@joan_starr

DOIs, ARKs, and EZID

Page 2: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 3: Dataset Citation and Identifiers

Partnership between CDL | 10 UC campuses | Peer institutions

Provide solutions, services, resources for digital assets

Pool & distribute diverse experience, expertise, & resources

Page 4: Dataset Citation and Identifiers

Data Citation

By barryegan (Vitor Leite) http://www.flickr.com/photos/vixon/116447718/

Page 5: Dataset Citation and Identifiers

What?

• Key identifying elements
• Emerging recommendations
• Variation among the domains

Page 6: Dataset Citation and Identifiers

How?

• Key identifying elements
• Emerging recommendations
• Variation among the domains
• In common: Persistent identifier

Page 7: Dataset Citation and Identifiers

What this means…

Page 8: Dataset Citation and Identifiers

What this means…

Page 9: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 10: Dataset Citation and Identifiers

Identifiers 101

Page 11: Dataset Citation and Identifiers

What is an identifier?

What you see: alphanumeric string (never changes)
Associated with: location of object (such as a URL)

Optional: who, what, when, etc. (i.e., metadata)

By Joelk75: http://www.flickr.com/photos/75001512@N00/2728233597/

Page 12: Dataset Citation and Identifiers

Identifier example

string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV

location: http://www.bologna.edu/biology/xfg/123.xls

metadata
creator: Dr. Felix Kottor
title: Data for chromosomal study of catfish (Ictalurus punctatus)
publisher: University of Bologna
date: 8/31/2011

Page 13: Dataset Citation and Identifiers

Identifier example

string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV

location: http://www.state.edu/ecology/783sdr/123.xls

metadata
creator: Dr. Felix Kottor
title: Data for chromosomal study of catfish (Ictalurus punctatus)
publisher: Dryad Data Repository
date: 10/01/2011
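The metadata fields in the identifier examples are the same elements a dataset citation is built from. The sketch below assembles them in the order "Creator (Year): Title. Publisher. Identifier", which follows one commonly recommended pattern and is an assumption here, not something the slides prescribe. The `DatasetRecord` class and `format_citation` function are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass
class DatasetRecord:
    """Hypothetical container for the fields shown in the identifier example."""
    identifier: str  # the persistent identifier string
    creator: str
    title: str
    publisher: str
    date: str        # written month/day/year, as on the slide


def format_citation(rec: DatasetRecord) -> str:
    """Return 'Creator (Year): Title. Publisher. Identifier' (assumed pattern)."""
    year = rec.date.rsplit("/", 1)[-1]  # last component of month/day/year
    return f"{rec.creator} ({year}): {rec.title}. {rec.publisher}. {rec.identifier}"


record = DatasetRecord(
    identifier="doi:10.9999/FK40K2GTV",
    creator="Dr. Felix Kottor",
    title="Data for chromosomal study of catfish (Ictalurus punctatus)",
    publisher="Dryad Data Repository",
    date="10/01/2011",
)
print(format_citation(record))
```

Because the identifier is the one element every recommendation has in common, it is kept as its own field and never derived from the location.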

Page 14: Dataset Citation and Identifiers

Identifiers 201

By Christi Nielsen http://www.flickr.com/photos/christinielsen/476326980/

Page 15: Dataset Citation and Identifiers

Identifiers 201

• string: doi:10.9999/FK40K2GTV

“prefix”: 10.9999
“suffix”: FK40K2GTV
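As a minimal sketch of the prefix and suffix split, the function below separates a DOI string at its first slash. The name `split_doi` is hypothetical, and the validation is deliberately loose (it only checks for the "10." start of the prefix and a non-empty suffix).

```python
from typing import Tuple


def split_doi(identifier: str) -> Tuple[str, str]:
    """Split a DOI string into (prefix, suffix) at the first '/'."""
    # Drop the optional "doi:" label in front of the DOI name.
    name = identifier[4:] if identifier.lower().startswith("doi:") else identifier
    prefix, sep, suffix = name.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI name: {identifier!r}")
    return prefix, suffix


prefix, suffix = split_doi("doi:10.9999/FK40K2GTV")
print(prefix)  # 10.9999
print(suffix)  # FK40K2GTV
```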

Page 16: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 17: Dataset Citation and Identifiers

EZID: long-term identifiers made easy

Take control of the management and distribution of your research, share and get credit for it, and build your reputation through its collection and documentation.

Primary Functions
1. Create persistent identifiers
2. Manage identifiers over time
3. Manage associated metadata over time
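The three primary functions can be pictured with a small in-memory model. This is only a sketch under stated assumptions: `Registry`, `Identifier`, and their methods are hypothetical names, not the EZID interface. It replays the earlier identifier example, where the location and publisher change while the string stays the same.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Identifier:
    string: str  # never changes once created
    target: str  # current location of the object
    metadata: Dict[str, str] = field(default_factory=dict)


class Registry:
    """Hypothetical in-memory stand-in for an identifier service."""

    def __init__(self) -> None:
        self._records: Dict[str, Identifier] = {}

    def create(self, string: str, target: str, **metadata: str) -> Identifier:
        # 1. Create a persistent identifier.
        if string in self._records:
            raise ValueError(f"identifier already exists: {string}")
        self._records[string] = Identifier(string, target, dict(metadata))
        return self._records[string]

    def update_target(self, string: str, target: str) -> None:
        # 2. Manage the identifier over time: point it at the new location.
        self._records[string].target = target

    def update_metadata(self, string: str, **metadata: str) -> None:
        # 3. Manage the associated metadata over time.
        self._records[string].metadata.update(metadata)

    def lookup(self, string: str) -> Identifier:
        return self._records[string]


registry = Registry()
registry.create(
    "doi:10.9999/FK40K2GTV",
    "http://www.bologna.edu/biology/xfg/123.xls",
    creator="Dr. Felix Kottor",
    publisher="University of Bologna",
)
# The dataset moves: the location and publisher change, the string does not.
registry.update_target("doi:10.9999/FK40K2GTV", "http://www.state.edu/ecology/783sdr/123.xls")
registry.update_metadata("doi:10.9999/FK40K2GTV", publisher="Dryad Data Repository")
print(registry.lookup("doi:10.9999/FK40K2GTV"))
```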

Page 18: Dataset Citation and Identifiers
Page 19: Dataset Citation and Identifiers

http://n2t.net/ezid

Page 20: Dataset Citation and Identifiers

http://n2t.net/ezid

Page 21: Dataset Citation and Identifiers

http://n2t.net/ezid

Page 22: Dataset Citation and Identifiers

http://n2t.net/ezid

Page 23: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 24: Dataset Citation and Identifiers

http://content.cdlib.org/ark:/13030/tf0v19n605/, courtesy of UC Davis Special Collections

DOIs and ARKs

• Both can work like regular hyperlinks.
• Both can refer to a subset or portion of a resource.
• Both become persistent when the target URL is maintained.
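To illustrate how either kind of identifier can work like a regular hyperlink, the sketch below turns an identifier string into a clickable URL by putting a resolver address in front of it. The DOI resolver address is the one shown in the earlier identifier example. Using http://n2t.net/ as the default ARK resolver is an assumption, and `to_url` is a hypothetical helper name.

```python
def to_url(identifier: str, ark_resolver: str = "http://n2t.net/") -> str:
    """Turn 'doi:...' or 'ark:...' into a URL by prepending a resolver."""
    scheme, _, rest = identifier.partition(":")
    scheme = scheme.lower()
    if scheme == "doi" and rest:
        # DOI resolver address as shown in the identifier example.
        return "http://dx.doi.org/" + rest
    if scheme == "ark" and rest:
        # An ARK keeps its "ark:" label inside the URL; the host is an assumption.
        return ark_resolver + "ark:" + rest
    raise ValueError(f"unsupported identifier: {identifier!r}")


print(to_url("doi:10.9999/FK40K2GTV"))
print(to_url("ark:/13030/tf0v19n605", ark_resolver="http://content.cdlib.org/"))
```

The second call reproduces the address of the UC Davis Special Collections item, minus its trailing slash.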

Page 25: Dataset Citation and Identifiers

DOIs vs ARKs

• Case sensitive
• Special feature supports granularity
• Informative
• Less costly

Page 26: Dataset Citation and Identifiers

DOIs vs ARKs: suffix pass-through

• string: ark:/99999/Big4/*

• location: http://x.y.z/foo/Big4/db/*

Page 27: Dataset Citation and Identifiers

DOIs vs ARKs: suffix pass-through

• string: ark:/99999/Big4/table/cell/45-8.txt

• location: http://x.y.z/foo/Big4/db/table/cell/45-8.txt
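Suffix pass-through can be sketched as a lookup that finds the longest registered identifier at the start of the requested string, then appends whatever is left over to the registered location. The `TARGETS` table and `resolve` function are hypothetical; a real resolver may apply additional rules.

```python
from typing import Dict

# Only the base identifier is registered, as on the slide.
TARGETS: Dict[str, str] = {"ark:/99999/Big4": "http://x.y.z/foo/Big4/db"}


def resolve(identifier: str, targets: Dict[str, str] = TARGETS) -> str:
    """Resolve an identifier, passing any unregistered suffix through."""
    candidate = identifier
    while candidate:
        if candidate in targets:
            suffix = identifier[len(candidate):]  # the part not registered
            return targets[candidate] + suffix
        # Drop the last path segment and try the shorter string.
        head, sep, _ = candidate.rpartition("/")
        if not sep:
            break
        candidate = head
    raise KeyError(identifier)


print(resolve("ark:/99999/Big4"))
print(resolve("ark:/99999/Big4/table/cell/45-8.txt"))
```

One registered identifier can therefore address any table, cell, or file below the dataset without a separate registration for each part.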

Page 28: Dataset Citation and Identifiers

DOIs vs ARKs

• Established brand in publishing
• Indexed by major A&I citation databases
• Cannot be deleted
• More costly

Page 29: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 30: Dataset Citation and Identifiers

By jfcherry http://www.flickr.com/photos/67272961@N03/6123892769/

The Life of Data

Page 31: Dataset Citation and Identifiers

A life cycle approach
CDL Curation and Publishing Services

http://www.cdlib.org

• Create, edit, share, and save data management plans
• Open access scholarly publishing services: papers, journals, books, seminars & more
• Curation repository: store, manage, and share research data
• Create and manage persistent identifiers
• Open source add-in for Microsoft Excel as a data collection tool
• An infrastructure to publish and get credit for sharing research data

Data Publication

Page 32: Dataset Citation and Identifiers

Identifiers and the data life cycle

• Track your results
• Get more citations
• Organize your data
• Meet funder requirements

Page 33: Dataset Citation and Identifiers

Dataset Citation & Identifiers

• Data Citation
• Identifiers 101
• Dataset Identification with EZID
• Choosing an Identifier
• Life Cycle Data Management
• Looking ahead

By Brain farts (Joschua) http://www.flickr.com/photos/brainfarts/97676505/

Page 34: Dataset Citation and Identifiers

1. New User Interface.

By Leonard John Matthews http://www.flickr.com/photos/mythoto/3964995003/

Page 35: Dataset Citation and Identifiers
Page 36: Dataset Citation and Identifiers
Page 37: Dataset Citation and Identifiers

2. Growing user community

http://www.cdlib.org/services/uc3/ezid/clients.html

Thanks to Scott Edmunds, GigaScience Journal for input

Page 38: Dataset Citation and Identifiers

2. Growing user community

Page 39: Dataset Citation and Identifiers

3. A&I Indexing

Page 40: Dataset Citation and Identifiers

For more information

EZID
EZID application: http://n2t.net/ezid/
EZID website: http://www.cdlib.org/services/uc3/ezid/
EZID on Twitter: @ezidCDL

Joan Starr: @joan_starr