Identity in research data publication - meeting with SageCite people march2011

Transcript of Identity in research data publication - meeting with SageCite people march2011

Page 1: Identity in research data publication - meeting with SageCite people march2011

G. A. Thorisson, University of Leicester

Identity Workshop prep-meeting, Helsinki, January 27 2011

Identity in research data publication

Gudmundur ‘Mummi’ Thorisson <[email protected]>

Brookes lab

Tuesday, 22 March 2011

Page 6: Identity in research data publication - meeting with SageCite people march2011

Non-unique names are a major problem in the scholarly literature

Are these authors all the same person?

G. Thorisson, University of Leicester
G. A. Thorisson, University of Leicester
G. A. Thorisson, Cold Spring Harbor Laboratory

Or these?

J. Smith
J. Smith
J. Smith
J. Smith
J. Smith [etc.]

How about these?

∼2/3 of the ∼6 million authors in MEDLINE share a last name and first initial with at least one other author, and an ambiguous name refers to ∼8 persons on average.

Torvik and Smalheiser. Author name disambiguation in MEDLINE. ACM Transactions on Knowledge Discovery from Data (2009) vol. 3 (3)
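A minimal sketch of why name strings alone are ambiguous, assuming bylines of the form "Initials Lastname, Affiliation" as on the slide. The helper names name_key and group_by_key are hypothetical, and the key simply mirrors the "last name and first initial" criterion in the MEDLINE statistic quoted above; it is not the disambiguation method of the cited paper.

```python
from collections import defaultdict

def name_key(byline):
    """Reduce 'G. A. Thorisson, University of Leicester' to ('thorisson', 'g')."""
    name = byline.split(",", 1)[0].strip()   # drop the affiliation, if any
    parts = name.replace(".", " ").split()   # e.g. ['G', 'A', 'Thorisson']
    return parts[-1].lower(), parts[0][0].lower()

def group_by_key(bylines):
    """Group bylines that share a last name and first initial."""
    groups = defaultdict(list)
    for byline in bylines:
        groups[name_key(byline)].append(byline)
    return dict(groups)

bylines = [
    "G. Thorisson, University of Leicester",
    "G. A. Thorisson, University of Leicester",
    "G. A. Thorisson, Cold Spring Harbor Laboratory",
    "J. Smith",
    "J. Smith",
]
groups = group_by_key(bylines)
for key, members in groups.items():
    print(key, len(members))
```

The grouping puts all three Thorisson bylines under one key and both J. Smith bylines under another, but it cannot say whether a group is one person or several. That missing piece is what a unique contributor identifier is meant to supply.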

Page 17: Identity in research data publication - meeting with SageCite people march2011

Geoffrey Bilder, Director of Strategic Initiatives

Page 19: Identity in research data publication - meeting with SageCite people march2011

• Contributor recognition - attribute published works to the person(s) who contributed to them

Page 23: Identity in research data publication - meeting with SageCite people march2011

Unique identifiers for authors / contributors

automated author disambiguation + author involvement

Dec ’09: launch of the Open Researcher Contributor Identification Initiative - ORCID

Page 24: Identity in research data publication - meeting with SageCite people march2011

ORCID

Centrally-managed informatics infrastructure:
i) for researchers to manage & use profile
ii) for tracking author-to-publication attribution links
iii) interaction with other systems (e.g. publishers, digital libraries)

ORCID ID: B-1242-2010
G. Thorisson, Univ. Leicester
G. A. Thorisson, Univ. Leicester
G. A. Thorisson, Cold Spring Harbor Lab.

ORCID ID: G-1442-2009
J. Smith, Univ. North Pole

ORCID ID: D-2400-2010
J. Smith, Luthor Corporation
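The profile idea on this slide can be sketched as a small lookup table, assuming each identifier simply maps to the bylines its owner has claimed. The registry dictionary and the ids_for_name helper are hypothetical illustrations, not part of any ORCID system, and the IDs are the mock-up values shown on the slide.

```python
# Hypothetical in-memory registry: one ID per person, many bylines per ID.
registry = {
    "B-1242-2010": [
        "G. Thorisson, Univ. Leicester",
        "G. A. Thorisson, Univ. Leicester",
        "G. A. Thorisson, Cold Spring Harbor Lab.",
    ],
    "G-1442-2009": ["J. Smith, Univ. North Pole"],
    "D-2400-2010": ["J. Smith, Luthor Corporation"],
}

def ids_for_name(registry, name):
    """Return every ID whose profile lists a byline with exactly this name."""
    return sorted(
        person_id
        for person_id, bylines in registry.items()
        if any(b.split(",", 1)[0].strip() == name for b in bylines)
    )

print(ids_for_name(registry, "J. Smith"))
print(ids_for_name(registry, "G. A. Thorisson"))
```

Looking up "J. Smith" returns two different IDs, while the three Thorisson name variants all resolve to a single ID. The identifier, not the name string, carries the identity.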

Page 25: Identity in research data publication - meeting with SageCite people march2011

Manuscript submission to journal

Attribution: ORCID ID for author <--> DOI for article

Page 26: Identity in research data publication - meeting with SageCite people march2011

Why publishers want this

– Single sign-on (SSO) for manuscript tracking systems

– Disambiguating contact information for use by editorial offices, royalty payments systems, copyright clearances, etc.

– Automatic updating of email addresses for table of contents (TOC) alerts and other automated email communications

– Automated tools for detecting potential reviewers, including tools for detecting potential conflicts of interest

– Synchronization with publisher web site user profiles and granting researchers customized, privileged access to content based on profiles

– Understanding all of the manifold ways in which an individual “contributes” to a publisher or a field (e.g. as an editor, reviewer, letter writer, conference chair, etc.).

– Etc.

Page 27: Identity in research data publication - meeting with SageCite people march2011

Why researchers will want this

– Verified publication record

– Streamlined manuscript submission

– Attribution for non-traditional scholarly output

– Etc.

Page 29: Identity in research data publication - meeting with SageCite people march2011

www.gen2phen.org

>150 Organisations

Page 32: Identity in research data publication - meeting with SageCite people march2011

Research data as scholarly output

• Provenance - I trust dataset X generated by a certain J. Smith

• Contributor recognition - publication credit for sharing data

• Access management - control access to sensitive research data

Page 33: Identity in research data publication - meeting with SageCite people march2011

Lab meeting Fri 11 Feb 2010

Need to IDENTIFY people as they contribute to & access Internet resources

• The basic identity problem the Internet poses is establishing one party’s identity to another party’s satisfaction through communication across the network.

Weitzner. In Search of Manageable Identity Systems. IEEE Internet Computing (2006) vol. 10 (6) pp. 84-86

Page 35: Identity in research data publication - meeting with SageCite people march2011

3rd Human Variome Project Meeting, Paris, 10-14 May, 2010

Data-related applications for an online digital identity (a.k.a. ‘researcher IDs’)

• Access management
– Controlling access to non-public resources on the Web
  • Analytical resources - incl. high-performance computing clusters
  • Potentially identifiable biomedical data

• Contribution tracking
– Data submissions to central repositories
– Data curation / micro-attribution
– Bio-resource impact factor + nanopublications

Page 37: Identity in research data publication - meeting with SageCite people march2011

The data sharing problem

From http://www.nature.com/news/specials/datasharing/

[Diagram labels: Data; analysed, synthesised, interpreted; Information; published; Knowledge; Publication]

Page 39: Identity in research data publication - meeting with SageCite people march2011

Publishing a journal article

Publishing a dataset

Page 41: Identity in research data publication - meeting with SageCite people march2011

Outcome 3/3

DOI <-> ORCID ID

• Thorisson, G. (A-883-2010), Bilder, G.W. (C-035-2009) and Fenner, M. (A-101-2010). Icelandic 9th century viking bowl. Psychoceramics Archive. Sep 2 2010. doi:10.4259/psycho.5gtpq-thorisson

• A-883-2010 <created> 10.4259/psycho.5gtpq-thorisson

• C-035-2009 <created> 10.4259/psycho.5gtpq-thorisson

• A-101-2010 <created> 10.4259/psycho.5gtpq-thorisson
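The step from the example citation to the three <created> statements can be sketched as follows. This assumes contributor IDs appear in parentheses after each name and the DOI follows a "doi:" prefix, as in the slide's example. The regular expressions match only the illustrative ID shape used on the slide, and attribution_links is a hypothetical helper name.

```python
import re

CITATION = (
    "Thorisson, G. (A-883-2010), Bilder, G.W. (C-035-2009) and "
    "Fenner, M. (A-101-2010). Icelandic 9th century viking bowl. "
    "Psychoceramics Archive. Sep 2 2010. "
    "doi:10.4259/psycho.5gtpq-thorisson"
)

# Assumed shapes, taken from the slide's mock-up IDs and DOI.
ID_PATTERN = re.compile(r"\(([A-Z]-\d+-\d{4})\)")
DOI_PATTERN = re.compile(r"doi:\s*(\S+)")

def attribution_links(citation):
    """Return one (contributor ID, 'created', DOI) triple per contributor."""
    doi = DOI_PATTERN.search(citation).group(1)
    return [(person_id, "created", doi) for person_id in ID_PATTERN.findall(citation)]

links = attribution_links(CITATION)
for person_id, relation, doi in links:
    print(f"{person_id} <{relation}> {doi}")
```

Each triple links one person to one work, so the same structure would cover a journal article or a dataset, provided both carry a DOI.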

Page 42: Identity in research data publication - meeting with SageCite people march2011

Cafe RouGE

1. Diagnostic laboratories

2. Central mutation depot

3. End-users (e.g. LSDB curators)

Publish data

Retrieve RSS feeds

• Digital IDs for security / access management

• Attribution for published data, via digital IDs

Page 43: Identity in research data publication - meeting with SageCite people march2011

• Who contributed to dataset 10.4259/psycho.5gtpq-thorisson?

• All data publications by A-883-2010?

• Which papers have cited the works of A-883-2010?

• Total no. citations to datasets by A-883-2010 in the last 2 years?

• Total no. downloads of datasets by A-883-2010?

• [....]

G. Thorisson, Univ. [email protected]

ORCID ID: A-883-2010
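The first two questions on this slide can be answered directly from attribution links of the form "ID <created> DOI". The sketch below assumes those links are held as plain triples. LINKS, contributors_of and works_by are hypothetical names, and the data is the mock-up example from the earlier slide.

```python
DOI = "10.4259/psycho.5gtpq-thorisson"

# Attribution links as (contributor ID, relation, DOI) triples.
LINKS = [
    ("A-883-2010", "created", DOI),
    ("C-035-2009", "created", DOI),
    ("A-101-2010", "created", DOI),
]

def contributors_of(links, doi):
    """Who contributed to the work with this DOI?"""
    return [who for who, rel, what in links if rel == "created" and what == doi]

def works_by(links, person_id):
    """All works attributed to this contributor ID."""
    return [what for who, rel, what in links if rel == "created" and who == person_id]

print(contributors_of(LINKS, DOI))
print(works_by(LINKS, "A-883-2010"))
```

The citation and download questions would need further inputs that the slides do not specify, such as citing-paper-to-dataset links and repository usage logs, joined to these triples on the DOI.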

Page 48: Identity in research data publication - meeting with SageCite people march2011

A digital identity for researchers centred on scholarly profile?

ORCID ID: B-1242-2010
G. Thorisson, Univ. Leicester
G. A. Thorisson, Univ. Leicester
G. A. Thorisson, Cold Spring Harbor Lab.

http://mummi.myopenid.com

Page 49: Identity in research data publication - meeting with SageCite people march2011

Coming autumn 2011, to a venue near you! Int’l workshop on researcher identity

• Co-organized by CSC (Finland IT Centre for Science)

• Provisional title: “Identity in research infrastructure and scientific communication” - IRISC

• Location: Helsinki

• Time: September 12-13

Page 50: Identity in research data publication - meeting with SageCite people march2011

Acknowledgements

GEN2PHEN Consortium
http://www.gen2phen.org/about-gen2phen/partners

Prof Anthony J. Brookes Bioinformatics Group

This work has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement number 200754 - the GEN2PHEN project.

Contact me! Gudmundur A. Thorisson

<[email protected]>
http://friendfeed.com/mummi

http://www.linkedin.com/in/mummi