Post on 17-Dec-2014
description
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Identity in research data publication
1
Gudmundur ‘Mummi’ Thorisson<gt50@le.ac.uk>
Brookes lab
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Non-unique names are a majorproblem in the scholarly literature
2
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Non-unique names are a majorproblem in the scholarly literature
2
Are these authors all the same person?G. Thorisson, University of LeicesterG. A. Thorisson, University of LeicesterG. A. Thorisson, Cold Spring Harbor Laboratory
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Non-unique names are a majorproblem in the scholarly literature
2
Are these authors all the same person?G. Thorisson, University of LeicesterG. A. Thorisson, University of LeicesterG. A. Thorisson, Cold Spring Harbor Laboratory
J. SmithJ. SmithJ. SmithJ. SmithJ. Smith [etc.]
Or these?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Non-unique names are a majorproblem in the scholarly literature
2
How about these?
Are these authors all the same person?G. Thorisson, University of LeicesterG. A. Thorisson, University of LeicesterG. A. Thorisson, Cold Spring Harbor Laboratory
J. SmithJ. SmithJ. SmithJ. SmithJ. Smith [etc.]
Or these?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Non-unique names are a majorproblem in the scholarly literature
2
How about these?
Are these authors all the same person?G. Thorisson, University of LeicesterG. A. Thorisson, University of LeicesterG. A. Thorisson, Cold Spring Harbor Laboratory
J. SmithJ. SmithJ. SmithJ. SmithJ. Smith [etc.]
Or these?
∼2/3 of the ∼6 million authors in MEDLINE share a last name and first initial with at least one other author, and an ambiguous name refers to ∼8 persons on average.Torvik and Smalheiser. Author name disambiguation in MEDLINE. ACM Transactions on Knowledge Discovery from Data (2009) vol. 3 (3)
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 5
Geoffrey BilderDirector of Strategic Initiatives
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 6
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
• Contributor recognition - attribute published works to the person(s) who contributed to them
7
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Unique identifiers for authors contributors
8
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Unique identifiers for authors contributors
8
automated author disambiguation+
author involvement
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Unique identifiers for authors contributors
8
automated author disambiguation+
author involvement
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Unique identifiers for authors contributors
8
Dec’09: launch of the Open Researcher Contributor Identification Initiative - ORCID
automated author disambiguation+
author involvement
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 9
Centrally-managed informatics infrastructure:i) for researchers to manage & use profileii) for tracking author-to-publication attribution linksiii) interaction with other systems (e.g. publishers, digital libraries
ORCID
F67572010
?
ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.
ORCID ID: G-1442-2009J. Smith, Univ. North Pole
ORCID ID: D-2400-2010J. Smith, Luthor Corporation
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 10
Manuscript submission to journal
Attribution: ORCID ID for author <--> DOI for article
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Why publishers want this
– single sign-on (SSO) for manuscript tracking systems
– Disambiguating contact information for use by editorial offices, royalty payments systems, copyright clearances, etc.
– Automatic updating of email addresses for table of contents (TOC) alerts and other automated email communications
– Automated tools for detecting potential reviewers, including tools for detecting potential conflicts of interest
– Synchronization with publisher web site user profiles and granting researchers customized, privileged access to content based on profiles
– Understanding all of the manifold ways in which an individual “contributes” to a publisher or a field (e.g. As an editor, reviewer, letter writer, conference chair, etc.).
– Etc.
11
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 12
-Verified publication record-streamlined MS submission-attribution for non-traditional scholarly output-Etc.
Why researchers will want this
Tuesday, 22 March 2011
Identity Workshop prep-meeting, Helsinki, January 27 2011
G. A. Thorisson, University of Leicester www.gen2phen.org
13
Tuesday, 22 March 2011
Identity Workshop prep-meeting, Helsinki, January 27 2011
G. A. Thorisson, University of Leicester www.gen2phen.org
13
>150 Organisations
Tuesday, 22 March 2011
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
• Provenance - I trust dataset X generated by a certain J. Smith
• Contributor recognition - publication credit for sharing data
15
Research data asscholarly output
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
• Provenance - I trust dataset X generated by a certain J. Smith
• Contributor recognition - publication credit for sharing data
15
• Access management - control access to sensitive research data
Research data asscholarly output
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Lab meeting Fri 11 Feb 2010 16
Need to IDENTIFY people as theycontribute to
&
access
Internet resources
• The basic identity problem the Internet poses is establishing one party’s identity to another party’s satisfaction through communication across the network.
Weitzner. In Search of Manageable Identity Systems. IEEE Internet Computing (2006) vol. 10 (6) pp. 84-86
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Lab meeting Fri 11 Feb 2010 16
Need to IDENTIFY people as theycontribute to
&
access
Internet resources
• The basic identity problem the Internet poses is establishing one party’s identity to another party’s satisfaction through communication across the network.
Weitzner. In Search of Manageable Identity Systems. IEEE Internet Computing (2006) vol. 10 (6) pp. 84-86
Tuesday, 22 March 2011
G. A. Thorisson www.gen2phen.org
3rd Human Variome Project Meeting, Paris, 10-14 May, 2010
• Access management– Controlling access to non-public resources on the Web
• Analytical resources - incl. high-performance computing clusters
• Potentially identifiable biomedical data
• Contribution tracking– Data submissions to central repositores
– Data curation / micro-attribution
– Bio-resource impact factor + nanopublications
17
Data-related applications for an online digital identity (a.k.a ‘researcher IDs’)
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
The data sharing problem
18
From http://www.nature.com/news/specials/datasharing/
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
The data sharing problem
18
From http://www.nature.com/news/specials/datasharing/
analysedsynthesisedinterpreted
Information
published
Knowledge
Publication
Data
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 19
Publishing a journal article
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 19
Publishing a dataset
Publishing a journal article
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Outcome 3/3
20
DOI <-> ORCID ID
• Thorisson, G. (A-883-2010), Bilder, G.W. (C-035-2009) and Fenner, M. (A-101-2010). Icelandic 9th century viking bowl. Psychoceramics Archive. Sep 2 2010.doi:10.4259/psycho.5gtpq-thorisson
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
Outcome 3/3
20
DOI <-> ORCID ID
• A-883-2010 <created> 10.4259/psycho.5gtpq-thorisson
• C-035-2009 <created> 10.4259/psycho.5gtpq-thorisson
• A-101-2010 <created> 10.4259/psycho.5gtpq-thorisson
• Thorisson, G. (A-883-2010), Bilder, G.W. (C-035-2009) and Fenner, M. (A-101-2010). Icelandic 9th century viking bowl. Psychoceramics Archive. Sep 2 2010.doi:10.4259/psycho.5gtpq-thorisson
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Lab meeting Fri 11 Feb 2010
Cafe RouGE
21
10
1. Diagnostic laboratories
2. Central mutation depot
3. End-users (e.g. LSDB curators)
Publish data
Retrieve RSS feeds
•Digital IDs for security / access management
•Attribution for published data, via digital IDs
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011
• Who contributed to dataset 10.4259/psycho.5gtpq-thorisson?
• All data publications by A-883-2010 ?
• Which papers have cited the works of A-883-2010 ?
• Total no. citations to datasets by A-883-2010 in the last 2 years?
• Total no. downloads of datasets by A-883-2010?
• [....]
22
G. Thorisson, Univ. Leicestergthorisson@gmail.com
ORCID ID: A-883-2010
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 23
A digital identity for researchers centred on
scholarly profile?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 23
ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.
http://mummi.myopenid.com
A digital identity for researchers centred on
scholarly profile?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 23
ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.
http://mummi.myopenid.com
A digital identity for researchers centred on
scholarly profile?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 23
ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.
http://mummi.myopenid.com
A digital identity for researchers centred on
scholarly profile?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 23
ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.
http://mummi.myopenid.com
A digital identity for researchers centred on
scholarly profile?
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Lab meeting Fri 11 Feb 2010
Coming autumn 2011, to a venue near you! Int’l workshop on researcher identity
• Co-organized by CSC (Finland IT Centre for Science)
• Provisional title: “Identity in research infrastructure and scientific communication" - IRISC
• Location: Helsinki
• Time: September 12-13
24
Tuesday, 22 March 2011
G. A. Thorisson, University of Leicester
Identity Workshop prep-meeting, Helsinki, January 27 2011 25
GEN2PHEN Consortiumhttp://www.gen2phen.org/about-gen2phen/partners
Prof Anthony J. Brookes Bioinformatics Group
This work has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013)under grant agreement number 200754 - the GEN2PHEN project.
Acknowledgements
Contact me! Gudmundur A. Thorisson
<gt50@le.ac.uk>http://friendfeed.com/mummi
http://www.linkedin.com/in/mummi
Tuesday, 22 March 2011