CU Anschutz Health Science Library Data Services

25
CU Anschutz HSL Data Services C. Tobin Magle, Biomedical Sciences Research Support Specialist http://www.slideshare.net/CTobinMagle/cu-anschutz-health- science-library-data-services

Transcript of CU Anschutz Health Science Library Data Services

Page 1: CU Anschutz Health Science Library Data Services

CU Anschutz HSL Data ServicesC. Tobin Magle, Biomedical Sciences Research Support Specialist

http://www.slideshare.net/CTobinMagle/cu-anschutz-health-science-library-data-services

Page 2: CU Anschutz Health Science Library Data Services

Questions

• Why should I care about data management?

• What do libraries have to do with it?

• What does the HSL provide?

Page 3: CU Anschutz Health Science Library Data Services

Why should I care about data management?

Rinehart, AK. “Getting emotional about data” College & Research Libraries News September 2015 vol. 76 no. 8 437-440

Page 6: CU Anschutz Health Science Library Data Services

17% data is lost per year post publication

doi:10.1016/j.cub.2013.11.014

Page 7: CU Anschutz Health Science Library Data Services

The majority of research data aren’t curated

doi:10.1353/lib.0.0036

<22% NIH grants require a Data Sharing Plan

Page 8: CU Anschutz Health Science Library Data Services

We are losing vast amounts of data

00

0

0

0

0

0

0

0

00

0

0

1

1

1

11

1

11

1

1

1

1

1

1

1

0

00

0

0

0

000

000 0

1

1

1 1

10

Page 9: CU Anschutz Health Science Library Data Services

Research funding is tight

From: The Anatomy of Medical Research:  US and International ComparisonsJAMA. 2015;313(2):174-189. doi:10.1001/jama.2014.15939

NIH

Pharma

Med. Device Companies

Biotech

State/localPrivate funds

Other Fed.

Page 10: CU Anschutz Health Science Library Data Services

Funders want to do more with lessHence, data sharing

http://figshare.com/blog/2015_The_year_of_open_data_mandates/143

Page 11: CU Anschutz Health Science Library Data Services

Whitehouse’s 2013 OSTP

“The Obama Administration is committed to the proposition that citizens deserve easy access to the results of research their tax dollars have paid for. That’s why, in a policy memorandum released today, OSTP Director John Holdren has directed Federal agencies with more than $100M in R&D expenditures to develop plans to make the results of federally funded research freely available to the public—generally within one year of publication.”http://www.whitehouse.gov/blog/2013/02/22/expanding-public-access-results-federally-funded-research

Page 12: CU Anschutz Health Science Library Data Services

NSF post-award requirements

“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees are expected to encourage and facilitate such sharing.”http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/aag_6.jsp#VID4

NIH is preparing to release a similar policy

Page 13: CU Anschutz Health Science Library Data Services

Research Lifecycle: new complications

FormHypothesis

Collect Data

Design Experiment

Previous research

Clean Data

Analyze DataWrite

manuscript

Share findings

1. Increased capability to share2. Funder mandates to share data3. Huge, complex datasets4. Protected information

Complications

Page 14: CU Anschutz Health Science Library Data Services

Research Lifecycle Support

FormHypothesis

Collect Data

Design Experiment

Previous research

Clean Data

Analyze DataWrite

manuscript

Share findings

Metadata

Data Management Plans

Working data storage

Version control

Requires expertise

and infrastructureRepositories/

Privacy

Software/ Computation/Reproducability

Searching/Unique identifiers

Page 15: CU Anschutz Health Science Library Data Services

How are libraries getting involved?

• We don’t make the rules

• We want to provide guidance

• Research data management services

• NLM Administrative Supplements

Page 16: CU Anschutz Health Science Library Data Services

Libraries are changing:

Strength: organizing and finding information• Old role: Finding and cataloging books • New role: Finding and cataloging electronic resources• Informationist’s role: Finding datasets for data

repurposing and helping researchers curate their own

Page 17: CU Anschutz Health Science Library Data Services

Services

• Consultations

• Classes

• Topics• Data Management plans• Research Reproducibility• Metadata• Repositories

See http://hslibrary.ucdenver.edu/research/data-management for more information

Page 18: CU Anschutz Health Science Library Data Services

Librarians are receiving grant funding

Page 19: CU Anschutz Health Science Library Data Services

Informationist Funding

NLM Administrative Supplements for Informationist Services

Purposes:

(1) To enhance collaborative, multi-disciplinary basic and clinical research by integrating an information specialist into the research team in order to improve the capture, storage, organization, management, integration, presentation and dissemination of biomedical research data

(2) To assess and document the value and impact of the informationist’s participation.

http://www.nlm.nih.gov/ep/AdminSupp.html

Page 20: CU Anschutz Health Science Library Data Services

Project backgroundDr. Kechris’s R01 proposal generated miRNA expression data from LXS recombinant inbred mouse panel as a resource for the research community.

Planned to share data in PhenoGen database

Page 21: CU Anschutz Health Science Library Data Services

NLM Informationist Awards

Aims:

1. Make data and code publicly available with appropriate metadata

2. Create tutorials to facilitate data reuse

3. Assess efficacy of Aims 1 + 2

Page 22: CU Anschutz Health Science Library Data Services

Aim 1: Make data/code/metadata public

• Deposit raw miRNA data public repositories• NCBI (SRA/GEO/BioProject/BioSample)• PhenoGen (new functionality to support NGS data)

• Standardize and apply metadata

• Make analysis workflows (R code) available in GitHub

• Repository entry to link all materials from this project• Including tutorials from Aim 2

Page 23: CU Anschutz Health Science Library Data Services

Aim 2: Facilitate data reuse with tutorials

Variety of formats:

• Video Tutorials: Adobe Captivate

• Written tutorials: Blog• https://hslnews.wordpress.com/category/bioinformatics-

bites/

• Guide on the Side: • http://hslibrarytraining.ucdenver.edu

Page 24: CU Anschutz Health Science Library Data Services

Aim 3: Assess efficacy of Aims 1 and 2

• Monitor data usage• Citation• Downloads (Google Analytics)

• Surveys and assessments about tutorials• Are the tutorials helping others use the data?

Page 25: CU Anschutz Health Science Library Data Services

HSL Research Support Services

http://hslibrary.ucdenver.edu/research/

Need help? AskUS!

http://hslibrary.ucdenver.libanswers.com/index.php

[email protected]: 303-724-2114Twitter: @tobinmaglehttp://orcid.org/0000-0003-3185-7034

Contact Information

http://www.slideshare.net/CTobinMagle/cu-anschutz-health-science-library-data-services