CU Anschutz Health Science Library Data Services
-
Upload
c-tobin-magle -
Category
Data & Analytics
-
view
134 -
download
1
Transcript of CU Anschutz Health Science Library Data Services
CU Anschutz HSL Data ServicesC. Tobin Magle, Biomedical Sciences Research Support Specialist
http://www.slideshare.net/CTobinMagle/cu-anschutz-health-science-library-data-services
Questions
• Why should I care about data management?
• What do libraries have to do with it?
• What does the HSL provide?
Why should I care about data management?
Rinehart, AK. “Getting emotional about data” College & Research Libraries News September 2015 vol. 76 no. 8 437-440
More researchers
https://nexus.od.nih.gov/all/2012/06/27/what-weve-learned-about-graduate-students/
More data than ever before
See arXiv:1402.4578 for details
17% data is lost per year post publication
doi:10.1016/j.cub.2013.11.014
The majority of research data aren’t curated
doi:10.1353/lib.0.0036
<22% NIH grants require a Data Sharing Plan
We are losing vast amounts of data
00
0
0
0
0
0
0
0
00
0
0
1
1
1
11
1
11
1
1
1
1
1
1
1
0
00
0
0
0
000
000 0
1
1
1 1
10
Research funding is tight
From: The Anatomy of Medical Research: US and International ComparisonsJAMA. 2015;313(2):174-189. doi:10.1001/jama.2014.15939
NIH
Pharma
Med. Device Companies
Biotech
State/localPrivate funds
Other Fed.
Funders want to do more with lessHence, data sharing
http://figshare.com/blog/2015_The_year_of_open_data_mandates/143
Whitehouse’s 2013 OSTP
“The Obama Administration is committed to the proposition that citizens deserve easy access to the results of research their tax dollars have paid for. That’s why, in a policy memorandum released today, OSTP Director John Holdren has directed Federal agencies with more than $100M in R&D expenditures to develop plans to make the results of federally funded research freely available to the public—generally within one year of publication.”http://www.whitehouse.gov/blog/2013/02/22/expanding-public-access-results-federally-funded-research
NSF post-award requirements
“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees are expected to encourage and facilitate such sharing.”http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/aag_6.jsp#VID4
NIH is preparing to release a similar policy
Research Lifecycle: new complications
FormHypothesis
Collect Data
Design Experiment
Previous research
Clean Data
Analyze DataWrite
manuscript
Share findings
1. Increased capability to share2. Funder mandates to share data3. Huge, complex datasets4. Protected information
Complications
Research Lifecycle Support
FormHypothesis
Collect Data
Design Experiment
Previous research
Clean Data
Analyze DataWrite
manuscript
Share findings
Metadata
Data Management Plans
Working data storage
Version control
Requires expertise
and infrastructureRepositories/
Privacy
Software/ Computation/Reproducability
Searching/Unique identifiers
How are libraries getting involved?
• We don’t make the rules
• We want to provide guidance
• Research data management services
• NLM Administrative Supplements
Libraries are changing:
Strength: organizing and finding information• Old role: Finding and cataloging books • New role: Finding and cataloging electronic resources• Informationist’s role: Finding datasets for data
repurposing and helping researchers curate their own
Services
• Consultations
• Classes
• Topics• Data Management plans• Research Reproducibility• Metadata• Repositories
See http://hslibrary.ucdenver.edu/research/data-management for more information
Librarians are receiving grant funding
Informationist Funding
NLM Administrative Supplements for Informationist Services
Purposes:
(1) To enhance collaborative, multi-disciplinary basic and clinical research by integrating an information specialist into the research team in order to improve the capture, storage, organization, management, integration, presentation and dissemination of biomedical research data
(2) To assess and document the value and impact of the informationist’s participation.
http://www.nlm.nih.gov/ep/AdminSupp.html
Project backgroundDr. Kechris’s R01 proposal generated miRNA expression data from LXS recombinant inbred mouse panel as a resource for the research community.
Planned to share data in PhenoGen database
NLM Informationist Awards
Aims:
1. Make data and code publicly available with appropriate metadata
2. Create tutorials to facilitate data reuse
3. Assess efficacy of Aims 1 + 2
Aim 1: Make data/code/metadata public
• Deposit raw miRNA data public repositories• NCBI (SRA/GEO/BioProject/BioSample)• PhenoGen (new functionality to support NGS data)
• Standardize and apply metadata
• Make analysis workflows (R code) available in GitHub
• Repository entry to link all materials from this project• Including tutorials from Aim 2
Aim 2: Facilitate data reuse with tutorials
Variety of formats:
• Video Tutorials: Adobe Captivate
• Written tutorials: Blog• https://hslnews.wordpress.com/category/bioinformatics-
bites/
• Guide on the Side: • http://hslibrarytraining.ucdenver.edu
Aim 3: Assess efficacy of Aims 1 and 2
• Monitor data usage• Citation• Downloads (Google Analytics)
• Surveys and assessments about tutorials• Are the tutorials helping others use the data?
HSL Research Support Services
http://hslibrary.ucdenver.edu/research/
Need help? AskUS!
http://hslibrary.ucdenver.libanswers.com/index.php
[email protected]: 303-724-2114Twitter: @tobinmaglehttp://orcid.org/0000-0003-3185-7034
Contact Information
http://www.slideshare.net/CTobinMagle/cu-anschutz-health-science-library-data-services