Research Data Management
-
Upload
sarah-jones -
Category
Technology
-
view
538 -
download
0
description
Transcript of Research Data Management
![Page 1: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/1.jpg)
Funded by:
Research Data ManagementUniversity of East London, 1st May 2013
Sarah JonesDigital Curation Centre
[email protected]: sjDCC
![Page 2: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/2.jpg)
Why are you here?
• You’re managing data (your own or your group's)
• Or you think you maybe should be
• You’re not sure why it matters
• You’re not sure how best to do it
• You’d like to know whether you’re on the right track
Photo: by Orijinal http://www.flickr.com/photos/orijinal/3539418133
![Page 3: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/3.jpg)
Why manage your data?
![Page 4: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/4.jpg)
What if your data fell into the wrong hands?
•http://news.bbc.co.uk/1/hi/uk/8332445.stm
![Page 5: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/5.jpg)
What if you had to produce your data?
![Page 7: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/7.jpg)
Why YOU need a Data Management Plan
What if this was your backpack?
http://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-plan
![Page 8: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/8.jpg)
Good data management is about making informed decisions
![Page 10: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/10.jpg)
Why manage research data?
• To make your research easier!
• To stop yourself drowning in irrelevant stuff
• In case you need the data later
• To avoid accusations of fraud or bad science
• To share your data for others to use and learn from
• To get credit for producing it
• Because somebody else said to do so
![Page 11: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/11.jpg)
RDM policy at UEL
http://www.uel.ac.uk/wwwmedia/services/library/lls/resources/rspresearchtools/Research-Data-Management-policy-for-UEL-FINAL.pdf
![Page 12: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/12.jpg)
Expectations of public access
“Publicly funded research data are a public good, produced in the public interest, which should be
made openly available with as few restrictions as possible in a timely and responsible manner that
does not harm intellectual property.”
RCUK Common Principles on Data Policyhttp://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
![Page 13: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/13.jpg)
•13http://www.bis.gov.uk/innovatingforgrowth
…open data
![Page 14: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/14.jpg)
...personal data
![Page 15: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/15.jpg)
Benefits of sharing data (1)
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
“It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.”
Dr John Trojanowski, University of Pennsylvania
•... scientific breakthroughs
![Page 16: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/16.jpg)
Benefits of sharing data (2)
www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity
... validation of results
“It was a mistake in a spreadsheet that could have been easily overlooked: a few rows left out of an equation to average the values in a column.
The spreadsheet was used to draw the conclusion of an influential 2010 economics paper: that public debt of more than 90% of GDP slows down growth. This conclusion was later cited by the International Monetary Fund and the UK Treasury to justify programmes of austerity that have arguably led to riots, poverty and lost jobs.”
![Page 17: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/17.jpg)
Benefits of sharing data (3)
“There is evidence that studies that make their data available do indeed receive more citations
than similar studies that do not.” Piwowar H. and Vision T.J 2013 "Data reuse and the open data citation advantage“ https://peerj.com/preprints/1.pdf
9% - 30% increase
•... more citations
![Page 18: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/18.jpg)
Things to think about...
Photo by @boetter http://www.flickr.com/photos/jakecaptive/3205277810
![Page 19: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/19.jpg)
What is data management?“the active management and appraisal of data over the lifecycle of scholarly and scientific interest”
Digital Curation Centre
Data management is just part of good research practice
![Page 20: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/20.jpg)
What is involved in RDM?
• Data Management Planning
• Creating data
• Documenting data
• Accessing / using data
• Storage and backup
• Sharing data
• Preserving data
![Page 21: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/21.jpg)
If you plan to share your data....
• Have you got consent for sharing?
• Do any licences you’ve signed permit sharing?
• Is your data in suitable formats?
Decisions made early on affect what you can do later
![Page 22: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/22.jpg)
File formats for long-term access• Unencrypted• Uncompressed• Non-proprietary/patent-encumbered• Open, documented standard• Standard representation (ASCII, Unicode)
Type Recommended Avoid for data sharing
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTFPDF/A only if layout matters
Word
Media Container: MP4, OggCodec: Theora, Dirac, FLAC
QuicktimeH264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
•Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
![Page 23: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/23.jpg)
Documentation
What would someone unfamiliar with your data need in order to find, evaluate,
understand, and reuse them?
Consider the differences between someone inside your research group, someone outside your group but in your field, and someone outside your field.
Two parts: metadata and methods
![Page 24: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/24.jpg)
Metadata
• About the project– Title, people, key dates, funders and grants
• About the data– Title, key dates, creator(s), subjects, rights,
included files, format(s), versions, checksums
• Keep this with the data
![Page 25: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/25.jpg)
Methods• Reason #1 for not reusing someone else’s data: “I don’t know
enough about how it was gathered to trust it.”
• Document what you did. (A published article may not be enough.)
• Document any limitations of what you did.
• If you ran code on the data, document the code and keep it with the data.
• Need a codebook? Or a data dictionary?– If I can’t identify at sight what each bit of your dataset means, yes, you do
need a codebook or data dictionary.– DO NOT FORGET THE UNITS!
![Page 26: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/26.jpg)
Standards
• Why reinvent the wheel? If there’s a standard format for your data or how to describe it, use that!
• The tricky part is finding the right standard.– Standards are like toothbrushes...– But using standards is good hygiene!– Your librarian can often help you find relevant standards.– Also check out the DCC catalogue of disciplinary metadata
http://www.dcc.ac.uk/resources/metadata-standards
![Page 27: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/27.jpg)
Where to store your data?
• Your own drive (PC, server, flash drive, etc.)– And if you lose it? Or it breaks?
• Somebody else’s drive
• Departmental drive
• “Cloud” drive– Do they care as much about your data as you do?
![Page 28: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/28.jpg)
How to backup?
• 3… 2… 1… backup!– at least 3 copies of a file– on at least 2 different media– with at least 1 offsite
• Use managed services where possible e.g. University filestores rather than local or external hard drives
• Ask central IT team for advice
![Page 29: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/29.jpg)
What to keep?
It’s not possible to keep everything. Select based on:– What has to be kept e.g. data underlying publications
– What can’t be recreated e.g. environmental recordings
– What is potentially useful to others
– What has scientific, cultural or historical value
– What legally must be destroyed
– ...
How to select and appraise research data:www.dcc.ac.uk/resources/how-guides/appraise-select-research-data
![Page 30: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/30.jpg)
How to share/preserve data?
• What is required?– By your funder– By your publisher– By your uni
• What subject repositories, data centres and structured databases are available?http://databib.org
![Page 31: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/31.jpg)
Putting the pieces together...
Photo by Dread Pirate Jeff http://www.flickr.com/photos/justageek/2851643792
![Page 32: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/32.jpg)
Data Management Plans
DMPs are often submitted with grant applications, but are useful whenever you are creating data to:
•Make informed decisions to anticipate and avoid problems
•Avoid duplication, data loss and security breaches
•Develop procedures early on for consistency
•Ensure data are accurate, complete, reliable and secure
•Save time and effort – make your life easier!
![Page 33: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/33.jpg)
Which funders require a DMP?
•www.dcc.ac.uk/resources/policy-and-legal/ overview-funders-data-policies
![Page 34: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/34.jpg)
What do research funders want?
• A brief plan submitted in grant applications, and in the case of NERC, a more detailed plan once funded
• 1-3 sides of A4 as attachment or a section in Je-S form
• Typically a prose statement covering suggested themes
• An outline of data management and sharing plans, justifying decisions and any limitations
![Page 35: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/35.jpg)
Five common themes1. Description of data to be collected / created
(i.e. content, type, format, volume...)
2. Standards / methodologies for data collection & management
3. Ethics and Intellectual Property (highlight any restrictions on data sharing e.g. embargoes, confidentiality)
4. Plans for data sharing and access (i.e. how, when, to whom)
5. Strategy for long-term preservation
![Page 36: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/36.jpg)
A useful framework to get started
•Think about why the questions are
being asked
•Look at examples to get an idea of what to include
•www.icpsr.umich.edu/icpsrweb/content/datamanagement/dmp/framework.html
![Page 37: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/37.jpg)
Help from the DCC
•https://dmponline.dcc.ac.uk
•www.dcc.ac.uk/resources/ •how-guides/develop-data-plan
a web-based tool to help you write DMPs according to different requirements
![Page 38: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/38.jpg)
How DMP Online works
Create a plan based on relevant funder /
institutional templates...
...and then answer the questions using the guidance provided
![Page 39: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/39.jpg)
Example plans
• Technical plan submitted to AHRC by Bristol Unihttp://data.bris.ac.uk/files/2013/02/data.bris-AHRC-Technical-Plan-v21.pdf
• Rural Economy & Land Use (RELU) programme exampleshttp://relu.data-archive.ac.uk/data-sharing/planning/examples
• UCSD example DMPs (20+ scientific plans for NSF)http://rci.ucsd.edu/dmp/examples.html
• My DMP – a satire (what not to write!) http://ivory.idyll.org/blog/data-management.html
![Page 40: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/40.jpg)
Tips on writing DMPs
• Keep it simple, short and specific
• Seek advice - consult and collaborate
• Base plans on available skills and support
• Make sure implementation is feasible
• Justify any resources or restrictions needed
http://www.youtube.com/watch?v=7OJtiA53-Fk
![Page 41: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/41.jpg)
Acknowledgement
Thanks in particular to Dorothea Salo, Ryan Schryver and colleagues for content from the “Escaping Datageddon” presentation, available at: http://www.slideshare.net/cavlec/escaping-datageddon
And to the Research360 project at the University of Bath for the “Managing your research data” presentation, available at: http://opus.bath.ac.uk/32296
![Page 42: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/42.jpg)
Thanks – any questions?
DCC guidance, tools and case studies:www.dcc.ac.uk/resources
Follow us on twitter: @digitalcuration and #ukdcc
![Page 43: Research Data Management](https://reader036.fdocuments.us/reader036/viewer/2022070315/555089fab4c905a85c8b4c89/html5/thumbnails/43.jpg)
Exercise
• Use the template to start drafting a DMP
• Discuss your ideas in groups to identify available support and decide the best approaches to follow for your context