Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman,...

68
University of Cambridge Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016

Transcript of Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman,...

Page 1: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

University of

Cambridge

Introduction to

Research Data Management

Mary Kattuman, Claire Sewell, Marta Teperek

11/05/2016

Page 2: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Today:

Mixture of activities and talking

Introduction

1. Backup and exchange strategies

2. How to organise your data well

3. Data sharing

4. …how to avoid problems => data management plans

We will send you the slides

Page 3: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

To start with…

Do you have any questions about data

management that you hope that will be

addressed during this workshop?

Page 4: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Part 1:Data backup and data

exchange strategies

Page 5: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Disastrous data loss…

Credit:

Peter Murray-Rust

http://blogs.ch.cam.ac.uk/pmr/2011/

08/01/why-you-need-a-data-

management-plan/

August 2011, CC-BY

Department of Chemistry,

University of Cambridge

Page 6: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

How much of your data would you lose if…?

Page 7: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

your laptop got stolen

your lab/office burnt

you've lost your USB stick

your portable hard drive got damaged

data from your Dropbox/Googledrive account

disappeared

5 mins

How much of your data would you lose if…?

Page 8: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Backup strategies:

Departmental backup system

External drives

Online backups

At least two backups, at two different

locations

Page 9: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

At least 2 backups at 2 locations:

Free software to manage backups (there is plenty of free software):

http://www.2brightsparks.com/download-syncbackfree.html

Every Monday

morning

Everyday at 10am

(automated!)

Page 10: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

At least 2 backups at 2 locations:

Store at home!

Free software to manage backups (there is plenty of free software):

http://www.2brightsparks.com/download-syncbackfree.html

Your

departmental

server

Page 11: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

At least 2 backups at 2 locations:

Store at home!

Free software to manage backups (there is plenty of free software):

http://www.2brightsparks.com/download-syncbackfree.html

Your

departmental

server

Shiny new exciting data!

Page 12: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

At least 2 backups at 2 locations:

Store at home!

Free software to manage backups (there is plenty of free software):

http://www.2brightsparks.com/download-syncbackfree.html

Copy ASAP!Your

departmental

server

Page 13: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

File sharing:

Google Drive/Dropbox… - cautious!

o Do not use cloud storage to store restricted data

E-mail

Website/Moodle

Sharepoint

FTP/SFTP

University of Cambridge Microsoft

OneDrive: 1TB of space for everyone

http://www.uis.cam.ac.uk/ees/onedrive

Page 14: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

http://www.uis.cam.ac.uk/ees/onedrive

Questions: [email protected]

Page 15: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Part 2:Data organisation

Page 16: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Data organisation:

Examples A and B

Which example is better, and why?

What are good and bad features?

Your own example – how can it be improved?

3 mins

Page 17: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Which example is better?

Page 18: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Data organisation:

consistent

meaningful to you and your colleagues

allow you to find files easily

would you be able to easily get hold of

your own data?

Page 19: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Copyright: http://www.vukovicnikola.info/folder-structure-for-research/

Page 20: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Organisation of physical samples:

Page 21: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Organisation of physical samples:

create maps of your sampleso can be simple Excel spreadsheets

o and keep them up to date!

reference your samples:o date in the lab books

o supplier’s name/code

add any relevant notes

Page 22: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

File naming conventions – why

matter?

Copyright: http://10pm.com/

******

Page 23: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

File naming conventions – why

matter?

Would you know in 3 years time what

are all these?

Page 24: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

File naming convention:

http://www.data.cam.ac.uk/files/gdl_tilsdocnaming_v1_20090612.pdf

Page 25: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Part 3:Data sharing

Page 26: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

What is your opinion?

26

Page 27: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

It would be useful if research data

underpinning publications was

available

Page 28: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

I (/my group) regularly share

research data underpinning

publications

Page 29: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Question:

Why it might be a good idea to

share data?

Page 30: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Open Access is a ‘good thing’:

Page 31: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Science relies on the principle that

we share our findings

Page 32: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Non-positive results need to be shared

p-value 0.05: who is going to publish their results?

Page 33: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Protection against misconduct

Page 34: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Protection against misconduct

Page 35: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

Page 36: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

2010

Start of the PhD

2011

1 year of PhD

gone

results not

reproduced

2012

2 years of PhD gone

results not reproduced

Looking for the original

data…

Page 37: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

Page 38: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

Page 39: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

Page 40: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Less time wasted

It took 6 years from the time of the

original publication (2007) to the final

retraction (2013)

Time & resources wasted because data

was not available (not to mention

people’s careers!)

Page 41: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Get access to shared data

https://researchdata.jiscinvolve.org/wp/2016/02/04/932/

Page 42: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Policy landscape for data sharing

Page 43: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

“Publicly funded research data are a public

good (…), which should be made openly

available with as few restrictions as

possible…”

http://www.rcuk.ac.uk/research/datapolicy/

Page 44: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

How to share data?• Store data for (at least) 10 years

• Describe your data

• Deposit your data in suitable data repositories and add a link to your data in your

publication

• NCBI/GEO: http://www.ncbi.nlm.nih.gov/geo/

• For sensitive data:

• UK Data Service: reshare.ukdataservice.ac.uk/ or EGA: www.ebi.ac.uk/ega/home

• Or other repositories (including Cambridge repository): www.data.cam.ac.uk/repository

Page 45: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Exemptions

• Personal/sensitive data

• IP protection/commercial data

Appropriate statement in the publication

needs to explain the reasons for restrictions

Page 46: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Resources for working with personal/sensitive data

• University Ethics website:

• www.research-integrity.admin.cam.ac.uk/research-ethics/

• Dr Rhys Morgan, Research Governance and Integrity Officer:

[email protected]

• MRC guidelines:

• http://www.mrc.ac.uk/documents/pdf/personal-information-in-medical-research/

• ESRC consent form, anonymisation guide, and access control:

• http://www.data-archive.ac.uk/create-manage/consent-ethics/consent?index=3

• http://ukdataservice.ac.uk/manage-data/legal-ethical/anonymisation

• http://ukdataservice.ac.uk/manage-data/legal-ethical/access-control

• Our website (University resources):

• http://www.data.cam.ac.uk/sensitive-data

Page 47: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Some funders actually check it…

Random checks on all publications from 1

May 2015 that acknowledge EPSRC +

sanctions for not sharing

Page 48: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

What do I need to do?

• For every new publication – share what is

shareable & add a link to your data

• Be aware of help available to you at the

University of Cambridge

Page 49: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Cambridge support for data

management and sharing

Page 50: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

www.data.cam.ac.uk

Page 51: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

www.data.cam.ac.uk/funders

Funder names

arranged

alphabetically. Click

on the hyperlink

below to see the

full-length policy.

Key policy

highlights

Date of the last

update/policy check

Page 52: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

How to share research data?

www.data.cam.ac.uk/repository

Page 53: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Discipline-specific repositories

preferred

www.re3data.org

Page 54: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Cambridge data repository

www.data.cam.ac.uk/upload

Page 55: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

www.data.cam.ac.uk/upload

www.data.cam.ac.uk/upload

Submit

We will check your data, upload it into the repository

and send you a link for your paper

Page 56: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Non-positive results can be shared as well

p-value 0.05: who is going to publish their results?

Page 57: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

www.repository.cam.ac.uk

Each submission gets a separate record

Page 58: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

www.repository.cam.ac.uk

Repository is well ‘googleable’

Page 59: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

59

Part 4:How to avoid problems with

data management

Page 60: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Data management plan:

…roadmap to help you not to get

lost with your data

Page 61: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

You now have 3 mins to write your own

data plan

Fill in just the top section and leave the

‘Comments’ section blank

Data management plan:

Page 62: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Now work in pairs and exchange your

plans

You have 3 mins to write down

comments on each other’s plans

Data management plan:

Page 63: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Now you have 2 minutes to exchange

feedback

Data management plan:

Page 64: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

You have created your first data

management plan with comments from

peer-review

(This will be extremely useful when

applying for grants)

Data management plan:

Page 65: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Today’s summary:

We have covered the following:

1. Backup and exchange strategies

2. How to organise your data well?

3. Data sharing

4. How to avoid problems: data management plans

Page 66: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Take-home message:

Page 67: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

Final conclusions:

Data management plan can save you

from a lot of trouble

www.data.cam.ac.uk

[email protected]

Page 68: Introduction to Research Data Management · Introduction to Research Data Management Mary Kattuman, Claire Sewell, Marta Teperek 11/05/2016. Today: Mixture of activities and talking

THANK YOU

Feedback forms + certificates

Questions: [email protected]

Follow us on Twitter: @CamOpenData