Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1....

30
Introduc)on to Data Management Chris’e Wiley Sarah C. Williams Heidi Imker

Transcript of Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1....

Page 1: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Introduc)on  to    Data  Management    

 Chris'e  Wiley  

Sarah  C.  Williams  Heidi  Imker  

Page 2: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!
Page 3: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Data  Management  Workshop  Series  

•  Introduc)on  to  Data  Management  –  Feb  10th    4PM  –  5PM  –  Apr  6th  1PM  –  2PM  

•  Documenta)on  and  Organiza)on  for  Data  and  Processes  –  Feb  17th    4PM  –  5PM  –  Apr  13th    1PM  –  2PM  

•  Making  Research  Data  Public:  Why,  What,  and  How  –  Feb  24th  4PM  –  5PM  –  Apr  20th    1PM  –  2PM  

Page 4: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Intro  to  Data  Management  Objec'ves  

1.  Learn  the  benefits  of  data  management  2.  Learn  the  components  of  data  management  3.  Iden'fy  gaps  in  your  current  prac'ces  4.  Know  where  to  find  help  

Page 5: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Data  Management  Prac'ces  •     complete  records    •     find    

•     understand    

•     back-­‐up    

•     migrate  

Page 6: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Data  Management  Prac'ces  •  I’ve  lost  two  patent  claims  because  we  didn’t  have  complete  records  in  lab  notebooks.  

•  I  panic  every  'me  I  try  to  find  my  students’  data.  

•  I  know  I  won’t  be  able  to  publish  anything  aSer  my  grad  student  leave  because  it’s  too  hard  to  understand  what  he/she  did.  

•  We  don’t  have  the  assay  results  with  substrate  X,  because  I  didn’t  back-­‐up  my  computer.    And  we’re  out  of  substrate  X.    

•  The  only  reason  I  was  able  to  do  the  analysis  was  because  on  a  whim  I  decided  to  migrate  some  of  my  data  off  of  floppy  disks  to  CDs  back  in  the  90s.    I  wish  I  had  done  all  of  it.      

Page 7: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!
Page 8: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!
Page 9: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Benefits  of  Good  Data  Management  

-­‐  Be^er  understand  your  data  now  and  in  the  future  -­‐  Access  your  data  in  the  future    -­‐  Save  'me  and  effort  -­‐  Meet  funder  and  publisher  requirements  

Page 10: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

What  are  the  elements  of  good    data  management?  

   

Page 11: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Plan  

Page 12: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Data  Management  Planning  Basics  

•     Data  Management  Plans  (DMPs)  •  What  data  produced,  how  will  you  make  it  accessible,  how  will  you  preserve  it,    what  restric'ons  are  there  on  access  

•   NSF  -­‐  2011  •   NIH  -­‐  2003  (>$500K)  •   DOE  -­‐  2014  

Page 13: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

DMPTool  –  h^p://www.dmptool.org/  

Page 14: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Organize  

Page 15: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Organiza'on  

•  Dis'nguish  Raw  Data  

•  Be  Consistent  in  File  Naming  and  Directory  Structures  

•  Define  and  Exercise  Version  Control  

Page 16: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Date  Tip  

BAM  Co-­‐Exp  Run  01  20140904.txt  BAM  Co-­‐Exp  Run  02  20140904.txt  BAM  Co-­‐Exp  Run  03  20140904.txt  

Run  1  B  anth  meth  Sept  4  .txt  BAM  Rxn  2  2014_09_04.txt  20140904_meth_3.txt  

vs.

Page 17: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Document  

Page 18: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Describing  Data  •  Enables  someone  unfamiliar  with  your  data  to  find,  evaluate,  understand,  and  reuse  your  data  

•  Helps  YOU  to  remember  details  about  your  data  aSer  'me  has  passed  

 Examples      ReadMe  File:    h^p://goo.gl/eOzQOO  Metadata:    h^p://dublincore.org/documents/dc-­‐xml-­‐guidelines/  Data  Dic'onaries:    Page  4-­‐1  of  this  User  Guide    h^p://goo.gl/Ugqiqo  Codebook:    h^p://www.icpsr.umich.edu/icpsrweb/ICPSR/help/cb9721.jsp    

Page 19: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Basic  Documenta'on  

•  Generic  Informa'on:    •  'tle,  key  dates,  creators,  subjects,  rights,  files,  formats,  versions,  project  &  funder  

 •  Addi'onal  Informa'on:  •  source  of  the  data,  how  analyzed,  soSware  and  hardware  requirements,  resul'ng  publica'ons  

Page 20: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Storage  &  Backup  

Page 21: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Storage  &  Back-­‐Up  Best  Prac'ces  

•  Security  •  Robustness  •  Number  of  copies  •  Loca'on  of  copies  •  Scheduling  •  Efficacy  

Page 22: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

protected  human  informa'on  (PHI)   con'ngent  on  compliance  with  FERPA/HIPAA/IRB  regula'ons  and  guidelines  

personally  iden'fiable   con'ngent  on  ability  to  de-­‐iden'fy  

proprietary  data   con'ngent  on  campus  IP  regula'ons  

classified  data   con'ngent  on  campus  export  control  

local  sensi'vity   con'ngent  on  lab/collabora'on  culture  

Considera'ons  for  Sensi've  Data  

Page 23: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Archive  

Page 24: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

An'cipate  the  Long  Term  •  What  happens  to  your  data  in…  •  2  years?    •  5  years?    •  20  years?  

•  Will  the  formats  be  valid?      •  Will  the  hardware  be  defunct?  •  Will  anyone  know  where  it  is?  

Page 25: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

An'cipate  the  Long  Term  

More  Control  More  Responsibility  

Less  Control  Higher  Poten'al  Impact  

Keep  Private   Make  Public  

! h^p://databib.org/    

Page 26: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Intro  to  Data  Management  Objec'ves  

1.  Learn  the  benefits  of  data  management  2.  Learn  the  components  of  data  management  3.  Iden'fy  gaps  in  your  current  prac'ces  4.  Know  where  to  find  help  

Page 27: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Contact  Chris)e  Wiley  Engineering  Data  Services  Librarian  [email protected]    Sarah  C.  Williams  Life  Sciences  Data  Services  Librarian  [email protected]    Heidi  Imker  Director,  Research  Data  Service  [email protected]    

visit:      310-­‐312  Main  Library  

 call:      

(217)  300-­‐3513    

website:    researchdataservice.illinois.edu  

Page 28: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Data  Management  Workshop  Series  

•  Introduc)on  to  Data  Management  –  Feb  10th    4PM  –  5PM  –  Apr  6th  1PM  –  2PM  

•  Documenta)on  and  Organiza)on  for  Data  and  Processes  –  Feb  17th    4PM  –  5PM  –  Apr  13th    1PM  –  2PM  

•  Making  Research  Data  Public:  Why,  What,  and  How  –  Feb  24th  4PM  –  5PM  –  Apr  20th    1PM  –  2PM  

Page 29: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Scholarly  Commons  h^p://www.library.illinois.edu/sc/index.html  

Library  Home  Page  h^p://www.library.illinois.edu/  

 

Paste  DOI  here.  No  VPN  needed  for  access  to  full  text  (just  ne'd/password)!  

Bonus  Tips!  

Page 30: Introduc)onto* Data*Management** · Intro!to!DataManagementObjec’ves! 1. Learn!the!benefits!of!datamanagement 2. Learn!the!components!of!datamanagement 3. Iden’fy!gaps!in!your!currentprac’ces!

Useful?  Tell  your  friends  and  collogues!  

 Not  Useful?  Tell  us!