HARVARD-PURDUE DATA SYMPOSIUM

22
HARVARD-PURDUE DATA SYMPOSIUM JUNE 17, 2015 – REPOSITORY SERVICES Purdue University Research Repository Michael Wi* Head, Distributed Data Cura5on Center Associate Professor of Library Science h*p://www.lib.purdue.edu/research/wi* Email: mwi*@purdue.edu

Transcript of HARVARD-PURDUE DATA SYMPOSIUM

Page 1: HARVARD-PURDUE DATA SYMPOSIUM

HARVARD-PURDUE DATA SYMPOSIUM JUNE 17, 2015 – REPOSITORY SERVICES

Purdue University Research Repository

Michael  Wi*  Head,  Distributed  Data  Cura5on  Center  Associate  Professor  of  Library  Science  

h*p://www.lib.purdue.edu/research/wi*  E-­‐mail:  mwi*@purdue.edu  

Page 2: HARVARD-PURDUE DATA SYMPOSIUM

DATA = EVIDENCE

h"p://epicgraphic.com/data-­‐cake  

Page 3: HARVARD-PURDUE DATA SYMPOSIUM

REPOSITORIES IN A DATA ECOSYSTEM A  searchable  catalog  of  1,268  research  data  repositories  from  around  the  world  in  all  disciplines,  h*p://re3data.org  

Many  different  flavors  of  data  repositories…    

•  Publisher,  e.g.,  Dryad  •  Sub/Disciplinary,  e.g.,  RKMP  •  Consor5um,  e.g.,  ICPSR  •  Country,  e.g.,  Research  Data  Australia  •  Government,  e.g.,  Data  Portal  India  •  Research  center,  e.g.,  NASA  GES  DISC  •  Instrument,  e.g.,  CHANDRA  •  General-­‐purpose,  e.g.,  FigShare  •  Roll-­‐your-­‐own,  e.g.,  DataVerse  •  University,  e.g.,  PURR  

Page 4: HARVARD-PURDUE DATA SYMPOSIUM

h"p://purr.purdue.edu  

Page 5: HARVARD-PURDUE DATA SYMPOSIUM

CAMPUS COLLABORATION Purdue  University  Research  Repository  (PURR)    

The  PURR  service  is  a  collabora5ve  effort  of  the  Purdue  University  Libraries,  Execu?ve  Vice  President  for  Research  and  Partnerships,  and  Informa?on  Technology  at  Purdue.  PURR  is  a  designated  university  core  research  facility.    

Designated  community:    Purdue  University  faculty,  staff,  and  graduate  student  researchers;  their  collaborators;  and  the  current  and  future  consumers  of  their  data.    

Page 6: HARVARD-PURDUE DATA SYMPOSIUM

LIBRARY STRATEGIC PLAN Data  is  wri*en  into  the  three  pillars  of  our  strategic  plan:  •  Learning:  “…informa?on  literacy  defined  broadly  to  include  

digital  informa?on  literacy,  science  literacy,  data  literacy,  health  literacy,  etc…”  

•  Scholarly  Communica5on:  “Lead  in  data-­‐related  scholarship  and  ini?a?ves”  

•  Global  Challenges:  “We  will  lead  in  interna?onal  ini?a?ves  in  informa?on  literacy  and  e-­‐science  and  …  contribute  to  interna?onal  informa?on  literacy,  learning  spaces,  data  management,  and  scholarly  communica?on  ini?a?ves.”  

h"ps://www.lib.purdue.edu/sites/default/files/admin/plan2016.pdf  

Page 7: HARVARD-PURDUE DATA SYMPOSIUM

CURATION LIFECYCLE SERVICE MODEL

Wi*,  M.  (2012).  Co-­‐designing,  Co-­‐developing,  and  Co-­‐implemen5ng  an  Ins5tu5onal  Data  Repository  Service.  Journal  of  Library  Administra?on,  52(2).  DOI:10.1080/01930826.2012.655607.  h"p://docs.lib.purdue.edu/lib_fsdocs/6/    Digital  Cura5on  Centre’s  Cura5on  Lifecycle  Model:  h"p://www.dcc.ac.uk/resources/cura?on-­‐lifecycle-­‐model    

Page 8: HARVARD-PURDUE DATA SYMPOSIUM

PURR POSTCARD AND POSTER

8 8

Page 9: HARVARD-PURDUE DATA SYMPOSIUM

DATA MANAGEMENT PLANS •  Boilerplate  text  •  Example  DMPs  •  DMP  Self-­‐Assessment  •  DMPTool  •  Workshops  •  Tutorials  •  Reference  and  consulta5on  with  subject-­‐specialist  librarian  

and/or  data  services  specialist  

h"ps://purr.purdue.edu/dmp  

Page 10: HARVARD-PURDUE DATA SYMPOSIUM

Dimensions  of  Discovery  (Winter  2013).  Office  of  the  Vice  President  for  Research,  Purdue  University,    h"p://www.purdue.edu/research/vpr/publica?ons/docs/dimensions/Winter2013.pdf  

Page 11: HARVARD-PURDUE DATA SYMPOSIUM

CREATE A PROJECT

PURR  project  tutorial  video:  h"p://www.youtube.com/watch?v=q5xGO_oF9uQ  

Page 12: HARVARD-PURDUE DATA SYMPOSIUM

USE PROJECT TO COLLABORATE Create: •  any Purdue faculty, staff, or graduate student researcher can create projects •  describe the project •  disclaim use of sensitive or restricted data •  receive a default allocation of storage •  register a grant award to increase allocation •  invite collaborators to join project

Collaborate: •  git repository to share and version files (sftp & Google Drive integration) •  virtual machine/s •  wiki •  blog •  to-do list management and project notes •  newsfeed •  stage data publications

Page 13: HARVARD-PURDUE DATA SYMPOSIUM

STORAGE ALLOCATION

h"ps://purr.purdue.edu/about/pricing  

Page 14: HARVARD-PURDUE DATA SYMPOSIUM

DATA PUBLICATION & ARCHIVING

PURR  publica5on    tutorial  video:  h"p://www.youtube.com/watch?v=jYBcsfiRhio  

Page 15: HARVARD-PURDUE DATA SYMPOSIUM

PURR GOVERNANCE & STAFFING •  Execu<ve  Commi"ee:  Dean  of  Libraries,  Vice  President  for  Research,  Chief  Informa5on  Officer  

•  Steering  Commi"ee:  2  from  libraries,  2  from  IT,  2  from  research  office  and  sponsored  programs,  3  domain  faculty  researchers  

•  Personnel:  Project  Director  (.50),  Technologists  (3.85),  HUBzero  Liaison  (.35),  Metadata  Specialist  (.20),  Digital  Archivist  (.25),  Repository  Outreach  Specialist  (1.0),  Data  Curator  (1.0)  

•  Key  players:  Subject-­‐specialist  librarians  &  data  services  specialists  

Page 16: HARVARD-PURDUE DATA SYMPOSIUM

Librarians  consult  on  data  management  plans  in  their  subject  areas.  

Crea5ng  opportuni5es  for  librarians  to  interact  with  researchers  about  data  

Page 17: HARVARD-PURDUE DATA SYMPOSIUM

Librarian  is  no5fied  by  e-­‐mail  when  a  new  project  is  created  or  a  grant  is  awarded,  based  on  department  affilia5on  of  Purdue  project  owner.    

Crea5ng  opportuni5es  for  librarians  to  interact  with  researchers  about  data  

Page 18: HARVARD-PURDUE DATA SYMPOSIUM

Librarian  may  consult  or  collaborate  on  project  if  needed.  

Crea5ng  opportuni5es  for  librarians  to  interact  with  researchers  about  data  

Page 19: HARVARD-PURDUE DATA SYMPOSIUM

Librarians  review  and  post  submi*ed  datasets.  

Crea5ng  opportuni5es  for  librarians  to  interact  with  researchers  about  data  

Page 20: HARVARD-PURDUE DATA SYMPOSIUM

At  the  end  of  ini5al  commitment  (10  years),  archived  and  published  datasets  are  remanded  to  the  Libraries’  collec5on.  A  librarian  working  with  the  digital  archivist  selects  (or  not)  the  dataset  for  the  collec5on.  

Crea5ng  opportuni5es  for  librarians  to  interact  with  researchers  about  data  

Page 21: HARVARD-PURDUE DATA SYMPOSIUM

EARLY ASSESSMENT 2013  was  first  full  year  of  PURR  in  opera5on,  to  date:  •  1,472  data  management  plans  •  172  grant  awards  •  1,437  registered  researchers  •  559  research  projects  •  239  published  datasets  from  135  different  co-­‐authors  •  200  cita5ons  of  datasets    

 

Page 22: HARVARD-PURDUE DATA SYMPOSIUM

THANK YOU

PURR:  h*p://purr.purdue.edu  

Michael  Wi*  Head,  Distributed  Data  Cura5on  Center  Associate  Professor  of  Library  Science  

h*p://www.lib.purdue.edu/research/wi*  E-­‐mail:  mwi*@purdue.edu  

h"p://bit.ly/1MWlZ27