Incorpora(ng Data Access into Journal Workflow
Mercè Crosas, Ph.D. Twi@er: @mercecrosas
Director of Data Science, IQSS Harvard University
• The Harvard Dataverse Network is a data sharing repository open to all research data from all domains.
• The Dataverse Network soMware is open-‐source, installed in ins(tu(ons across the world (h-p://thedata.org)
h@p://thedata.harvard.edu h@p://thedata.harvard.edu
Dataverse: Container for research studies Study: Container for data, documenta(on, and code
Data sharing and archiving with control and recogni(on for data authors, distributors
Persistent Data Cita(ons permanently linking your data to your publica(on (Altman, King, 2007)
Support for all file types any format, max 2 GB per file
Customized Branding or embed on your site
Data Restric(ons & terms of use op(ons, although encouraging Open Data
Rich data support for some data formats
SPSS, Stata, R Data metadata extrac(on, subse]ng
& analysis (R, Zelig)
FITS Data metadata extrac(on from file
header
Social Network Data (GraphML) smart queries & subse]ng
Data visualiza(ons for (me series
Data management, standards and archival good prac(ces
Data Cataloging self-‐curated, with custom metadata
templates (DDI, Dublin Core)
New Study
Revise Study
Released version 2
Released version 1
Data Versioning preserve & cite previous versions
Log traffic & downloads to your dataset with Guestbook
Permanent storage preserva(on format with w/copies in mul(ple loca(ons (OAI-‐PMH, LOCKSS)
Dataverse for an Individual Researcher
Dataverse for an Organiza(on
Dataverse for a Journal
Seamless Integra(on between Dataverse and Journals
OJS plugin for: Data + metadata + suppor(ng files,
sent via SWORD API to the Dataverse
PKP’s Open Journal System (OJS) Harvard Dataverse Network
CitaGon to Data
CitaGon to ArGcle
0
600
OJS Journals and Dataverses Growth
From 1990 to 2013: 5000 ac(ve OJS Journals
From 2007 to 2013: 500 Dataverses
Credit: Juan Alperin, PKP; Gustavo Durand, Dataverse
Dataverse Plugin in OJS
First, set up a Dataverse for each Journal
Metadata fields will be selected ahead of time by journal admin.
Published Ar(cle -‐ Linked to Data
Data in Dataverse -‐ Linked to Published Ar(cle
Data Publishing Workflow
Submission Ar(cle + Data
Review, Approved
Ar(cle published in journal (OJS)
Data published in Dataverse
Review, Not Accepted Data
published in Dataverse ?
# of arGcle downloads
# of data set downloads
Alterna(ve Workflow
Submission Ar(cle
Data already in Repository
Add Data Cita(on to Ar(cle
Submit Ar(cle Cita(on to
Data
Track # of arGcle downloads and # of data sets downloads
Par(cipa(ng Journals and Outreach
Beta Testers: 43 OJS journals, from 6 publishers (social sciences, health, life
sciences)
Extended Testers: > 400 OJS journals (economics, social sciences, health, life
sciences)
Outreach: Beyond OJS,
journals using other publishing systems
Beta Testers Criterion: Current, quan(ta(ve OJS journals interested in data sharing
Summer/Fall 2013
Fall 2013/Winter 2014
Metajournals as incenGves
Credit: Brian Hole, Ubiquity Press (beta tester Publisher for Dataverse integraGon)
Credit: Brian Hole, Ubiquity Press
Amsterdam Manifesto: Formal Data Cita(on in Publica(on’s Reference List
Top Related