Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec...

11
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014

Transcript of Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec...

Archiving research data in the cloud or in a local

repositoryMichele Kimpton, CEO DuraSpaceCNI Dec 2014

DuraSpace

Common issues• Where can I put my data for long term

access?• How do I make it discoverable, re-useable,

reproducible?• What metadata, provenance and

identifiers should I use?• What policies should be in place for

archiving and preserving data?• How do I fund this?

Data management in DSpace

• DOI support- EZID• ORCID integration• Linked open data support• Integrated with DuraCloud

New features in DSpace 5.0 related to Data Management and archivinghttps://wiki.duraspace.org/display/DSPACE/DSpace+Release+5.0+Status

Data management in Fedora

• Supports linked open data• Content modeling• Versioning• Large files• Fixity checking• External, asynchronous storage

Commercial based cloud solutions

Three popular commercial solutions•Attract end users because solves immediate need without adding a ton of work to end user

•Share, collaborate or meet mandate by publisher or funding agency•Little to no preservation practices in place•No stated or unstated longterm data management practices•Long term at risk reliant on investors interest and success in the market•Lack of trust and control within academic community

Community based cloud solutions

Benefits of networked solutions

• Web based access, easy to access• Can easily replicate content via standard Internet

protocols and API’s• Requires no infrastructure at local facility• Can expose content outside of institution• Can provide geographical distribution• Provides solution for those that can not afford to

build locally

Questions to consider for any hosted solution

• Is it open source?• Are the policies transparent?• What is the governance?• Are there policies and practices in place to

“actively” preserve the data?• Is there an “exit” strategy for the user?• Are the formats open and standard?

• Who has rights to the data?

archivematica

Resources

• RDA: https://rd-alliance.org/groups/long-tail-research-data-ig.html

• POWRR:http://digitalpowrr.niu.edu/tool-grid/