Post on 26-Dec-2015
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
Thu-Mai Christian <tlchristian@unc.edu>[Slides]
Micah Altman <micah_altman@alumni.brown.edu>Jonathan Crabtree <jonathan_crabtree@unc.edu>
[Project Directors]
AcknowledgementsThanks to the Institute for Museum and Library Services (LG-05-09-0041-09) for funding development of the system; and to the Library of Congress for funding initial prototypes.
Thanks to the staff at the Odum Institute and Dataverse Team at the Institute of Quantitative Science for development efforts, and to ICPSR and the Roper Center for Public Opinion Research for testing.
Thanks to the LOCKSS team for consultation and assistance.
Thanks to Amazon for supporting delivery of the course at IASSIST through their AWS Research in Education grants.
References• Micah Altman, Jonathan Crabtree (2011) Using the SafeArchive
System : TRAC-Based Auditing of LOCKSS, 165-170. In Archiving 2011. http://www.box.net/shared/8py6vl9kxivo6u21rkn8
• Altman, M., Beecher, B., and Crabtree, J.; with L. Andreev, E. Bachman, A. Buchbinder, S. Burling, P. King, M. Maynard. 2009. "A Prototype Platform for Policy-Based Archival Replication." Against the Grain. 21(2): 44-47.
• Victoria Reich, "Distributed Digital Preservation," Proceedings of the Indo-US Workshop on International Trends in Digital Preservation, National Digital Preservation Program, Pune, India, March 2009 http://www.lockss.org/locksswiki/files/ReichIndiaFinal.pdf
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web ServicesIV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web ServicesIV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Why was Created?Verified geographically-distributed replication of content is an essential
component of any comprehensive digital preservation plan.
The requirement has emerged as a necessity for recognition and certification as a trusted repository.
Why use ?
SafeArchive provides the reliability of a top-down replication system with the resiliency of a peer-to-peer model.
- SafeArchive automates high-level replication and distribution policies- SafeArchive automates multi-institutional replication- SafeArchive facilitates sharing TRAC policies- SafeArchive verification and audit trails for replication policies- SafeArchive is Open Source, and integrates with LOCKSS, and the Dataverse Network- SafeArchive is Standards-Based, and supports DDI, OAI-PMH, and TRAC
SafeArchive Overview
SafeArchive automates high-level replication policies and helps institutions to collaborate in preserving digital content. GUI-based tools are designed for librarians and archivists—not systems administrators.
Operationally, system users can:• Analyze any LOCKSS network• Check that collections are replicated, valid, and up-to-date• Create formal replication policies• Replicate content from web sites or digital repository systems• Audit the network for current and historical ISO compliance• Automatically manage and repair a LOCKSS network based on a specific replication policy
SafeArchive provides the reliability of a top-down replication system with the resiliency of a peer-to-peer model.
What can you do with ? • Analyze any existing set of public LOCKSS systems or Private LOCKSS Network
• which collections are replicated?• when were they last verified, and updated?• identify potential problems with the storage network
• Create formal TRAC policies• create operational policies for replication and distribution• create advisory policies for all TRAC criteria
• Audit your storage network against your policies• verify that collections are currently replicated, verified, updated• create historical audit trails and evidence of long-term compliance
• Replicate content from web sites or digital repository systems• use SafeArchive/DVN plugins to replicate content in the Dataverse Network• use SafeArchive/LOCKSS plugins to replicate content through OAI or HTTP• [This Fall] Automatically deploy and repair LOCKSS replication based on policy
Review• Install SafeArchive:
– Using the prepackaged Amazon Web Services image– OREby installing the source on Linux
• Tell SafeArchive what hosts to monitor• Use the Audit Schema Manager web interface• OR Supply a LOCKSS PLN configuration file
• Tell SafeArchive your policies• Use the Audit Schema Manager web interface• Fill in the blanks
• Verify your content• Use the Audit Report Web interface to see things immediately• OR configure the system to send you e-mail• OR visit historical records later
• Sleep better – your content is SAFE
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview and Tour of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web Services
A. Launching the SafeArchive AMIB. Configuring and Installing the SafeArchive AMI
IV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Running the SafeArchive System using Amazon Web Services
http://aws.amazon.com
Running the SafeArchive System using Amazon Web Services
Enter your e-mail address and Amazon password
Running the SafeArchive System using Amazon Web Services
https://console.aws.amazon.com/ec2/home?region=us-east-1#launchAmi=ami-e67ddc8f
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview and Tour of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web Services
A. Launching the SafeArchive AMIB. Configuring and Installing the SafeArchive AMI
IV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Running the SafeArchive System using Amazon Web Services
Associate the IP address with the SafeArchive AMI
Running the SafeArchive System using Amazon Web Services
Associate the IP address with the SafeArchive AMI
Running the SafeArchive System using Amazon Web Services
Add Custom TCP rules
Ports:22
3306370038203920484880808686
Running the SafeArchive System using Amazon Web Services
PuTTY.exe
Running the SafeArchive System using Amazon Web Services
Press Enter to accept the default inputs (as appropriate)
Running the SafeArchive System using Amazon Web Services
Copy and paste the URL in the Firefox browser
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview and Tour of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web Services
A. Launching the SafeArchive AMIB. Configuring and Installing the SafeArchive AMI
IV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview and Tour of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web Services
A. Launching the SafeArchive AMIB. Configuring and Installing the SafeArchive AMI
IV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS
I. IntroductionsII. Overview and Tour of the SafeArchive SystemIII. Hands On: Running the SafeArchive System using Amazon Web Services
A. Launching the SafeArchive AMIB. Configuring and Installing the SafeArchive AMI
IV. BreakV. Hands On: Setting up the SafeArchive SystemVI. Hands On: Using the Safe Archive System