Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving...
Transcript of Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving...
![Page 1: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/1.jpg)
Building Archivable
Websites
Nicholas Taylor
Web Archiving Service Manager
Digital Library Systems and Services
Drupal Camp
April 19, 2014
![Page 2: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/2.jpg)
ARCHIVABLE WEBSITES?
Why Build
“Frosted Spiders' Web” by Jess Wood under CC BY 2.0
![Page 3: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/3.jpg)
future users are users, too
“a connection between past and future” by Gioia De Antoniis under CC BY-NC-ND 2.0
![Page 4: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/4.jpg)
maintain web usability
“Broken Web Connections? Welcome to 2009...” by Paul:Ritchie under CC BY-NC-ND 2.0
![Page 5: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/5.jpg)
improve temporal web usability
Internet Archive: “Wayback Machine”
![Page 6: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/6.jpg)
improve temporal web usability
Internet Archive: “Wayback Machine”
![Page 8: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/8.jpg)
refer to earlier website versions
“The Iraq War: Wikipedia Historiography” by STML under CC BY-SA 2.0
![Page 9: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/9.jpg)
institutional history
Internet Archive Wayback Machine: “Stanford University Homepage”
![Page 10: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/10.jpg)
websites are cultural artifacts
“The World Wide Web project”
![Page 11: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/11.jpg)
facilitate compliance
![Page 12: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/12.jpg)
optimize for other crawlers
“SEO on a railway platform” by superboreen under CC BY-NC-ND 2.0
![Page 13: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/13.jpg)
IMPROVE ARCHIVABILITY
How to
“metal web” by paul:74 under CC BY-NC-SA 2.0
![Page 14: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/14.jpg)
follow web standards and
accessibility guidelines
“Web Standards Fortune Cookie” by Matt Herzberger under CC BY-SA 2.0
![Page 15: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/15.jpg)
use a site map, transparent links,
and contiguous navigation
“Card sorting” by Manchester Library under CC BY-SA 2.0
![Page 16: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/16.jpg)
maintain stable URLs and
redirect when necessary
“San Francisco-Oakland Bay Bridge 1442a” by Don Barrett under CC BY-NC-ND 2.0
![Page 17: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/17.jpg)
use semantically-meaningful URLs
“w63.w63.org”
![Page 18: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/18.jpg)
be careful w/ robot exclusion rules
“drupal/robots.txt at 7.x”
![Page 19: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/19.jpg)
minimize reliance on external
assets necessary for presentation
Internet Archive Wayback Machine: “Stanford Department of English”
![Page 20: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/20.jpg)
minimize reliance on external
assets necessary for presentation
“Stanford Department of English”
![Page 21: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/21.jpg)
serve reusable assets from a
single, common location
Google Images: “stanford university seal site:stanford.edu”
![Page 22: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/22.jpg)
specify HTTP response headers for
caching and content encoding
“time capsule on Alcatraz” by inajeep under CC BY 2.0
![Page 23: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/23.jpg)
embed metadata, especially
character encoding
“Keep the Packaging!” by davidd under CC BY 2.0
![Page 24: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/24.jpg)
use durable data formats
“Lascaux cave painting” by Christine McIntosh under CC BY-ND 2.0
![Page 25: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/25.jpg)
prefer responsive design over user-
agent personalization
“«Responsive web design» - 217/366” by Roger Ferrer Ibáñez under CC BY-NC-SA 2.0
![Page 26: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/26.jpg)
examine your site in the Internet
Archive Wayback Machine
Internet Archive Wayback Machine: “Welcome to A Multidimensional Perception ~/*\= & PCGuru”
![Page 27: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/27.jpg)
TOOLS AND SERVICES
Web Archiving
“giant mechanical spider & crowd” by mjtmail (tiggy) under CC BY 2.0
![Page 28: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/28.jpg)
Heritrix
Wikimedia Commons: “File:Heritrix-screenshot.png”
![Page 29: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/29.jpg)
Wget
Wikimedia Commons: “File:Wget_1.13.4.png”
![Page 35: Building Archivable Websites - nullhandle.org · 4/19/2014 · Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014. ARCHIVABLE](https://reader034.fdocuments.us/reader034/viewer/2022052101/603b9577e554b111771f6d58/html5/thumbnails/35.jpg)
thank you!
“stanford dish at sunset” by Dan under CC BY-NC-SA 2.0
Nicholas Taylor