Post on 26-Mar-2015
HathiTrust
Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
The scholarship that is being produced today is at serious risk of being lost forever to future generations.
Universal LibraryUniversal Library
Common GoalsCommon Goals
Single Entity, Many Single Entity, Many PartnersPartners
HathiTrust MissionTo contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge.
HathiTrust is attempting nothing short of creating a comprehensive preservation repository of published literature, primarily though not exclusively through digitization.
What’s in HathiTrust?
What’s in Hathi Trust?*
*As of October 30, 2012
10,566,195 total
volumes
5,564,036 book titles
274,939 serial titles
3,698,168,250 pages
474 terabytes
125 miles
8,585 tons
3,250,787 volumes
(~31% oftotal) in the public domain
As of May 17,
2012*As of October 30,
2012
What’s preserved in HathiTrust - by
date
The top 10 languages make up 86% of the content
The top 10 languages make up 86% of the content
*As of October 30, 2012
The remaining 40 languages make up 13% of the content
The remaining 40 languages make up 13% of the content
*As of October 30, 2012
Who’s in HathiTrust?
University of MissouriUniversity of MissouriUniversity of Nebraska-University of Nebraska-Lincoln Lincoln The University of North The University of North Carolina at Chapel HillCarolina at Chapel HillUniversity of Notre DameUniversity of Notre DameUniversity of PennsylvaniaUniversity of PennsylvaniaUniversity of PittsburghUniversity of PittsburghUniversity of UtahUniversity of UtahUniversity of VirginiaUniversity of VirginiaUniversity of WashingtonUniversity of WashingtonUniversity of Wisconsin-University of Wisconsin-MadisonMadisonUtah State UniversityUtah State UniversityVirginia TechVirginia TechWashington UniversityWashington UniversityYale University LibraryYale University Library
Individual Individual InstitutionsInstitutionsArizona State UniversityArizona State UniversityBaylor UniversityBaylor UniversityBoston CollegeBoston CollegeBoston UniversityBoston UniversityCalifornia Digital LibraryCalifornia Digital LibraryColumbia UniversityColumbia UniversityCornell UniversityCornell UniversityDartmouth CollegeDartmouth CollegeDuke UniversityDuke UniversityEmory UniversityEmory UniversityFlorida State UniversityFlorida State UniversityHarvard University LibraryHarvard University LibraryIndiana UniversityIndiana UniversityJohns Hopkins UniversityJohns Hopkins UniversityLafayette CollegeLafayette CollegeLibrary of CongressLibrary of CongressMassachusetts Institute of Massachusetts Institute of TechnologyTechnologyMcGill UniversityMcGill UniversityMichigan State UniversityMichigan State UniversityNew York UniversityNew York UniversityNew York Public LibraryNew York Public LibraryNorth Carolina Central UniversityNorth Carolina Central UniversityNorth Carolina State UniversityNorth Carolina State UniversityNorthwestern UniversityNorthwestern UniversityThe Ohio State UniversityThe Ohio State UniversityThe Pennsylvania State UniversityThe Pennsylvania State University
Princeton University Princeton University Purdue UniversityPurdue UniversityStanford UniversityStanford UniversityTexas A&M UniversityTexas A&M UniversityUniversidad Complutense de MadridUniversidad Complutense de MadridUniversity of ArizonaUniversity of ArizonaUniversity of California BerkeleyUniversity of California BerkeleyUniversity of California DavisUniversity of California DavisUniversity of California IrvineUniversity of California IrvineUniversity of California Los AngelesUniversity of California Los AngelesUniversity of California MercedUniversity of California MercedUniversity of California RiversideUniversity of California RiversideUniversity of California San DiegoUniversity of California San DiegoUniversity of California San FranciscoUniversity of California San FranciscoUniversity of California Santa BarbaraUniversity of California Santa BarbaraUniversity of California Santa CruzUniversity of California Santa CruzThe University of ChicagoThe University of ChicagoUniversity of ConnecticutUniversity of ConnecticutUniversity of DelawareUniversity of DelawareUniversity of FloridaUniversity of FloridaUniversity of Illinois at Urbana-University of Illinois at Urbana-ChampaignChampaignUniversity of Illinois at ChicagoUniversity of Illinois at ChicagoThe University of IowaThe University of IowaUniversity of MarylandUniversity of MarylandUniversity of MichiganUniversity of MichiganUniversity of Minnesota University of Minnesota
Partnership Community*Partnership Community*
ConsortiaConsortiaCommittee on Institutional Committee on Institutional CooperationCooperationTriangle Research Libraries Triangle Research Libraries NetworkNetworkUniversity of CaliforniaUniversity of California
*As of October 30, 2012
Member Benefits
Content Storage
Preservation ServicesAccess
Services
Availability of Bibliographic Data
Governance
Eligibility
Institutions worldwide. HathiTrust's partnership model is geared primarily towards academic and research libraries with large amounts of digitized book and journal content or substantial print collections.
All interested parties are invited to contact us at feedback@issues.hathitrust.org
Partner Checklist: http://www.hathitrust.org/partnership_checklist
How much does it cost?
$
Cost for partner institutions= [# of vol in the public domain] * 1.5 *
[cost/vol/yr]
[# of partners]
Part I: Public Domain Costs
Cost for partner institutions=
[# of vol in-copyright] * 1.5 * [cost/vol/yr]
[# of partners holding these volumes]
Part II: In-Copyright Costs
1
1: partners who currently or previously had these volumes in their print holdings.
Example:
2,000,000 * 1.5 * 0.19
62= $9,193.55
2,000,000 * 1.5 * 0.19
12= $47,500
Example:
In-copyright
Public Domain
$9,193.55+ $47,500.00
$56,693.55
Total Cost to a Partner for this ExampleFor more information:
http://www.hathitrust.org/cost
HathiTrust Research Center
The HathiTrust Research Center (HTRC) enables computational access for nonprofit and educational users to published works in the public domain and, in the future, on limited terms to works in-copyright from the HathiTrust.
The HTRC is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library, to help meet the technical challenges of dealing with massive amounts of digital text that researchers face by developing cutting-edge software tools and cyberinfrastructure to enable advanced computational access to the growing digital record of human knowledge.
Leveraging data storage and computational infrastructure at Indiana University and the University of Illinois at Urbana-Champaign, the HTRC will provision a secure computational and data environment for scholars to perform research using the HathiTrust Digital Library. The center will break new ground in the areas of text mining and non-consumptive research, allowing scholars to fully utilize content of the HathiTrust Library while preventing intellectual property misuse within the confines of current U.S. copyright law.
http://www.hathitrust.org/htrc
Projects
Creating a Registry of US Federal Governmen
t Documentshttp://www.hathitrust.org/usgovdocs_registry http://permanent.access.gpo.gov/gpo30084/CRPT-
112hrpt663-pt1.pdf
Creating a MetaData Management System
DataData
DataData
DataData
DataData
DataData
DataDataabout about
ab
out
aboutabout
about
Coordinated & sponsored by the California Digital Library and HathiTrust
http://www.hathitrust.org/htmms
http://www.hathitrust.org/mdl_imageshttp://www.mndigital.org/projects/preservation/
The Minnesota Digital Library Image Preservation Prototype Project:
to ensure long-term preservation of digital content, particularly
images, from cultural heritage institutions