HathiTrust and Print Storage Building around a digital core.

60
HathiTrust and Print Storage Building around a digital core

Transcript of HathiTrust and Print Storage Building around a digital core.

Page 1: HathiTrust and Print Storage Building around a digital core.

HathiTrust and Print Storage

Building around a digital core

Page 2: HathiTrust and Print Storage Building around a digital core.

HathiTrust Content Growth

Page 3: HathiTrust and Print Storage Building around a digital core.

Content Distribution

* As of May 1, 2011

8,625,158 Total volumes2,297,041 Public Domain4,722,664 Book titles209,930 Serial titles

Page 4: HathiTrust and Print Storage Building around a digital core.

Content Distribution

* As of May 1, 2011

Page 5: HathiTrust and Print Storage Building around a digital core.

Dates

* As of May 1, 2011 Statistics and Visualizations

Page 6: HathiTrust and Print Storage Building around a digital core.

Breakdown of HathiTrust book corpus by publication date

Bibliographic Indeterminacy and the Scale of Problems and Opportunities of "Rights" in Digital Collection Building – 2/2011

Page 7: HathiTrust and Print Storage Building around a digital core.

Breakdown of HathiTrust book corpus by publication date

Page 8: HathiTrust and Print Storage Building around a digital core.

Language Distribution (1)

The top 10 languages make up ~86% of all content

Statistics and Visualizations* As of May 1, 2011

Page 9: HathiTrust and Print Storage Building around a digital core.

Language Distribution (2)

The next 40 languages make up ~13% of total

Statistics and Visualizations* As of May 1, 2011

Page 10: HathiTrust and Print Storage Building around a digital core.

Content over time

* As of May 1, 2011

Page 11: HathiTrust and Print Storage Building around a digital core.

Dates (copyright)

Page 12: HathiTrust and Print Storage Building around a digital core.

A global change in the library environment

June 2010Median duplication: 31%

June 2009Median duplication: 19%

Academic print book collection already substantially duplicated in mass digitized book corpus

Page 13: HathiTrust and Print Storage Building around a digital core.

Continuing growth of overlap …

• ARL overlap– 31% in June 2010– 33% in Dec (adjustment: adding little-held works)– ~ 1% per 225,000 vols– 38% in May, 2011; 45% by December, 2011

• Oberlin Group overlap– 41% in December, 2010– Higher rate of overlap per added volume?– Close to 50% in May, 2011

Page 14: HathiTrust and Print Storage Building around a digital core.

And yet every library is different

• Our median rate of overlap may be the same• But our overlap profiles will differ by library

Page 15: HathiTrust and Print Storage Building around a digital core.
Page 16: HathiTrust and Print Storage Building around a digital core.
Page 17: HathiTrust and Print Storage Building around a digital core.
Page 18: HathiTrust and Print Storage Building around a digital core.
Page 19: HathiTrust and Print Storage Building around a digital core.
Page 20: HathiTrust and Print Storage Building around a digital core.
Page 21: HathiTrust and Print Storage Building around a digital core.
Page 22: HathiTrust and Print Storage Building around a digital core.
Page 23: HathiTrust and Print Storage Building around a digital core.
Page 24: HathiTrust and Print Storage Building around a digital core.
Page 25: HathiTrust and Print Storage Building around a digital core.
Page 26: HathiTrust and Print Storage Building around a digital core.
Page 27: HathiTrust and Print Storage Building around a digital core.
Page 28: HathiTrust and Print Storage Building around a digital core.
Page 29: HathiTrust and Print Storage Building around a digital core.
Page 30: HathiTrust and Print Storage Building around a digital core.
Page 31: HathiTrust and Print Storage Building around a digital core.
Page 32: HathiTrust and Print Storage Building around a digital core.
Page 33: HathiTrust and Print Storage Building around a digital core.
Page 34: HathiTrust and Print Storage Building around a digital core.
Page 35: HathiTrust and Print Storage Building around a digital core.
Page 36: HathiTrust and Print Storage Building around a digital core.
Page 37: HathiTrust and Print Storage Building around a digital core.
Page 38: HathiTrust and Print Storage Building around a digital core.
Page 39: HathiTrust and Print Storage Building around a digital core.
Page 40: HathiTrust and Print Storage Building around a digital core.
Page 41: HathiTrust and Print Storage Building around a digital core.
Page 42: HathiTrust and Print Storage Building around a digital core.
Page 43: HathiTrust and Print Storage Building around a digital core.
Page 44: HathiTrust and Print Storage Building around a digital core.
Page 45: HathiTrust and Print Storage Building around a digital core.
Page 46: HathiTrust and Print Storage Building around a digital core.
Page 47: HathiTrust and Print Storage Building around a digital core.
Page 48: HathiTrust and Print Storage Building around a digital core.
Page 49: HathiTrust and Print Storage Building around a digital core.
Page 50: HathiTrust and Print Storage Building around a digital core.
Page 51: HathiTrust and Print Storage Building around a digital core.
Page 52: HathiTrust and Print Storage Building around a digital core.
Page 53: HathiTrust and Print Storage Building around a digital core.
Page 54: HathiTrust and Print Storage Building around a digital core.
Page 55: HathiTrust and Print Storage Building around a digital core.
Page 56: HathiTrust and Print Storage Building around a digital core.
Page 57: HathiTrust and Print Storage Building around a digital core.
Page 58: HathiTrust and Print Storage Building around a digital core.

And yet every library is different

• Our median rate of overlap may be the same• But our overlap profiles will differ by library• Our use patterns differ• Our risk profiles differ• Our roles vis-à-vis our constituencies differ• Thus, the need to act independently on

common data

Page 59: HathiTrust and Print Storage Building around a digital core.

Extending the holdings database

• HathiTrust print holdings database– Basis for new cost model (overlap of in-copyright)– Basis for lawful uses (e.g., print disabilities, Section 108)– A more complete picture than elsewhere

• Print monograph storage proposal– Enable partners to register commitments – Establish definitions (e.g., environment, use and condition)– Build in cost-sharing: collectively fund those that make

commitments– Communicate information to partners to facilitate

decision-making

Page 60: HathiTrust and Print Storage Building around a digital core.

Next steps?

• Work to develop draft proposal, led by Tom Teper, underway by HathiTrust Collections Committee (Ivy Anderson, chair)

• Early draft for review to Executive Committee in May/June

• Final version from Executive Committee to partners in late summer

• Consideration as part of new cost model at Constitutional Convention