[email protected] An update on Google Book search digitization at the University of Michigan …...
Transcript of [email protected] An update on Google Book search digitization at the University of Michigan …...
An update on Google Book search digitization at the University of Michigan
… the agreement and plans for work between Google and the University of Michigan, … updates on the progress of the project … focus on conversion, content in Google, and Michigan's efforts with UM's copy of the content
Basic information (URLs on final slide)• Contract online• What will be digitized?
– 7m University Library volumes– print (bound)
• Process for books– Flow
• Check-out + send• Wand + scan• Return + check-in
– Pre-/post-condition survey– Industrial approach
• Images returned to UM in continuous flow
About the files…
• Benchmarking/standards• Returned to UM: package per volume, id’d by barcode,
incl. (see UM MDP FAQ)• 600dpi TIFF ITU G4 (bitonal) for print• 300dpi JPEG2000 color/grayscale for illus.• naming conventions corresponding to UM specs• OCR• Checksums• Production notes
• Quality control– Ongoing improvement of hardware/engineering– Image quality good and improving
• What is secret and why?– Technology– Numbers
Status
• Google began capture @UM in July, 2004
• UM receiving content continuously
• Large amounts of UM content went into GBS in November, 2005
• Production ramp-up continues– New facility– Scaling (last week, 11k volumes)
• At UM, embarking on implementation
Key contract provisions, pt. 1• 4.4.1 Use of U of M Digital Copy on U of M Website.
U of M shall have the right to use the U of M Digital Copy, in
whole or in part at U of M's sole discretion, as part of services offered on U of M's website. U of M shall implement technological measures … to restrict automated access to any portion of the U of M Digital Copy …. U of M shall also make reasonable efforts … to prevent third parties from (a) downloading or otherwise obtaining any portion of the U of M Digital Copy for commercial purposes, (b) redistributing any portions of the U of M Digital Copy, or (c) automated and systematic downloading from its website image files from the U of M Digital Copy….
Why would UM put the materials online?
• Responsibility for the “archive” • Michigan “audience” more specific and thus
more specialized…– Rights that Google may not have
• Current Section 108 provisions• Negotiated rights?
– Functions that Google may not want to support• More flexible displays• More powerful citation tools• Power searches?• Data mining and other research applications
Key contract provisions, pt. 2
• 4.4.2 Use of U of M Digital Copy in Cooperative Web Services. Subject to the restrictions set forth in this
section, U of M shall have the right to use the U of M Digital Copy, in whole or in part at U of M's
sole discretion, as part of services offered in cooperation with partner research libraries such as the institutions in the Digital Library Federation….
Creating a cooperative enterprise
• Original vision– Greater than the sum or our parts– Effort tied to the mission of our scholarly enterprise(s)– Extensible framework for content and services
• CIC discussions– 11 of 13 institutions participating– Two coordinated instances of replicated content– A foundation for creating shared definition
• Next?
URLs
• http://www.lib.umich.edu/mdp/– Contract– Project FAQ– Link to information about UM access?
• Mirlyn (UM online catalog)– http://mirlyn.lib.umich.edu/