Jpwilkin@umich.edu1 What the Hell Were We Thinking!?!?! UM-Google Digitization Deal What is it?...

Post on 22-Dec-2015

214 views 0 download

Tags:

Transcript of Jpwilkin@umich.edu1 What the Hell Were We Thinking!?!?! UM-Google Digitization Deal What is it?...

jpwilkin@umich.edu 1

What the Hell Were We Thinking!?!?! UM-Google

Digitization DealWhat is it?

Where do things stand?

jpwilkin@umich.edu 2

Basic information

• What will be digitized?– 7m University Library volumes– print (bound)

• Google’s approach to “fair use”

• Process for books

• Images are returned to UM

• What (in basic terms) will UM do with images it receives

jpwilkin@umich.edu 3

jpwilkin@umich.edu 4

Why would Google? “project’s aim is simple: help maintain the

preeminence of books and libraries in our increasingly Internet-centric culture by making these information resources an integral part of the online experience. We hope to guide more users to their local libraries; to digital archives of some of the world's greatest research institutions; and to out-of-print books they might not be able to find anywhere else – all while carefully respecting authors’ and publishers' copyrights.”

Google Print/Library FAQ

jpwilkin@umich.edu 5

About the files…

• Benchmarking/standards• What we get is package per volume, id’d by barcode, incl.

• 600dpi ITU G4 (bitonal) for print• 300dpi JPEG2000 color/grayscale for illus.• naming conventions corresponding to UM specs• OCR• Checksums• Production notes

• Quality control• Ongoing improvement of hardware/engineering• Image quality good and improving• What is secret and why?

– Technology– Numbers

jpwilkin@umich.edu 6

THE GOOGLE WORKSTATION(CONFIDENTIAL)

jpwilkin@umich.edu 7

THE ANN ARBORWORK GROUP

jpwilkin@umich.edu 8

Status

• Google began capture @UM in July, 2004

• UM receiving content continuously

• Large amounts of UM content went into GBS in November, 2005

• Production ramp-up continues

• At UM, embarking on implementation

jpwilkin@umich.edu 9

Why would UM put the materials online?

• Responsibility for the “archive”

• Michigan “audience” more specific and thus more specialized– More flexible displays– More powerful citation tools– Power searches?– Data mining and other research applications

jpwilkin@umich.edu 10

jpwilkin@umich.edu 11

jpwilkin@umich.edu 12

jpwilkin@umich.edu 13

jpwilkin@umich.edu 14