Central Registry for Digitized Objects: Linking Production and Bibliographic Control (2007)

Central Registryfor Digitized Objects:

Linking Production andBibliographic Control

Ralf StockmannGöttinger Digitization Center

As things are now

• Huge ventures in– Digitization

• Google• Microsoft• National programs• Local centers

– Accessibility• World Digital Library• European Digital Library• National portals• Google Book Search

As things are now

• We just face the dawn of mass digitization– Leaving behind the state of

manufacturing– Entering industrialization– Scanning Robots– Accessible Full Text (OCR)

Lack of …

• Coordination in digitization activities– Who scans what

where when in which quality and how will it be accessible

• How is “quality” defined?

• Do we agree on “what”?

Number of digitized items per volume

ue Waste of Ressources

Facing the Consequences

AdditionalBenefit

TechnicalImprovements

The Solution• Central registry for digitized objects• Focused on the production context (no user

frontend)• API driven

– Application Programming Interface– Query / Ingest– Simple implementation into existing workflow-tools

• Batch mode (lists)• Open Source / free service• Matching on volume level

– Score / probability

Implementation

APIAPI

Aggregator / Normalizer / MappingAggregator / Normalizer / Mapping

Registry / Meta Data StoreRegistry / Meta Data Store

IngestIngest

Present Collections

QueryQuery

IngestIngest

Running Project

! ! !Notice of Intent

IngestIngest

Backend ServicesEROMM / EDL / OCLC / …

Metadata Store• Bibliographic

– Title– Author– Date– Place of publication– Number of Pages (?)– Language– Print / Format– Edition

• Technical– Resolution– Color depth– File type / compression

• Accessibility– Institution– Persistent identifier– Rights– URL

• Status– Digitized– In Progress– Intended (Timeline?)– Requested?

Matching / Score„what“

Additional Judging„who, where, which quality, how accesible“

Decisive Factor„when“

Obstacles• (open source) Tools for automated matching /

scoring?• Interface for manual comparison / decision making• Multivolume works: low rate of uniformity (near

50% of physical SUB stock before 1900)• Unicode• Transliteration tables• Random bound books• Reliable identifier

– ISBN for old books?

• Anticipated rate of accuracy: 50 – 70 %

Appreciation of Values• The goal is NOT to build a reliable database in terms of

library standards

• But to prevent further waste of resources.

• If we manage to archive just 50% precision,

• We saved a min. 50% of founding!

Work Packages• Define metadata model• Set up database• Implement mapping tools• Define API calls• Implement API• Build some connectors to popular mass digitization workflow

tools (e.g. “Goobi”)• Establish ISBN workflow• Harvest existing sources• Start with a community of actual projects

• Get some (!) founding• Estimated schedule plan: 6 months

Thank You(stockmann@uni-goettingen.de)

Central Registry for Digitized Objects: Linking Production and Bibliographic Control (2007)

Technology

Transcript of Central Registry for Digitized Objects: Linking Production and Bibliographic Control (2007)

Arrangement of bibliographic sources Structure of bibliographic databases.

Whither Bibliographic Data? Designing a roadmap to a new bibliographic information ecosystem

Central Registry for Digitized Objects: Linking Production and Bibliographic Control

BASIC BIBLIOGRAPHIC INFORMATIONmladomino.mla.org/webhelp/2020ManualFiles/BASIC... · 2020. 1. 30. · 1 3. BASIC BIBLIOGRAPHIC INFORMATION Indexers enter bibliographic information

digitized by - Internet Archive...digitized by . digitized by . digitized by

· Web viewDigitized by Google. Digitized by Google. Digitized by. Digitized by. Digitized by Google. Digitized by Google. Digitized by. Digitized by. Digitized by Google ...

Digitized classrooms

ISBD(PM): International Standard Bibliographic Description ... · International Standard Bibliographic Description arose out of a ... International Standard Bibliographic Description

Digitized health

NISO Bibliographic Roadmap Meeting - Carpenter welcome and overview of bibliographic infrastructure copy

digitized by

digitized by - Internet Archive · 2012. 8. 25. · digitized by . digitized by . digitized by

ONLY )' /- BIBLIOGRAPHIC

THEcifas.us/pdf/Comitas s/Long Manuscripts/2005... · Web viewTHE DIGITIZED CARIBBEANA A Bibliographic Guide to the Non-Hispanic Countries of the Region for the period 1900-1975 LAMBROS

Bibliographic Details

Bibliographic Management

digitized by - The Polis Project, Inc · digitized by . digitized by . digitized by

Bibliographic Database: WorldCat

1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers.

“A Digitized Enterprise needs a Digitized Workforce”msd2017.metrodata.co.id/microsite-2015/images/key... · 2017-09-15 · “A Digitized Enterprise needs a Digitized Workforce