Siri sgrpmtg05092013

15
Biodiversity Heritage Library: A Mass Scanning Mix of Metadata Bianca Crowley, Collections Coordin Biodiversity Heritage Library Smithsonian Libraries 4/24/22

Transcript of Siri sgrpmtg05092013

Page 1: Siri sgrpmtg05092013

Biodiversity Heritage Library:

A Mass Scanning Mix of Metadata

Bianca Crowley, Collections CoordinatorBiodiversity Heritage LibrarySmithsonian Libraries Apr 13, 2023

Page 2: Siri sgrpmtg05092013
Page 3: Siri sgrpmtg05092013

BHL Overview

• http://biodiversitylibrary.org• New user interface launched in March• Search by title, author, article, subjects and

scientific names• Various download options, even high

resolution• Taxonomic name finding algorithm• Machine-to-machine services

Page 4: Siri sgrpmtg05092013

Core BHL Member Institutions

Smithsonian Libraries: 6,800+ titles Nearly 18,000 volumes 7 million+ pages

Page 5: Siri sgrpmtg05092013

Digitization Workflow

Page 6: Siri sgrpmtg05092013

Metadata

1. Titles vs. Items vs. Segments

2. Metadata we need: • MARC for book and journal titles • Volume information • Page data

BHL Term Library Term Meaning Metadata

Title Book or Journal Titles

Conceptual Unit MARC record

Item Volume, Piece Object Derived from holdings + created @ digitization

Segment Article, Book Chapter, Part

Section of consecutive pgs

Harvested from BioStor.org or created post digitization

Page 7: Siri sgrpmtg05092013

Smithsonian Libraries Record

BHLRecord

Page 8: Siri sgrpmtg05092013

Metadata Challenges

• BHL collection aggregates metadata from 15 member library catalogs

• Also aggregating metadata from a couple hundred Internet Archive contributors

• Default page metadata created at time of scanning lacks detail esp. for plates, figures, etc.

• Taxonomic name finding algorithm only as good as optical character recognition (OCR)

Page 9: Siri sgrpmtg05092013

BHL ADMINISTRATIVE DASHBOARD ALLOWS ACCESS TO BACKEND DATABASE

Page 10: Siri sgrpmtg05092013

User Feedback is Critical

General feedback form Scan request form

http://biodiversitylibrary.org/contact

Page 11: Siri sgrpmtg05092013

Gemini Issue Tracking System

Page 12: Siri sgrpmtg05092013

©opyright MetadataCopyright Status Status language License

Permissions (1923 in-copyright pubs)

In copyright. Digitized with the permission of the rights holder.

Creative Commons Attribution, Non-Commercial, Share-Alike (CC-BY-NC-SA)

Due Diligence (1923-77 US pubs)

No known copyright restrictions as determined by scanning institution.

NA

Public Domain (pre-1923)

Public domain. The BHL considers that this work is no longer under copyright protection.

NA

• We also have an open data policy – metadata 100% available for reuse under a CC0 license (public domain)

Page 13: Siri sgrpmtg05092013

Impact• “BHL came to the rescue when a planned trip to work in the Mertz Library at The New

York Botanical Garden had to be cancelled due to Hurricane Sandy. Thanks to the online resources available through BHL I was able to source most of the key works I needed, with their supporting bibliographic information. Further use of BHL occurred when building work at the Linnean Society of London limited access to some of the book I had been able to use from that collection."

• “I would like thank you all very much for invaluable work and support you do. I just got a pdf-file from more than century old (1893) journal paper (regional naturalist society paper, published in Finland), to get copy I should take 500 mile drive to our university library. Now I am got it fastly in high-quality pdf-copy. Cordial thanks and all success in continuing your highly valuable mission.” [conservation biologist from Estonia]

• “You are a wonderful resource. I maintain a Website that describes the plant genus Opuntia (prickly pear cacti). There is no way I could maintain such a site without access to literature from 100-200 years ago. Most of the cactus species were discovered long ago; I find it invaluable to put up PDF files to document each species in the literature as I document them photographically. I am a botanist, but I work in the pharmaceutical field (not so many botanical jobs out there). Your library makes it possible for me to continue working with plants in a meaningful and scientific manner.”

Page 14: Siri sgrpmtg05092013

Impact

• Repackaging content in new ways for new audiences via:– flickr, Facebook, Twitter, & Pinterest– iTunes U & iBooks

• Open data & APIs – Put content where users are already working

(Encyclopedia of Life-EOL.org, Int’l Plant Names Index-IPNI.org, Tropicos.org)

– Gets power users to work for us (for free!) e.g. BioStor.org, synynyms.com

Page 15: Siri sgrpmtg05092013

Questions?

Bianca [email protected]

Thank you

http://biodiversitylibrary.org