How I failed to present on using DVCS to control archival metadata
Batch metadata assignment to archival photograph collections using facial recognition software
-
Upload
kyle-banerjee -
Category
Technology
-
view
483 -
download
2
Transcript of Batch metadata assignment to archival photograph collections using facial recognition software
![Page 1: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/1.jpg)
Batch metadata assignment to archival photograph
collections using facial recognition softwareKyle [email protected]
![Page 2: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/2.jpg)
Why should anyone care?
Current methods for assigning metadata are:
•Slow
•Difficult
•Error Prone
•Incomplete
2
Filing code stencil cards at the W. Atlas Burpee Company Library of Congress Prints and Photographs Division
![Page 3: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/3.jpg)
A few challenges
• Libraries and archives use external systems to maintain metadata
• Archival images are huge and clunky to work with
• Metadata standards for image files are implemented inconsistently and weren’t designed with library needs in mind
3
![Page 4: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/4.jpg)
Automation
• Process in bulk• Use metadata embedded
within the image
4
Fran Bilas Spence and Jean Jennings Bartik work on ENIACARL Technical Library
• Use the file system• Use consumer grade software as a force multiplier• Improve search engine visibility and simplify
migrations
![Page 5: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/5.jpg)
What you need to get started
• A computer with the operating system of your choice
• Mad programming skilz
• Modest scripting ability (any language)
5
![Page 6: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/6.jpg)
Image metadata demystified
$ head lovejoy-moskovetz_1923.tif
II▒▒▒@d▒▒F▒(1▒2▒▒ ▒▒]BI▒▒ ▒Ci▒Black and white photograph of Esther Pohl Lovejoy and Doctors Elliot and Moskovetz in Athens in 1923.▒▒['▒▒['Adobe Photoshop CS2 Windows2012:04:10 14:16:16<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
[a few lines deleted here]
<rdf:Description rdf:about=""
xmlns:tiff="http://ns.adobe.com/tiff/1.0/">
<tiff:ImageWidth>6046</tiff:ImageWidth>
<tiff:ImageLength>4880</tiff:ImageLength>
[a few more lines deleted]
<dc:subject>
<rdf:Bag>
<rdf:li>Lovejoy</rdf:li>
<rdf:li>Moskovetz</rdf:li>
</rdf:Bag>
</dc:subject>
6
![Page 7: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/7.jpg)
Facial recognition
• People are an important access point• Provides authority control by nature• Identification of individuals helps
determine other details
7
Facial recognition primerWPI Transformations
• Extraction of faces simplifies manual identification
• Non-specialist staff can do more metadata work
![Page 8: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/8.jpg)
Useful software
• Free Picasa software works great
8
• Stores person info in a combination of contacts.xml and .picasa.ini files
![Page 9: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/9.jpg)
9
![Page 10: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/10.jpg)
Since I know you’re wondering, it’s no good for…
10
![Page 11: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/11.jpg)
.picasa.ini
[lovejoy-esther_portrait_nd.jpg]
faces=rect64(135a175de074cd8b),c0ef2256901bfbb6
backuphash=23375
[matarrazo-joseph_2001.jpg]
faces=rect64(3407026fe607ac00),c2c65f903b3150cb
backuphash=33
11
![Page 12: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/12.jpg)
contacts.xml
<contact id="c0ef2256901bfbb6" name=“Esther Pohl Lovejoy" modified_time="2012-11-26T09:48:04-08:00" local_contact="1"/>
<contact id="c2c65f903b3150cb" name="Joseph Matarazzo" modified_time="2012-11-30T15:02:10-08:00" local_contact="1"/>
12
![Page 13: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/13.jpg)
Adding metadata en masse
• Exiftool (available for all platforms) is incredibly handy
exiftool -XMP-dc:Subject+=‘My new heading’ myimage.tif
exiftool -XMP-iptcExt:PersonInImage+=‘Doe, John’ myimage.tif
• Notice the Dublin Core subject tag• DC doesn’t define people explicitly as subjects
so we used IPTC extensions here
13
![Page 14: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/14.jpg)
14
![Page 15: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/15.jpg)
Exiftool is useful for reading metadata
• Exif stores excellent technical metadata so it’s nuts to hand key this into other systems
• Usage is brain deadexiftool filename (Labeled display)
exiftool –X filename (XML)
exiftool –T filename (Tab delimited)
• Many powerful options
15
![Page 16: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/16.jpg)
You need 3 image metadata standards
• Exif for technical metadata
• IPTC for many descriptive fields
• XMP for specialized information needed by archivists and librarians
16
![Page 17: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/17.jpg)
A glimpse into the future
• Social metadata• Union catalogs contain better metadata than
local catalogs• Create richer and more accurate metadata
much faster and cheaper than is otherwise possible
17
![Page 18: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/18.jpg)
18
![Page 19: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/19.jpg)
Before going nuts on your photos…
Picasa can mess up existing metadata if you let it write tags (facial recognition doesn’t use tags)
You can create new tags, but don’t expect other software to read them
Facial recognition is a handy tool, but don’t use it as a crutch
Always test before performing batch metadata modifications or you may wind up blasting out existing metadata
19
![Page 20: Batch metadata assignment to archival photograph collections using facial recognition software](https://reader035.fdocuments.in/reader035/viewer/2022062420/55bdd673bb61ebad0a8b4858/html5/thumbnails/20.jpg)
Takeaways from this presentation
1. Facial recognition is easy with Picasa
2. Exiftool is incredibly useful for reading and writing image metadata
3. Learning to use embedded metadata is easy and makes too much sense not to do
20