Web Archiving: Description and Access
-
Upload
elizabeth-lily-pregill -
Category
Technology
-
view
417 -
download
0
Transcript of Web Archiving: Description and Access
Web Archiving: Description and Access
Lily PregillNYARC Coordinator & Systems Manager
Metropolitan New York Library CouncilWeb Archiving Series, Part 3
February 29, 2016
Chocolate + peanut butter approach
Descriptive metadata + full-text indexing are both essential to drive discovery and retrieval of web archives
What is NYARC?
2009
2010
2006
2012
2015
2013
Brooklyn Museum + The Frick Collection + MoMA
New York Art Resources Consortium (NYARC) formed
Launched Arcade, shared Millennium ILS
Archive-It and Auction Catalogs Pilot Project
Mellon Grant: Reframing Collection for a Digital Age
Mellon Grant: Making the Black Hole Gray
Launched NYARC Discovery
Archive-It
Thematic Collections
Art ResourcesArtists’ WebsitesAuction HousesCatalogues RaisonnésNYC Galleries Restitution of Lost or Looted Art
Institution-based Collections
Brooklyn MuseumThe Frick CollectionMoMANYARC
10 collections > 250 websites + growing…
http://nyarc.org/webarchive
Accessing Web Archives
URL driven search
Multiple levels of searchCombined full-text and DC metadata search on collection page
NYARC Discovery
Arcade, NYARC’s classic cataloghttp://arcade.nyarc.org
Archive-Ithttp://nyarc.org/webarchive
NYARC Discoveryhttp://discovery.nyarc.org
NYARC Discovery
Info icon hover text:
NYARC Discovery
NYARC Discovery: surfacing uncataloged content
Search: maya angelou bearden train
NYARC Discovery: discover local blog posts
Metadata Profile
http://www.nyarc.org/sites/default/files/web-archiving-profile.pdf
583 ##
ǂa capture ǂc [date captured]ǂh New York Art Resources Consortium ǂ5 NyNyARC ǂ2 pet [code for PREMIS event type]
Developed by Rebecca Gunther
Metadata Workflow
• Connexion: Begin cataloging in Connexion• Use Extract Metadata tool• Apply Local Constant Data built off the metadata profile• Upload to WorldCat • Export to local Millennium system (Arcade)• Millennium records ingested by Primo/NYARC Discovery weekly
Metadata Workflow: Constant Data Example
m o d 007 c ǂb r ǂd c ǂe n040 FXM ǂb eng ǂe rda ǂc FXM049 FXMA300 1 online resource : ǂb color illustrations336 text ǂb txt ǂ2 rdacontent336 still image ǂb sti ǂ2 rdacontent337 computer ǂb c ǂ2 rdamedia338 online resource ǂb cr ǂ2 rdacarrier520 Summary583 capture ǂc year ǂh New York Art Resources Consortium ǂ2 pet ǂ5 NyNyARC588 Description of the resource based on live site viewed on Month, Day, Year, and archived site; title from home page.655 7Web sites. ǂ2 aat85640ǂz Live site85640ǂu ǂz Archived site
Metadata: WorldCat
Metadata: Arcade (local catalog)
Metadata: NYARC Discovery
Where can I learn more?Archive-It • OpenSearch API
https://webarchive.jira.com/wiki/display/search/OpenSearch+API
• Metadata in Archive-Ithttps://webarchive.jira.com/wiki/display/ARIH/Metadata+in+Archive-It
NYARC Web Archiving Reports• Archive-It and Online Auction Catalogs (2010)
http://www.nyarc.org/sites/default/files/ait_leahy_report.pdf
• Reframing Collections for a Digital Age: Final Report (2013)http://www.nyarc.org/sites/default/files/reports/reframing_final_report2013.pdf
• Making the Black Hole Gray: Final Report (2016)http://www.nyarc.org/sites/default/files/making_the_black_hole_gray_final_report.pdf
NYARC Documentation• Metadata Application Profile
http://www.nyarc.org/sites/default/files/web-archiving-profile.pdf
• Metadata for Web Archived Resources: Recommendations for Further Exploration http://www.nyarc.org/sites/default/files/Recommendations%20for%20further%20exploration-final.pdf
• NYARC Wikihttp://wiki.nyarc.org
Website coming soon ….. OCLC Research Partners Web Archiving Metadata Working Group