Our Marathon Presentation at DH Data Curation Workshop

25
Our Marathon: The Boston Bombing Digital Archive DH Data Curation Workshop May 1, 2014 facebook.com/OurMarathon www.northeastern.edu/marathon @OurMarathon

Transcript of Our Marathon Presentation at DH Data Curation Workshop

Page 1: Our Marathon Presentation at DH Data Curation Workshop

Our Marathon: The Boston Bombing Digital Archive

DH Data Curation Workshop

May 1, 2014

facebook.com/OurMarathon www.northeastern.edu/marathon@OurMarathon

Page 2: Our Marathon Presentation at DH Data Curation Workshop

TELL A WIDE RANGE OF STORIES

Page 3: Our Marathon Presentation at DH Data Curation Workshop

“NO STORY IS TOO SMALL”

Page 4: Our Marathon Presentation at DH Data Curation Workshop

BUILD A LASTING COMMUNITY MEMORIAL

Page 5: Our Marathon Presentation at DH Data Curation Workshop

PRESERVE THE HISTORICAL RECORD

Page 6: Our Marathon Presentation at DH Data Curation Workshop

AUDIENCES

• Regional: Boston, MA residents directly and indirectly affected by these events

• More broadly, a “general” audience of anyone interested in these events

• Researchers and Scholars: interest in preserving items / files and creating / preserving metadata

Page 7: Our Marathon Presentation at DH Data Curation Workshop

BUILDING OUR MARATHON

4,700+ items

Boston City Archives material

289 stories from the Globe Lab

307 memes (image macros)

40 oral histories (WBUR)

raw news footage (WCVB-TV)

Page 8: Our Marathon Presentation at DH Data Curation Workshop

WBUR ORAL HISTORY PROJECT

Page 9: Our Marathon Presentation at DH Data Curation Workshop

BOSTON CITY ARCHIVES COLLECTION

Page 10: Our Marathon Presentation at DH Data Curation Workshop

OMEKA (“WORDPRESS FOR MUSEUMS”)

Page 11: Our Marathon Presentation at DH Data Curation Workshop

HOW WE’RE USING OMEKA

• Dublin Core Metadata (currently Simple; transitioning to Extended Dublin Core imminently)

• Modified Contribution Plugin: Crowdsourced contributions submit Item Type Metadata

• Geotagging Items

• Tagging Items (Search Functionality / Organization)

Page 12: Our Marathon Presentation at DH Data Curation Workshop

KINDS OF ITEMS IN THE ARCHIVE

• “Born Digital” Material (photos, text, memes, screencaps)

• Scanned / Digitized Items (BCA Items, Boston Medical Center Items)

• Modified Items (redacted files, edited audio files)

Page 13: Our Marathon Presentation at DH Data Curation Workshop

SOME ITEMS / FILE TYPES IN THE ARCHIVE

• BCA Items (Hi-Res Scans: TIF files; JPEG Copies)

• Web Sites (Archive-It / The Internet Archive)

• Oral History Audio Files: .wav and .mp3

• Crowdsourced contributions: variety

• Social media files: screencaps

Page 14: Our Marathon Presentation at DH Data Curation Workshop
Page 15: Our Marathon Presentation at DH Data Curation Workshop
Page 16: Our Marathon Presentation at DH Data Curation Workshop
Page 17: Our Marathon Presentation at DH Data Curation Workshop

Crowdsourcing

Challenges of “Born Digital” Content

• Perceived Value By Contributor• Copyright Issues and Social Media• Preservation Challenges• Metadata Challenges

Page 18: Our Marathon Presentation at DH Data Curation Workshop

DUBLIN CORE METADATA FIELDS

• Title

• Description

• Source

• Date

• Rights

• Language

Page 19: Our Marathon Presentation at DH Data Curation Workshop

GEOTAGGING

Page 20: Our Marathon Presentation at DH Data Curation Workshop

TAGS

Page 21: Our Marathon Presentation at DH Data Curation Workshop

LONG-TERM PRESERVATION PLANS

• Northeastern’s Libraries (Archives & Special Collection) is final home of Our Marathon items

• Items Public Now Will Be Public In The Future

• “Planned Obsolescence” (Home Page / Site)

• Five year position (Basic Monitoring of Archive)

• Oral History Audio Files: .wav and .mp3

• Crowdsourced contributions: variety

• Social media files: screencaps

Page 22: Our Marathon Presentation at DH Data Curation Workshop

SHORT-TERM CHALLENGES

• What Metadata Cleanup to Do Now (BCA Items, Public Submissions)

• How To Make Content More Accessible (Tags, Maps)

• Social Media Content (Tweets)

Page 23: Our Marathon Presentation at DH Data Curation Workshop

SOME LONG-TERM CHALLENGES

• Institutional Memory of Project (Documentation of Methodologies, Meta-Archive)

• When to phase out web site / “Share Your Story” Plugin

• When to make sensitive material public

• Approval Process for Researchers / Scholars

Page 24: Our Marathon Presentation at DH Data Curation Workshop
Page 25: Our Marathon Presentation at DH Data Curation Workshop

Thanks!

[email protected]

Twitter: @JimMc_Grath