Cabrinety-NIST Project: AMIA DAS 2015

45
Cabrinety-NIST Project Large-scale digital preservation of a legacy software collection AMIA Digital Asset Symposium November 19, 2015 Charlotte C. Thai Cabrinety Project Archivist [email protected] |

Transcript of Cabrinety-NIST Project: AMIA DAS 2015

Cabrinety-NIST ProjectLarge-scale digital preservation of a legacy software collection

AMIA Digital Asset Symposium November 19, 2015Charlotte C. ThaiCabrinety Project [email protected] |@cabrinety

1

Outline

Part 1: Cabrinety-NIST Project BackgroundStephen M. Cabrinety Collection in the History of Microcomputing, ca. 1975-1995Who is Stephen M. Cabrinety?NSRL Grant Details (2012-2014; 2014-2016)

Part 2: Cabrinety-NIST Project workflowGeneral overviewDeep dive into key sections

Part 3: Next steps

2

15,000+ software packages27 operating systems400 pieces computer hardware200 pieces computer-related realiaDocuments, ephemera, periodicals

Stephen M. Cabrinety collection in the History of Microcomputing, ca. 1975-1995

3

Stephen M. Cabrinety

4

NSRL GrantCabrinety-NIST ProjectFunding agencyNational Software Reference Library

Project PartnersStanford University LibrariesNIST (National Institute of Standards and Technology)

5

NSRL GrantStanford Digital RepositoryHydra Technology StackFedora Commons Ruby on RailsApache SolrBlacklight

http://projecthydra.org

6

7

STARTRequest boxProcessingRegistrationCatalogingPhotographyAuditingNIST receives boxCreate disk imageShip from SUL to NISTVerify imageCreate Box Archive ingest packageShip box from NIST to SULReceive box at StanfordDownload Box ArchivePerform physical QCAccession files and metadata into Stanford Digital RepositoryEndCreate PURLReturn box

Perform digital QC

Cabrinety-NIST Project WorkflowStage 1Stage 3Stage 2

8

SAL3 (Stanford auxiliary library 3)3,925,000 barcoded items170 average requests per day22 machine carts1 truck

9

SearchWorksSirsiDynix WorkFlows

10

SAL3 to Stanford Campus

11

Going the distanceLeg 1: SAL3 -> Stanford (40 miles)Leg 2: Stanford -> Redwood City (9 miles)Leg 3: Redwood City -> NIST (2,817 miles)Leg 4: NIST -> Stanford (2, 827 miles)Leg 5: Stanford -> Redwood City (9 miles)Leg 6: Redwood City -> SAL3 (40 miles)

Approx. distance each box travels: 5,742 miles

12

Wait for it

13

This Fruit Roll-Up is 22 years old.

14

Argo registration

15

Argo tracking sheet

16

DatastreamsDCRELS-EXTdescMetadatacontentMetadatatechnicalMetadata

17

18

19

20

21

1 Game = 55 Photographs

22

23

24

25

DisclaimerTrade names and company products are mentioned in the text or identified. In no case does such identification imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the products are necessarily the best available for the purpose.

2626

Perl media imager (custom, wraps dcfldd, etc.)

dcfldd (or, digital computer forensics lab dd)

27

Media Image Listingzy394kt3697-1/0b1a57009bb039c4f4dc87030d5179c0e4d3acd5:

-rw-r--r-- 1 admin wheel 0 Jul 22 15:14 0b1a57009bb039c4f4dc87030d5179c0e4d3acd5.dderr-rw-r--r-- 1 admin wheel 737280 Jul 22 15:15 0b1a57009bb039c4f4dc87030d5179c0e4d3acd5.img-rw-r--r-- 1 admin wheel 173 Jul 22 15:15 0b1a57009bb039c4f4dc87030d5179c0e4d3acd5.img.sha-rw-r--r-- 1 admin wheel 0 Jul 22 15:13 0b1a57009bb039c4f4dc87030d5179c0e4d3acd5.mountstr

zy394kt3697-2/aef4055512d90778c10a2b70f7232cf237e67023:

-rw-r--r-- 1 admin wheel 0 Jul 22 15:16 aef4055512d90778c10a2b70f7232cf237e67023.dderr-rw-r--r-- 1 admin wheel 737280 Jul 22 15:17 aef4055512d90778c10a2b70f7232cf237e67023.img-rw-r--r-- 1 admin wheel 173 Jul 22 15:17 aef4055512d90778c10a2b70f7232cf237e67023.img.sha-rw-r--r-- 1 admin wheel 0 Jul 22 15:15 aef4055512d90778c10a2b70f7232cf237e67023.mountstr

zy394kt3697-3/caad5a2e928551421fb567eeba0509f7f7f9be8a:

-rw-r--r-- 1 admin wheel 0 Jul 22 15:18 caad5a2e928551421fb567eeba0509f7f7f9be8a.dderr-rw-r--r-- 1 admin wheel 737280 Jul 22 15:19 caad5a2e928551421fb567eeba0509f7f7f9be8a.img-rw-r--r-- 1 admin wheel 173 Jul 22 15:19 caad5a2e928551421fb567eeba0509f7f7f9be8a.img.sha-rw-r--r-- 1 admin wheel 0 Jul 22 15:17 caad5a2e928551421fb567eeba0509f7f7f9be8a.mountstr

zy394kt3697-4/a7e8b599626bb1cf5de832ba5ef011aedbe40f31:

-rw-r--r-- 1 admin wheel 0 Jul 22 15:20 a7e8b599626bb1cf5de832ba5ef011aedbe40f31.dderr-rw-r--r-- 1 admin wheel 737280 Jul 22 15:20 a7e8b599626bb1cf5de832ba5ef011aedbe40f31.img-rw-r--r-- 1 admin wheel 173 Jul 22 15:21 a7e8b599626bb1cf5de832ba5ef011aedbe40f31.img.sha-rw-r--r-- 1 admin wheel 0 Jul 22 15:19 a7e8b599626bb1cf5de832ba5ef011aedbe40f31.mountstr

2828

5.25 inch floppy disk captureDevice Side Data FC5025 USB 5.25 inch floppy controller5.25 inch floppy drive

5.25 inch floppy disks

29

How is the data captured from cartridge media?

Retrode

30

SNES and Sega Genesis Checksums

http://www.stanford.edu/group/htgg/cgi-bin/drupal/?q=node/1179

31

CopyNES

32

Mappers

33

CRC32 (cyclic redundancy check)

34

GameBoy + Arduino

35

Computer CassetteEcasoundAudacity

36

Computer CassetteWAV2TAPVICE (VIC-20) Emulator

37

Box Archives

38

Accessioning: JIRA

39

40

Next steps?

41

EaaS: Emulation as a Service

42

Emulation: jsMESS

43

Questions?Contacts:

Stanford University LibrariesCharlotte C. Thai ([email protected], 650.497-7020)

National Institute of Standards and TechnologyDouglas White ([email protected], 301.642.8596)

44

LinksArduino: https://www.arduino.cc/

Audacity screen capture: Doug White presentation at Stanford Capture Lab Unconference 2015

CopyNES: www.retrousb.com

DCFLDD: http://dcfldd.sourceforge.net/

Emulation as a Service: http://bw-fla.uni-freiburg.de/

gphoto2: http://www.gphoto.org/

Hydra: http://projecthydra.org/

Image Science Associates: http://www.imagescienceassociates.com

Internet Archive Oregon Trail: https://archive.org/details/msdos_Oregon_Trail_The_1990

JMESS Github: https://github.com/jsmess/jsmess

Maze Commodore VIC-20 screen capture: Doug White presentation at Stanford Capture Lab Unconference 2015

NSRL: http://www.nsrl.nist.gov/

Pokemon Red Game Boy cartridge, Game Boy Cart Shield, Arduino: Christopher Fox blog How to Give Cartridge-Based Video Game Data an Extra Life (Part 2): http://web.stanford.edu/group/htgg/cgi-bin/drupal/?q=node/1205

Retrode: http://www.retrode.org/

Stanford Digital Repository: https://library.stanford.edu/research/stanford-digital-repository

Stanford Digital Repository Sprite PURL: https://purl.stanford.edu/kg712zz5745

45