Digital Preservation with Archivematica
Dan Gillean
Glenstone Digital Preservation Roundtable
November 4th, 2016
Let’s talk digital preservation
ISO 14721:2002 ISO 16363:2012
• Governance
• Organizational structure
• Staffing
• Procedural accountability
• Preservation policy framework
• Documentation
• Financial sustainability
• Security
ISO 16363 reminds us that much of digital preservation readiness is not technical – it’s organizational
ISO 16363
Meet Archivematica (hello world!)
What is Archivematica?
Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic and reliable digital content.
• Standards based
• Open source
• Customizable
• Integrated with 3rd party systems
• Active community
Archivematica’s development
[Timeline figure, 2007 to 2014]
2007: UNESCO report. Bradley, K., Lei, J., Blackall, C. Towards An Open Source Archival Repository and Preservation System (2007)
Planning and development begin. Initial funding via UNESCO MotW Subcommittee, IMF Archives, City of Vancouver Archives
Milestones shown: 0.1-alpha; 0.6-alpha; Dashboard introduced; 0.7; 0.8; PREMIS in METS; 0.9; 0.10; 1.0 released!; Storage Service 0.2
Dates shown: February 2010; May 2010; February 2011; February 2012; August 2012; April 2013; January 2014
Archivematica AIP structure
• Packaged according to BagIt specifications
• PREMIS in METS XML
• Virus scan, normalization report, extraction log, etc.
• For browsing in Archivematica
• Original + normalized objects, submission docs, original metadata included at SIP creation
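Since the AIP is packaged according to the BagIt specifications, its payload can be checked against the bag's checksum manifest. The sketch below is illustrative only and is not Archivematica code: it builds a tiny throwaway bag (the `demo-bag` name and `data/hello.txt` payload are hypothetical) and verifies it against a `manifest-sha256.txt` file, which is the manifest naming BagIt uses for SHA-256.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(bag_dir):
    """Return the manifest paths that are missing or fail their checksum."""
    bag_dir = Path(bag_dir)
    failures = []
    manifest = bag_dir / "manifest-sha256.txt"
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue
        # Each manifest line is "<checksum> <relative path>".
        expected, rel_path = line.split(maxsplit=1)
        target = bag_dir / rel_path
        if not target.is_file() or sha256_of(target) != expected:
            failures.append(rel_path)
    return failures


def make_demo_bag(root):
    """Create a minimal, hypothetical bag for demonstration purposes."""
    bag = Path(root) / "demo-bag"
    (bag / "data").mkdir(parents=True)
    payload = bag / "data" / "hello.txt"
    payload.write_bytes(b"hello archive\n")
    (bag / "bagit.txt").write_text(
        "BagIt-Version: 0.97\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag / "manifest-sha256.txt").write_text(
        f"{sha256_of(payload)}  data/hello.txt\n", encoding="utf-8"
    )
    return bag


with tempfile.TemporaryDirectory() as tmp:
    bag = make_demo_bag(tmp)
    ok_before = verify_manifest(bag)          # nothing should fail yet
    (bag / "data" / "hello.txt").write_bytes(b"tampered\n")
    bad_after = verify_manifest(bag)          # the altered file should fail

print("failures before tampering:", ok_before)
print("failures after tampering:", bad_after)
```

A real AIP would normally be checked with an existing BagIt tool rather than hand-written code; the point here is only that fixity is verifiable from the package itself.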
PREMIS in METS
METS sections
<metsHdr> METS header
<dmdSec> Descriptive metadata
<amdSec> Administrative metadata
<fileSec> File section
<structMap> Structural Map
PREMIS inside <mets:amdSec>
<mets:techMD> PREMIS: OBJECT
<mets:rightsMD> PREMIS: RIGHTS
<mets:digiprovMD> PREMIS: EVENT
<mets:digiprovMD> PREMIS: AGENT
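To make the section layout concrete, here is a minimal sketch that lists the top-level METS sections and counts the metadata wrappers inside amdSec. The sample XML is a hand-written skeleton for illustration (the IDs are made up, and it is not schema-valid or taken from a real Archivematica METS file); only the METS namespace URI and the element names come from the METS standard.

```python
import xml.etree.ElementTree as ET
from collections import Counter

METS_NS = "http://www.loc.gov/METS/"

# Hypothetical skeleton: empty sections, just enough to show the layout.
SAMPLE_METS = """<mets:mets xmlns:mets="http://www.loc.gov/METS/">
  <mets:metsHdr/>
  <mets:dmdSec ID="dmdSec_1"/>
  <mets:amdSec ID="amdSec_1">
    <mets:techMD ID="techMD_1"/>
    <mets:rightsMD ID="rightsMD_1"/>
    <mets:digiprovMD ID="digiprovMD_1"/>
    <mets:digiprovMD ID="digiprovMD_2"/>
  </mets:amdSec>
  <mets:fileSec/>
  <mets:structMap/>
</mets:mets>"""


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def summarize_mets(xml_text):
    """Return (top-level section names, counts of amdSec child wrappers)."""
    root = ET.fromstring(xml_text)
    sections = [local_name(child.tag) for child in root]
    amd_counts = Counter()
    for amd in root.findall(f"{{{METS_NS}}}amdSec"):
        for child in amd:
            amd_counts[local_name(child.tag)] += 1
    return sections, dict(amd_counts)


sections, amd_counts = summarize_mets(SAMPLE_METS)
print(sections)
print(amd_counts)
```

In this skeleton the two digiprovMD wrappers stand in for the PREMIS event and PREMIS agent entries named on the slide.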
Embracing Openness
• Open Source
• Open Standards
• Open Formats
• Open documentation
https://commons.wikimedia.org/wiki/File:Postcards_and_magnifying_glass.jpg
What is Free Software?
A program is free software if the program's users have the four essential freedoms:
1. The freedom to run the program as you wish, for any purpose (freedom 0).
2. The freedom to study how the program works, and change it so it does your computing as you wish (freedom 1). Access to the source code is a precondition for this.
3. The freedom to redistribute copies so you can help your neighbor (freedom 2).
4. The freedom to distribute copies of your modified versions to others (freedom 3). By doing this you can give the whole community a chance to benefit from your changes. Access to the source code is a precondition for this.
Free Software Foundation, Free Software Definition
https://www.fsf.org/licensing/essays/free-sw.html
What is Freedom?
“Free as in…”
Beer? Speech? Kitten?
https://commons.wikimedia.org/wiki/File:Beer_mug_transparent.png
http://www.freestockphotos.biz/stockphoto/9343
What is Freedom?
“It’s all 3… and that’s not a bad thing”
Community-based development
…and many others
Broad and active user community
Development Philosophy
Community-based development
• Standards-based
• Open source / Creative Commons
• Generalize specific use cases
• Include all features in public release
• Accept community improvements
• Iterative development via multiple contributions over subsequent releases
Bounty model of business
• Maintain: documentation, software, wiki
• Produce additional resources (e.g. videos, presentations, webinars)
• Participate in user forum
• Offer additional paid services
• Always include development in public project
Do one thing well…
• Micro-services
• Handshakes
• Partnerships
Gears – Joe DeSousa. https://www.flickr.com/photos/mustangjoe/22711070429
Metal Handshake – Grey Geezer. https://commons.wikimedia.org/wiki/File:Metal_Handshake.jpg
Hands Passing Baton - tableatny, https://www.flickr.com/photos/53370644@N06/4976497160
Micro-services Architecture
Micro-services: Tools Used
• Bulk_extractor
• ClamAV
• Elasticsearch
• ExifTool
• FFmpeg
• Fido
• FITS
• Gearman
• gzip
• ImageMagick
• Inkscape
• JHOVE
• md5deep
• MediaInfo
• NFS-common
• Python-lxml
• unar
• UUID
• …and more…
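The "do one thing well" idea behind micro-services can be sketched as a chain of small, single-purpose steps that each record what they did. This is a toy illustration only, not Archivematica's implementation (which distributes work via Gearman and calls external tools such as those listed above); the `Package` class, the step names, and the extension-based format guess are all hypothetical.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Package:
    """Hypothetical stand-in for a transfer moving through the pipeline."""
    name: str
    files: dict                      # filename -> file content as bytes
    events: list = field(default_factory=list)


def assign_checksums(pkg):
    """One job only: record a SHA-256 checksum for every file."""
    for filename, content in pkg.files.items():
        pkg.events.append(
            ("checksum", filename, hashlib.sha256(content).hexdigest())
        )
    return pkg


def identify_formats(pkg):
    """One job only: guess a format from the extension (a real system
    would delegate this to a dedicated identification tool)."""
    for filename in pkg.files:
        ext = filename.rsplit(".", 1)[-1].lower() if "." in filename else "unknown"
        pkg.events.append(("format identification", filename, ext))
    return pkg


# The pipeline is just an ordered list of independent steps.
PIPELINE = [assign_checksums, identify_formats]


def run_pipeline(pkg, steps=PIPELINE):
    """Pass the package through each step in order."""
    for step in steps:
        pkg = step(pkg)
    return pkg


pkg = run_pipeline(Package("demo-transfer", {"letter.TXT": b"hello"}))
for event in pkg.events:
    print(event)
```

Because each step only reads the package and appends events, steps can be added, removed, or swapped for a different tool without touching the others.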
Handshakes:
Integration not duplication
Handshakes:
• AtoM
• ArchivesSpace
• Archivist’s Toolkit
• Arkivum
• Binder
• CONTENTdm
• Dataverse
• DSpace
• DuraCloud
• Fedora
• HP TRIM
• Hydra
• Islandora
• LOCKSS
• OpenStack
Still from the film Color Blind - Photograph by Pui Shan Chan February 2009. https://en.wikipedia.org/wiki/File:Black_%26_White_Handshake_-_Still_from_the_film_Colour_Blind_(2009).JPG
Partnerships
Building Community Services Together
archivesDIRECT
• Partnership with DuraSpace
• U.S. based
• Launched August 2014
• Secure storage and monitoring via DuraCloud
• Artefactual provides AM technical support
http://archivesdirect.org/
Perpetua
• Partnership with Arkivum
• U.K. based
• Launched July 2016
• Secure storage and monitoring via Arkivum
• Artefactual provides AM technical support
http://arkivum.com/perpetua/
ArchivesCANADA Digital Preservation Service
• Partnership with The Canadian Council of Archives (CCA)
• Canada based
• Launched September 2016
• Artefactual provides AM technical support, storage, monitoring
http://archivescanada.ca/ACDPS
Thanks!
RESOURCES
AM homepage: https://www.archivematica.org
AM demo: http://sandbox.archivematica.org
Wiki: https://wiki.archivematica.org
Documentation: https://www.archivematica.org/docs/
Roadmap: https://wiki.archivematica.org/Development_roadmap:_Archivematica
Issue tracker: https://projects.artefactual.com/projects/Archivematica
Code repo: https://github.com/artefactual/Archivematica
Forum: https://groups.google.com/forum/#!forum/archivematica