Automating & Scaling Digital Preservation in the Cloud...Automating & Scaling Digital Preservation...

29
Automating & Scaling Digital Preservation in the Cloud Future-proofing long-term records and digital content Mike Quinn, CEO

Transcript of Automating & Scaling Digital Preservation in the Cloud...Automating & Scaling Digital Preservation...

Automating & Scaling Digital Preservation in the Cloud

Future-proofing long-term records and digital content

Mike Quinn, CEO

A world where Active Digital Preservation is business as usual, seamlessly integrated into the systems that manage and protect our most valuable digital information.

A world where the potential of our digital memory is harnessed to enrich and protect our cultural, social, economic and political lives.

Our Vision

Corporate & business

Government & state

National & pan-national

Memory & education

Protecting the history of the news, ensuring authenticity, continuity and access

“It was vital to protect our digital assets from therisks of technology obsolescence by using digitalpreservation techniques much more sophisticatedthan simply storing the “bits and bytes”

Valerie Komor , Director - AP Archives

“We needed a system that could help us govern information over the long-term, and also integrate with our existing systems in order to give us a single, cohesive view of our most important information assets”

Tina Staples

Global Head of Archives

Future-proofing

digital assets of unique strategic, historical and brand importance

Preserving governance, critical infrastructure plans, & corporate history“It was important for us to not just use the Cloud tostore digital content, but to also have a completedigital preservation platform that would allow us toactively migrate files to newer formats over time toensure they remain useable and readable forfuture generations”.”

Tamara Thornhill, Corporate Archivist

From sepia photograph to 1.2 Million views

Digital archive of 100,000+ digitized & born-digital heritage assets

Be more. Achieve more. Together.

Simple, powerful

upload tools

Post-ingest flexibility

User driven roadmap

Advanced access and discovery

Single intuitive application

A trusted living archive – why our users love Preservica

Active Digital Preservation

APIsAdvanced

security and administration

Storage and deployment

choice

Flexible content and metadata management

Content contributors

Content consumers

Information managers

Easy andautomated content

acquisition

Integrated secure access and discovery

Designed & Architected for Automated & Scalable Digital Preservation

A trusted living archive: designed to be discoverable

✓ Easy content ingest & acquisition

✓ Active migration to new file formats

✓ Organized, trustworthy digital assets

✓ Safe intelligent storage choice

✓ Flexible collection & metadata management

✓ Catalog synchronization

✓ Secure, authenticated access and discovery

OAIS: ISO 14721

Proven. Secure. Flexible.

▪ Cloud hosted on AWS (available On-Premise)

▪ Storage neutral on multiple platforms

▪ Purpose built to ISO 14721 OAIS standard

▪ Developed to ISO 9001 & ISO 27001 standards

▪ GDPR-ready

Preservica – Optimized on AWS

▪ S3 and Glacier used extensively by customers▪ Will be able to use Deep Archive in future – based

on user requirements for WORM

▪ Automated backups replicated across multiple Availability Zones

▪ We simplify and manage cross region replication to build upon AWS’ 99.999999999% durability

Optimized, intelligent storage

S3

Glacier

Deep Archive

AZ 1

AZ 2 AZ 3

Customer content

Customer content

Replication

Preservica Innovation team hosted customer webinars demonstrating:

▪ Facial description▪ Audio transcript▪ Image description▪ Text sentiment analysis

Trusted by Public Sector Organizations in US & GloballyAlabama Department of Archives and HistoryArkansas State ArchivesBoston City ArchivesCalifornia State ArchivesCity of SomervilleClark County, NevadaGeorgia Archives - University System of GeorgiaIllinois State ArchivesKansas Historical SocietyKentucky Department for Libraries and Archives Massachusetts State ArchivesMichigan History Center/Archives Of MichiganMinnesota Historical SocietyNew York State ArchivesOhio History ConnectionRhode Island Department of StateSouth Carolina Department of Archives and HistoryState Historical Society of North DakotaState of California - Office of Legislative CounselTennessee State Library and ArchivesTexas State Library and Archives CommissionVermont State Archives and Records AdministrationWisconsin State Historical SocietyPreservica Public Sector Customers:

▪ 20 US state archives▪ 15 national archives▪ Pan-national organizations▪ UK central government▪ US & UK county & city government

Inter-agency Records Transfer Survey

▪ Preservica is collaborating with the Council of State Archivists (CoSA) and National Association of State CIOs (NASCIO) to conduct survey research examining transfer practices of permanent state electronic records from three perspectives:▪ State agencies▪ IT support – agency, central IT or third party▪ State archives

▪ Survey topics include:▪ Categories of permanent records appraised for transfer to Archives▪ Existing transfer capabilities and protocols▪ Software and data management systems containing permanent records▪ Availability of guidance▪ Preservation planning coordination

https://www.surveymonkey.com/r/InteragencyRecordsTransfer

Texas State Library & Archives Commission (TSLAC)

Background▪ Official library and archives of Texas, received large volume of

Governor Rick Perry’s digital records in 2014

Customer Concerns▪ Legal mandate to ensure future access and FOI/transparency▪ Prohibitive cost of using own state data center

Solution Offered▪ Preservica hosted on AWS GovCloud (US) – S3 and Glacier

Success Metrics▪ Launch of the Texas Digital Archive – online public access▪ Minimal local IT resources with scalable cloud deployment▪ Geographical dispersal of multiple data copies

Meeting FOI/FOIA and citizen access requirements

Texas Digital Archive Kentucky State Digital Archives

Library and Archives Canada (LAC)

Background▪ One of the world’s largest library and archival institutions▪ To serve as the continuing memory of the Government of Canada

and its institutions

Business Drivers▪ Preserving digital documentary history for all Canadians▪ 6.7 PB of data (over 13.4 PB for 2 copies), 190 million digital objects▪ Processing 120 TB/month, retrieving 1.5 million objects per year▪ Scaling to 300 TB/month

Digital Preservation▪ Preservica on TeraMach Cloudx platform▪ Hosted on AWS Montreal (Central)▪ A sustainable Digital Preservation platform for the nation

Making

Digital Preservation

Automatic &

Scalable

Preservica & AWS: Proven Scalability

EU (Ireland)

Asia Pacific (Sydney)

US East (N.Virginia)

Canada (Central)

AWS GovCloud (US)

▪ Preservica Enterprise Private Cloudo can be deployed in any AWS region

▪ Optimize storage costs with choice of AWS S3 and Glacier

▪ Scale with confidence: low-cost storage aligned to AWS price

▪ AWS Snowball for bulk transfer

▪ AWS encryption

Preservica SaaS Active Digital Preservation

▪ Long-term accessibility & authenticity of digital records

▪ Preservica Cloud Editiono available in multiple AWS regions...with

more to come

Low cost storage at AWS price

Active Digital Preservation

APIsAdvanced

security and administration

Storage and deployment

choice

Flexible content and metadata management

Content contributors

Content consumers

Information managers

Easy andautomated content

acquisition

Integrated secure access and discovery

Designed & Architected for Automated & Scalable Digital Preservation

AI

Manual or auto ingest

into Preservica

Manual preparation

Auto preparation

Auto upload

Manual upload

Preservica ready format

Manual

Auto

AI

Pre

-pro

cess

APIs

Automated Content Acquisition

AWS AI Services

▪ Image and Video Analysis▪ Advanced Text Analytics▪ Document Analysis▪ Voice▪ Translation▪ Transcription

Intelligent Archiving Engine™

Connector

Content contributors ▪ Hierarchical rules engine using

user-defined fields▪ Continuous, periodic or one-off

content harvest▪ Identify and extract content and metadata▪ Transform and merge files▪ Load and validate

Automated Content Acquisition for SharePoint

Agency(County, State or Federal)

Automated transfer

Automated transfer

Securely hosted on

Securely hosted on

Active Digital Preservation

State Agency

Automated Content Acquisition & Transfer – State Agencies

Operational system

Cutoff

State Archive

Securely hosted on

Active Digital Preservation

State Agency

Operational system

Cutoff

*after designated period

County Agency

County Agency

County Archive

Also applies to County Agencies

▪ User Access (UA) application built within Preservica

▪ Secure accessibility, self-service (as appropriate)

▪ Advanced search (fielded, faceted, filtered)

▪ Easily customizable

Advanced Access and Discovery

Making Access & Discovery Engaging for All

▪ The Preservation Action Registry (PAR) - sharing of best practice across the industry

▪ Extending the Registry to include business rules

▪ Better preservation tools for characterization, migration, rendering

▪ Applying the rules automatically & seamlessly

We’re making

Auto-Preservation

a Reality

Hear from others in the community▪ Be more. Achieve more. blog series▪ Case studies, videos & whitepapers▪ Learn more at preservica.com

See Preservica in action▪ Live Demo : Thursday @ 10am EST▪ Local Government Product Showcase

- March 6th @ 10am EST / 3pm GMT

Upcoming events▪ NAGARA Conference : St Paul, MN - July ▪ Archives*Records : Austin, TX - August

Speak to a Digital Preservation specialist▪ Contact us today on [email protected]

Thank you

[email protected]

preservica.com

@preservica

@dPreservation