Wittenberg Portico: Lessons From a Community Supported Archive
-
Upload
national-information-standards-organization-niso -
Category
Education
-
view
283 -
download
0
Transcript of Wittenberg Portico: Lessons From a Community Supported Archive
Portico is committed to the
preservation of scholarly literature
published in electronic form to
ensure that these materials remain
accessible to future generations of
scholars, researchers, and students.
PORTICO’S SERVICES
E-JOURNALPRESERVATION
SERVICE
E-BOOKPRESERVATION
SERVICE
D-COLLECTIONPRESERVATION
SERVICE
PUBLISHER & LIBRARY FEES
PUBLISHER FEES, NO LIBRARY FEE
NATIONALLIBRARY SERVICE
LIBRARY FEES, NO PUBLISHER FEE
WHAT WE DOPortico is a digital preservation service for e-journals, e-books, and other electronic content
Started by JSTOR in 2005 with funding from the Library of Congress and the Andrew Mellon Foundation.
Based on collaboration between libraries and publishers.
How It WorksWe maintain archiving agreements with publishers to collect and preserve content.
We receive content directly from the publishers. This content is held in a “dark archive”: it is stored until a specific event occurs that causes content to become accessible.
We make content available under certain circumstances:
Libraries don’t have to worry about whether their content exists for future generations.
• Publisher ceases operation• Publisher discontinues a title• Publisher drops a back file
TRIGGERS
Publishers can opt in to Portico’s post-cancellation access service.
• customers can request access to previously subscribed to/purchased material via Portico
• must be Portico participant in addition to publisher customer
PERPETUAL/POST-CANCELLATION ACCESS
Portico is a centralized and replicated repository that uses migration as its primary long-term archival approach, as part of a managed preservation strategy.
HOW DOES IT WORK
Portico provides preservation at the far end of the scale.We manage the content in our care very closely. The primary preservation methodology is migration—transitioning content from one file format to another as technology evolves.
NO ACTION BACKUP BYTEREPLICATION
FULL MANAGEDPRESERVATION
CONTENT MANAGEMENT ACTIVITIES
Usability
• the intellectual content of the item must remain usable via the delivery mechanism of current technology
Authenticity
• the provenance of the content must be proven and the content an authentic replica of the original
Discoverability
• the content must have logical bibliographic metadata so that it can be found by end users through time
Accessibility
• the content must be available for use to the appropriate community
Digital preservation is the series of management policies and activities necessary to ensure the enduring usability, authenticity, discoverability, and accessibility of content over the very long-term. The key goals of digital preservation include:
WHAT WE DOUnderstand and repackage content into our content model specific to the genre
Complete an initial metadata migration
Provide managed preservation
• replication
• fixity checks
• repair and replace content as needed
• refresh hardware
• migrate files to new format
Regular activities:
• Replicate archive• Perform fixity checks • Repair or replace content if it becomes
corrupted• Audit content• Report on the content to participants• Refresh hardware• Monitor the preservation and
academic community for changes in preservation needs
Annual or as needed activities:
• Validate or revalidate files as new tools are developed
• Migrate files to new formats as necessitated by the changing technological environment
• Update preservation plans• Receive preservation accreditation
OUR REPLICATION STRATEGYPortico’s makes both offline and online replicas of archive objects and their metadata and distributes them around the world.
ON-LINE MASTER – UNITED STATES, EAST COAST
ON-LINE REPLICALOCALLY MANAGED. UNITED
STATES, MIDWEST
ON-LINE REPLICAU.S. COMMERCIAL CLOUD
STORAGE
OFF-LINE REPLICA AT THE NATIONAL LIBRARY OF THE
NETHERLANDS (KB)
Content arrives at Portico
via DVD, hard drive, FTP, RSS
feed, etc. Files are off loaded
Files are processed
through the Portico workflow
and publisher specific tools.
Portico archival units are created.
Portico archival units are
preserved in the
Portico archive..
End users access content.
Archival units are replicated to two
locations in the U.S. and one location in
Europe.
When content is triggered or a PCA claim fulfilled the
content delivery is enabled.
49
United Kingdom of Great Britain and Northern Ireland
387
United States of America
7
New Zealand
58
Argentina
1Lebanon
25Australia
3Sweden
23Canada
169Brazil
3India
We have participants from small colleges to large research institutions—more than 900 libraries in 21 countries.
GLOBAL LIBRARY PARTICIPATION
2
United Arab Emirates
55
United Kingdom
1
New Zealand
210
United States
3
South Africa
1
Hong Kong
4
Singapore
4
Colombia
1
Argentina
14
Australia
1
Sweden19
Canada
2
Mexico1
Nigeria
1
Russia
2
Brazil
We have participants from small publishers to very large publishers—more than 400 publishers in 49 countries.
GLOBAL PUBLISHER PARTICIPATION
EJOURNAL GROWTH IN ARTICLES
0M 5M 10M 15M 20M 25M 30M 35M 40M 45M 50M 55M 60M 65M 70M 75M
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
15,509,065
53,351,065
17,709,028
22,020,508
36,349,048
14,540,713
28,228,653
61,175,927
8,492,778
4,794,891
160,598
415 Publishers from 49 countries
971 Libraries from 21 countries
26,241 E-journals committed
23,434 E-journals preserved
752,694 E-books committed
474,629 E-books preserved
156 D-collections committed
137 D-collections preserved
61,175,927 E-journals articles preserved
382,576 E-books preserved
894,993 Digitized books preserved
1,253,281 Digitized documents preserved
1636803 Digitized newspaper issues preserved
1,237,626,247 Preserved Files
756,557,484 Preserved Images
162,612,149 Preserved Repository created archival files
251,611,386 Preserved Supplied text files
25,615,391 Preserved Application Specific Files
4,858,679 Preserved Multi-file Packages
104,667 Preserved Video Files
2,653 Preserved Audio Files
60 Preserved Executable Files
20 Triggered journal titles 3 Triggered OA journal titles1 Triggered book title
1st digital preservation service to become certified as a Trusted Digital Repository