RODA and CRiB: A Service-Oriented Digital...
Transcript of RODA and CRiB: A Service-Oriented Digital...
![Page 1: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/1.jpg)
RODA and CRiB: A Service-Oriented Digital Repository
José Carlos [email protected]
Miguel [email protected]
Luis [email protected]
Francisco [email protected]
Cecília [email protected]
Luis [email protected]
![Page 2: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/2.jpg)
Context
RODA (2006-2009)• Metadata management (EAD based)• Digital object management (...)• Digital preservation protocols and policies• National project with public funds (Portuguese National Archives)
CRiB: Preservation Services for Digital Repositories (2005-2008)• Distributed Migration Service• Migration Adviser Service• Phd thesis at U.Minho (Miguel Ferreira)
![Page 3: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/3.jpg)
Context
RODA (2006-2009)• Metadata management (EAD based)• Digital object management (...)• Digital preservation protocols and policies• National project with public funds (Portuguese National Archives)
CRiB: Preservation Services for Digital Repositories (2005-2008)• Distributed Migration Service• Migration Adviser Service• Phd thesis at U.Minho (Miguel Ferreira)
![Page 4: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/4.jpg)
Premisses
• RODA:✴ OAIS as reference
model✴ EAD as descriptive
metadata✴ Open Source✴ SOA✴ Preservation Policy:
Migration✴ Authenticity
• CRiB:✴ Distributed Migration✴ SOA✴ Quality control✴ Migration Adviser
![Page 5: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/5.jpg)
Authenticity
“O Codex 632” by José Rodrigues dos Santos
Subject: Who really was Cristophoros Colombus?
Was he italian? Spanish? Or a portuguese belonging to a jewish family?
![Page 6: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/6.jpg)
Authenticity
We must trust our sources: in ancient History there are no direct speech or evidence.
EX: the bible
![Page 7: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/7.jpg)
Authenticity
We must trust our sources: in ancient History there are no direct speech or evidence.
EX: the bible
How do we become trustful?
![Page 8: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/8.jpg)
Authenticity
We must trust our sources: in ancient History there are no direct speech or evidence.
EX: the bible
How do we become trustful?
• Reputation
• Documenting every action taken upon DOs
![Page 9: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/9.jpg)
DO Classes: normalization
![Page 10: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/10.jpg)
Open Archival Information System
ISO 14721
![Page 11: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/11.jpg)
OAIS (Information Packages
• Submission Information Package (SIP)✴ Digital Object✴ Metadata created by producer
‣ too open...• Archival Information Package (AIP)
✴ Digital Object to be stored✴ Metadata: enough to ensure DO’s preservation
and access‣ model defined by PREMIS
• Dissemination Information Package (DIP)• DO transformed into the format that will be
delivered to the consumer• Metadata
![Page 12: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/12.jpg)
Ingestion
![Page 13: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/13.jpg)
Ingestion
Submission Contract• SIP specification• Ingestion workflow specification
![Page 14: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/14.jpg)
SIP Structure (example)
one still image
![Page 15: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/15.jpg)
SIP Structure (example)
one still image
criation properties:
- date- hardware- ...
![Page 16: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/16.jpg)
SIP Structure (example)
one still image
criation properties:
- date- hardware- ...
Technical Metadata:- color- dimensions- ...
![Page 17: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/17.jpg)
SIP Structure (example)
one still image
criation properties:
- date- hardware- ...
Technical Metadata:- color- dimensions- ...
Descriptive Metadata:- producer
- colection- ...
![Page 18: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/18.jpg)
SIP Structure (example)
one still image
criation properties:
- date- hardware- ...
Technical Metadata:- color- dimensions- ...
Descriptive Metadata:- producer
- colection- ...
Manifest
![Page 19: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/19.jpg)
SIP Structure (example)
Com
pres
sed
Fileone still image
criation properties:
- date- hardware- ...
Technical Metadata:- color- dimensions- ...
Descriptive Metadata:- producer
- colection- ...
Manifest
![Page 20: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/20.jpg)
SIP Structure (+complex)
![Page 21: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/21.jpg)
SIP Structure (+complex)
1001001011010100100101
![Page 22: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/22.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+
![Page 23: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/23.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+ Properties
![Page 24: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/24.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+ Properties Technical Metadata
![Page 25: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/25.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+ Properties Technical MetadataDescriptive Metadata
![Page 26: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/26.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+ Properties Technical MetadataDescriptive Metadata
Manifest
![Page 27: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/27.jpg)
SIP Structure (+complex)
1001001011010100100101
DO = Image+ Properties Technical MetadataDescriptive Metadata
Manifest
Compressed File
![Page 28: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/28.jpg)
SIP AIP
• integrity check• virus check• generation of preservation metada (PREMIS)• conversion to a normalized format
• generation of technical metadata• generation of preservation metadata (PREMIS)
Ingestion Workflow
![Page 29: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/29.jpg)
AIP Storage
<EAD>
<PREMIS> <PREMIS> Metadata
DOs
![Page 30: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/30.jpg)
Roda Data Model
![Page 31: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/31.jpg)
Roda Data Model
Description Objects
![Page 32: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/32.jpg)
Roda Data Model
Description Objects
Representation Objects
![Page 33: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/33.jpg)
Roda Data Model
Description Objects
Representation Objects
Preservation Objects
![Page 34: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/34.jpg)
Implementation Requisites• Graphical Interface for Ingestion process
• Producer registry
• SIP production tool
• Ingestion feedback
• Partial Ingestion
• “Quarantine” zone: cache, ingestion buffer
• SIP validation
• Error reporting
• Persistent identifiers
• PREMIS event generation
• DIP digital signature
• ...
![Page 35: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/35.jpg)
Development framework
![Page 36: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/36.jpg)
Requisites based comparaison
IngestionAIP
Management
Dissemination
Ingestion Management Dissemination
![Page 37: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/37.jpg)
Matching data models
DSpace
![Page 38: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/38.jpg)
Matching data models
Fedora
Text
Description
Items
Services
PID
![Page 39: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/39.jpg)
Architecture
![Page 40: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/40.jpg)
RODA Schemas
![Page 41: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/41.jpg)
8
DO Anatomy
DatabaseText Doc.Still Image
SQL Server Hard DiscAccess
PDF Doc.
PNG image
Ms Word Doc.
Tape
...
Conceptuallevel
Logicallevel
Physicallevel
![Page 42: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/42.jpg)
8
DO Anatomy
DatabaseText Doc.Still Image
SQL Server Hard DiscAccess
PDF Doc.
PNG image
Ms Word Doc.
Tape
...
Conceptuallevel
Logicallevel
Physicallevel
If one of these levels becomes obsolete we
loose access to the DO
![Page 43: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/43.jpg)
DO Preservation Strategies• Focusing the physical/logical object
o Centered in preserving information in her logical format or/and physical support
o Uses original technology associated to these objects to ensure the access to them
o Technology preservation
• Focusing the conceptual object
o Centered in preserving the object core properties in a way that is independent from hardware and software
o Conceptual object preservation
![Page 44: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/44.jpg)
Conceptual object preservation
Migration: periodic DO transfer from one hw/sw configuration into an updated one (centered in preserving significant properties other then preserving the original bit stream).
Advantages– DO are disseminated in formats known to users– No need to preserve the original hw/sw platform– Most used strategy and the only that has worked so far
Disadvantages– Possible loss of information during conversion– Continued maintenance is needed – In the longterm perspective costs are high
![Page 45: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/45.jpg)
Conceptual object preservation
Migration: periodic DO transfer from one hw/sw configuration into an updated one (centered in preserving significant properties other then preserving the original bit stream).
Advantages– DO are disseminated in formats known to users– No need to preserve the original hw/sw platform– Most used strategy and the only that has worked so far
Disadvantages– Possible loss of information during conversion– Continued maintenance is needed – In the longterm perspective costs are high
What are the significant properties?
![Page 46: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/46.jpg)
Distributed Migration• Remote conversion services
o known APIs
o descriptive metadata for localization and invocation (UDDI)
• Advantages
o Platform independency
o Redundancy/multiple migration paths
o Compatible with other migration strategies
Normalizatio, migration on request
o Generalized cost reduction
• Disavantages
o Bandwidth requirements
o SlowFormat
B
Format
C
Format
D
Format
E
Format
A
Conversion
A-C
Conve
rsion
A-B
Conversion
B-C
Conver
s ion
D-E
Conversion
C-E
Convers
i on
B-C
Conversion
A-E
Convers
ion
C-D
![Page 47: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/47.jpg)
Distributed Preservation Services
t1
t4
t2t3
t5
t6 t7
<.9, .8, .95, .1>
<.5, .3, .95, .6>
<.5, .3, .95, 1>
<.9, .6, .9, .7>
<.3, .6, .9
5, .1>
<.7, .5, .65, .1>
<.9, .8, .6, .1>
![Page 48: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/48.jpg)
Distributed Preservation Services
t1
t4
t2t3
t5
t6 t7
<.9, .8, .95, .1>
<.5, .3, .95, .6>
<.5, .3, .95, 1>
<.9, .6, .9, .7>
<.3, .6, .9
5, .1>
<.7, .5, .65, .1>
<.9, .8, .6, .1>
CRiB project: http://crib.dsi.uminho.pt
![Page 49: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/49.jpg)
CRiB: architecture
![Page 50: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/50.jpg)
Migration Broker
• Carries out format conversions
o Invokes the necessary conversion services
• Measures the performance of the conversion process
o Availability
o Stability
o Throughput
o Scalability
o Cost
o Size ratio
o File count ratio
![Page 51: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/51.jpg)
Object Evaluator• Determines the amount of data loss involved in
migration
• Detects similarity between significant properties of digital objects
o Object class dependent;
o Different significant properties for bitmap images, text documents, relational databases, etc.
• Produces evaluation reports in PREMIS format
o Datetime of intervention
o Description of involved agents
o Type of event (i.e. Migration)
o Outcome of intervention
![Page 52: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/52.jpg)
RODA+CRiB
![Page 53: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/53.jpg)
Text Documents and Still Images
![Page 54: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/54.jpg)
Text Documents and Still Images
• EAD elements capture most of the significant properties: provenance, producer history, context, ...
• Content is kept in a normalized format: PDF and uncompressed TIFF.
![Page 55: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/55.jpg)
Text Documents and Still Images
• EAD elements capture most of the significant properties: provenance, producer history, context, ...
• Content is kept in a normalized format: PDF and uncompressed TIFF.
<EAD>
![Page 56: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/56.jpg)
Text Documents and Still Images
• EAD elements capture most of the significant properties: provenance, producer history, context, ...
• Content is kept in a normalized format: PDF and uncompressed TIFF.
<EAD>
<EAD>
![Page 57: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/57.jpg)
Databases
• Data?
• Structure?
• Views?
• Reports?
• Stored Procedures?
• ...
![Page 58: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/58.jpg)
Databases
• Data?
• Structure?
• Views?
• Reports?
• Stored Procedures?
• ...
First prototype:
• Data
• Structure
![Page 59: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/59.jpg)
SIP Builder
![Page 60: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/60.jpg)
Browser
![Page 61: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/61.jpg)
Disseminators
![Page 62: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/62.jpg)
Metadata Editor
![Page 63: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/63.jpg)
Preservation Metadata Viewer
![Page 64: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/64.jpg)
Advanced Search
![Page 65: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/65.jpg)
Final thoughts
“Data Preservation is a people problem”Michael Lesk
![Page 66: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/66.jpg)
Final thoughts
“Data Preservation is a people problem”Michael Lesk
• People need to be trained to save data in a proper way.• What to preserve? Data, Structure, Semantics...• Preservation is for future users but only today users vote on budget• We need to make data collecting people have preservation concerns• Preservation is fault tolerance. All systems are imperfect
![Page 68: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/68.jpg)
Business model• CRiB is free: [email protected]
• RODA is free: property of Portuguese National Archives
• Support, maintenance and internationalization: KEEP Solutions Lda, www.keep.pt
![Page 69: RODA and CRiB: A Service-Oriented Digital Repositoryrepositorium.sdum.uminho.pt/bitstream/1822/8226/2/iPres... · 2008-12-02 · RODA and CRiB: A Service-Oriented Digital Repository](https://reader035.fdocuments.in/reader035/viewer/2022070801/5f0274047e708231d404568c/html5/thumbnails/69.jpg)
Let’s Preserve Tomorrow’s History...
Questions?