International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library...
-
Upload
jacob-spencer -
Category
Documents
-
view
215 -
download
0
Transcript of International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library...
International Seminary on Digitisation:
Experience and Technology11 th May 2004 | National Library | Lisbon – Portugal
DIGITAL ARCHIVE OF PORTUGUESE ART
Maria Inês Cordeiro Art Library, Calouste Gulbenkian Foundation
Maria Inês Cordeiro, 11 Maio 2004International Seminary on Digitisation:
Experience and Technology
End user services & underlying model
Objectives and management requirements
Technical infrastructure
Information structures
Future developments
Digitisation: experience & technology
Maria Inês Cordeiro, 11 May 2004
End user services ...
Online Access to front matter (cover & content pages)
- as a systematic service since 2000 - > 21.000 pages, for periodicals and monographs
Online access to full content: special collections
- at present: 11 collections available- ca 1900 titles of textual materials (21.000
pages) - ca 3000 images from photographic collections
- goal to 2006 - 17 collections, including several important artist’s
personal archives and architectural drawings - over 150.000 documents
Maria Inês Cordeiro, 11 May 2004
...underlying model
Acess is integrated with already existing services
- the catalogue is the main component for search and retrieval functions
ex. periodical content pages ex. monograph content pages ex. full content
Additional access points and functionalities
- special functionalities to restrict search to digitised docs
- access points alternative to the traditional search from new option ‘Digitised Collections’
from a web page from a collection record
Maria Inês Cordeiro, 11 May 2004
Objectives & management requirements
Optimize the existing technical structure - diversify the output, not the underlying systems (e.g. databases) - integrate new workflow with existing flows - concentrate efforts on core competences
Integration / transparency for new resources and functions - resource integration: the Library as a whole, a single entity - navigability between the Catalogue, the Digital Archive and the Web
- outsourcing of digitization of full content- bibliographic metadata in MARC - local tasks: tech definitions, quality control, integration
Digital preservation issues - neutral, simple & consistent criteria
- avoid format diversity & dependency of proprietary applications
- document processes and resources - archive master files with the corresponding
technical & administrative metadata
Maria Inês Cordeiro, 11 May 2004
Technical infrastructure
Technical requirements and policies
DIGITISATIONSelection and accessibility policies
formats, versions, metadata and management procedures
JPEG or GIF TB- thumbnail
300 dpiPDF IC - usage
300 dpiTIFF, no comp.IA - archive
Min resol. FormatsImage types
File name conventions
Metadata in TIFF headers
72 dpi
Maria Inês Cordeiro, 11 May 2004
TIFF archive (offline) master files
Control reproduction fidelity
Control no of files, names, etc., and metadata
Record archival operation
DIGITAL ARCHIVEIntegrate usage files
TECHNICAL REQUIREMENTS & POLICIES
formats, versions, metadata and management procedures
JPEG or GIF TB- thumbnail
300 dpiPDF IC - usage
300 dpiTIFF, no comp.IA - archive
Min resol. FormatsImage types
File name conventions
Metadata in TIFF headers
72 dpi
Maria Inês Cordeiro, 11 May 2004
DocumentName 269 Name of doc., convention
ImageDescription 270 Convention for descript.of groups of docs in the same collection
ImageWidth 256 (# pixels, horizontal)
ImageLength 257 (# pixels vertical) BitsPerSample 258 1, 8 ou 24 for b/w, shades of grey or color Compression 259 1 (no compression) PhotometricInterpretation 262 0 b/w and shades of grey, 3 for color RGB FillOrder 266 1 (default) StripOffsets * 273 (o byte offset for each strip) SamplesPerPixel 277 0 b/w and shades of grey, 3 for color RGB RowsPerStrip * 278 (# rows per strip) StripByteCount * 279 (absent, if no compression) Xresolution 282 (# pixels /resolution unit, horizontal) Yresolution 283 (# pixels /resolution unit, vertical) ResolutionUnit 296 2 (inches)
Copyright 33432 Indication according to conventions
Tag name Tag# Content DateTime 306 (date of capture, standard format) Model 272 (capture equipment model) Make 271 (capture equipment maker) Software 305 (capture software used) HostComputer 316 (machine & OS used)
TIFF TAGS – MASTER FILES
Maria Inês Cordeiro, 11 May 2004
Clarification & clearance of legal
rights
Content relevance
rarity & preservation…
In the Library only?
On the Internet?
SELECTION AND ACESSIBILITY POLICIES
Technical requirements and policies
DIGITISATIONSelection and accessibility policies
Maria Inês Cordeiro, 11 May 2004
INTEGRATION APPLICATION archival & links
DIGITAL ARCHIVE
Digital usage files
Archive structure management
TIFF archive (offline)
Master files
Definitive, files full content out of dig. collectionsOTHER FULL CONT
provisional, incoming files to integrateIN_POOLs (collection]
definitive for each collectionC[collection short name]
definitive, content pages periodicalsSUPER
definitive, content pages of current monog.SUMON
Content & rulesDirectories
Technical requirements and policies
DIGITISATIONSelection and accessibility policies
Technical infrastructure
Maria Inês Cordeiro, 11 May 2004
TIFF archive (offline)
Master files
Technical requirements and policies
DIGITISATIONSelection and accessibility policies
INTEGRATION APPLICATION archival & links
DIGITAL ARCHIVE
Digital usage files
Archive structure management
HORIZON SYSTEM
Bibliographic data
Accessibility data 95X
Data needed to define access, search and visualization
conditions regarding bib records with associated digital files
Technical infrastructure
Maria Inês Cordeiro, 11 May 2004
Information structures
MANAGEMENT OF ACCESSIBILITY DATA : 95X MARC TAGS
956 Search keys for records pertaining to the same collection set
957 Link to Terms & conditions of usage Web page 958 Link to access the digital archive, link description
Elements defining whether content page or full content; Elements defining access on the Internet or just local
959 Search key for records with content pages of new accessions; elements defining wether periodical or monograph
HORIZON SYSTEM
Maria Inês Cordeiro, 11 May 2004
Information structures
Presents the various integration options
Doesn’t require Horizon/MARC knowledge or
knowledge of specific details and rules of the
digital archive
Ensures the consistency of
the archive organization
Writes data in MARC 95X
tags
INTEGRATION APPLICATION archival & links
DIGITAL ARCHIVE
Digital usage files
Archive structure management
HORIZON SYSTEM
Bibliographic data
Accessibility data 95X
Maria Inês Cordeiro, 11 May 2004
IPAC LOCAL
Data and digital files available on the local network only
IPACWWW
Data and digital files available on the INTERNET
LOCAL NETWORK
INTERNET
Technical infrastructure
INTEGRATION APPLICATION archival & links
DIGITAL ARCHIVE
Digital usage files
Archive structure management
HORIZON SYSTEM
Bibliographic data
Accessibility data 95X
Technical requirements and policies
DIGITISATIONSelection and accessibility policies
TIFF archive (offline)
Master files
Maria Inês Cordeiro, 11 May 2004
DIGITAL ARCHIVE
Digital files
HORIZON SYSTEM
Bibliographic records
Links to digital files
COLLECTIONS
IPAC LOCAL SYSTEM
Access on the local network
IPAC WWW SYSTEM
Access on the Internet
Information structures
Maria Inês Cordeiro, 11 May 2004
Future developments
Enhance exposure of content on the WWW environment
> Web sites for selected content (e.g, by collection) permanently reachable by search engines> metadata accessible to other systems - Web services > automated mechanisms for MARC metadata conversion into other schema, e.g. XMLMARC, Dublin Core in XML
Enhance search & retrieve of images through automated means,
non textual
- based on algorithms for image analysis and indexing - when a considerable critical mass of images is held- to fully exploit digital resources of a visual nature