Numeric
Statistical assessment of the digitisation of Europe’s
cultural heritage.
19 June 2007
Phillip RamsdaleIPF
Numeric
“Analogue-to-Digital”
The study seeks to capture information about analogue-to-digital conversion of physical materials including the preservation of facsimile images, but this does not include text written directly to the web.
Numeric
Study phasing
2007 2008
2008 2009
May J un J ul Aug Sep Oct Nov Dec J an Feb Mar Apr
May J un J ul Aug Apr
Stage 1 - Development
Stage 2 - Implementation
J an Feb MarSep Oct Nov Dec
Initiation meeting in
Luxembourg
Building networks, desk research, identify data sources, defining sample frame, refine intended methodology
Desk
research
findings
report
Baseline statisitcal analysis, refinement of definitions
Feed-
back /
"Prov ing"
workshop
Refinement based on feedback
Data collection
Data collection, analysis, interpretation, presentation, reporting, consultation, publication Promotion, sustainability actions
Progress
report
Progress
report
Findings
report
D2 D3 D4
D5 D6 D7
Final
report /
hand-
over
frame-
D8
This report /
presentation to
Experts' Group
D1
First data
analysis
&
method-
ology
NOWMar08
Numeric
Cultural heritage
Culture / Creative Industry
x "part of"
NACE 1.1 ISIC 3.1 NAICS 2002
EU UN North
America
Video, film and photography of which: Photography
22.32 x 92.10 92.72 x 74.87 x 74.81 x
2230 x 9211 9212 9249 x 7499 x 7494
334612 x 5121 56131 x 7114 x 54192
Music and the visual and performing arts Sound recording and music publishing Visual and Performing arts (including Festivals)
22.14 22.31 x 92.31 92.32 92.34 x 92.72 x
2213 2230 x 7499 x 9214 9219 x 9249 x
5122 334612 x 7111 7114 x 7115 x 7113 7114 x 56131 x
Radio and TV (Broadcasting) 92.20 92.72 x 9213 9249 x 515 516 x 5175 56131 x
Libraries (includes archives) 92.5 9231 51912
Museums 92.5 x 9232 x 71211
Historic and heritage sites 92.5 x 9232 x 71212
Other heritage institutions 92.5 9233 x 71219
Numeric
Study outputs• Standard definitions, classifications &
indicators.• A tested model for future policy
monitoring.• Estimates of the scale of digitisation in
Europe.• Improved estimates of the analogue base.• A sustainable framework for measuring
the progress of digitisation efforts in Europe.
• Web-site informing of progress and pointing to useful frameworks.
Numeric
Recording in a standard way to take account of differences:
FUNDING of digitisation activitiesTECHNOLOGY employedINSTITUTIONAL OBJECTIVESNATIONAL POLICY
Establish UNIFORM approaches for:• Classification of outputs• Definition of indicators• Data collation by national institutions
This will facilitate Benchmarking
Numeric
What are we measuring?
• What population? i.e. define culture.
• What objects / materials?
• Which technologies?
The technology employed differs between sectors
Numeric
What population?
Film A/V and Archives, Libraries, Museums, Yes…
but… what about “Research libraries”?
There are ambiguities and inconsistencies.
Brief: …to follow as much as possible the definitions of cultural institutions as used by UNESCO and EUROSTAT.
Numeric
What objects?
Collective memory of print: books, journals, newspapers
Photographs
Museum objects
Archival documents
Audio-visual materials such as films
Granularity becomes coarser as the classification is
summarised.
Numeric
Which technologies?
The standards adopted materially influence the quality and cost.
• File formats
• Sampling rates
• Source microfilm, flat paper, etc..
• OCR source fonts
• Metadata creation / storage
Numeric
The Method – put simply
• Infer the overall scale of digitisation activity and expenditure using data collated from a sample of institutions believed to be representative in their specific sector / country.
• Verify the estimates against a “foundation database” of the known analogue universe. i.e. institutions, collections, staff, etc..
Numeric
Mapping: “Analogue to Digital”
DIGITAL MATERIALS
ANALOGUE MATERIALS
Botto
m - U
pTop -
Dow
n
Numeric
Tested method
Could be any, or a mixture of:
• coincidental national surveys of major institutions
• enhanced by sample survey of other institutions
• estimates based on robotically collected data
• estimates based on analogue trends
Numeric
Persistent research
• Make estimates based on “jig-saw” data assembled from desk-research;
• Review project descriptions for data (investment and outputs) in order to reference extrapolated statistics;
• Mount samples surveys to supplement and enhance the data and estimates deriving from desk research;
• Investigate other possibilities such as persistent identifiers;
• Service national survey initiatives by providing tested frameworks.
Numeric
Desk research phase will provide useful pointers
• Past surveys help in identifying the “lowest branches”, with the “biggest fruits”.
i.e. Concentrate on importance and avoid effort on comparatively trivial items.
Numeric
0 20 40 60 80 1000
20
40
60
80
0
20
40
60
80
at
be
bgcy
czde
dk
ee
es
fi
fr
gr
hu
ielt
lu
lv
mt
nl
pl
ptro
se
si
sk
uk
at
be
bg
cy
cz
de
dk
ee
es
fi
fr
gr
hu
ielt
lulv
mt
nl
pl
pt
ro
se
sisk
uk
% Households connected to the Internet
% v
isiti
ng
lib
rary
at l
ea
st o
nce
pe
r ye
ar
% v
isiti
ng
Mu
seu
m /
Ga
llery
at l
ea
st o
nce
pe
r ye
ar
dk
dk
it
it
gr
de
fi
fi
Numeric
Public Libraries
Source:“Status of Technology and Digitization in the Nation’s Museums and Libraries”, US Institute of Museum and Library Services, Jan 2006.
• Funding of digitisation remains uncertain• However, digitisation is not a high priority
In the past year did you have
funding for:Yes No ?
technology? 81% 17% 2%
digitization? 12% 71% 17%
Next year do you plan to have
funding forYes No ?
technology? 75% 9% 17%
digitization? 20% 52% 29%
Numeric
Archives
Source:“Status of Technology and Digitization in the Nation’s Museums and Libraries”, US Institute of Museum and Library Services, Jan 2006.
• Funding of digitisation is more certain• This makes it easier to verify the statistics
In the past year did you have
funding for:Yes No ?
technology? 76% 20% 4%
digitization? 57% 38% 5%
Next year do you plan to have
funding forYes No ?
technology? 67% 13% 20%
digitization? 59% 19% 22%
73% for State Library Administrative
Agencies
Numeric
Materials that are being digitisedInstitutions reporting digitisation of the following in the past year (IMLS)5 highest ranked materials
Public Libraries
Archives State Library Admin
Agencies
Photographs 4.8% 17.5% 2.7%
Correspondence, diaries, etc. 2.4% 6.5% 12.8%
Historical documents/archives 3.3% 11.6% 5.1%
Maps 1.9% 6.6% 8.1%
Government publications 0.0% 1.1% 15.4%
Information on the institution 4.8% 5.4% 5.3%
Films, videotapes 1.0% 6.5% 7.9%
Other items 0.0% 5.0% 10.0%
Manuscripts 1.0% 7.4% 2.6%
Images of items in the collections 1.5% 6.5% 0.0%
Numeric
Coarse Benchmarks
Source:IFLA/UNESCOSurvey on Digitisationand Preservation1999
Maximum Minimum Average
Per page US$15 US$0.12 US$7.72
Per book US$154 US$28 US$70.66
Per serial issue US$14 US$14 US$14
Numeric
Specific Benchmarks
Cost per Hour
Multi-plier
Good Difficult Unplay-able
Total
Euros Number Euros Euros Euros EurosTrial 20 50.00 5.00 2400.00 3000.00 0.00 5400.00Total 2400.00 3000.00 0.00 5400.00
Raw Media
Mass Storage
Secure Backup
Browse Private
Browse Public
Total
Type Euros Euros Euros Euros Euros EurosTrial 20 Film 340.23 2721.82 850.57 1.70 17.01 3931.33Total 340.23 2721.82 850.57 1.70 17.01 3931.33
Carrier Type Transfer Cost
Storage Cost
Total
Name Name Euros Euros Euros PrestoSpace toolTrial 20 Film 5400.00 3931.33 9331.33Total 5400.00 3931.33 9331.33
Cost of Storage for Carriers
Cost per carrier media condition to transfer
Combined Cost for Carriers
Numeric
Other Sources
• Report on Digital Material in European National Archives;
• EDL project survey of digitisation in CENL National Libraries;
• Status reports;• Information from projects like
EDLNet, MINERVA and MICHAEL;• Other projects / studies / surveys
referred by experts.
Numeric
Robotics – Investigate possibilities
Consult• The European Digital Library –
“persistent identifiers”• Other projects e.g. London
Metropolitan Archive• The European (Internet) Archive• Multimatch (ISTI-CNR)
Could help harvest data in future
Numeric
An example where tags for digitised content may be required
• The collections span 1067 to 2006 and fill 101km of shelving over two sites.
• 10,000+ parish registers; c.80,000 wills; 9,000+ records for named individuals; c.7,000 poll and electoral registers; c.2,000 admission and discharge registers for London schools
• Chargeable online access via a web-site.• Part revenues reinvested in the care and
preservation of the collections.• This new service would sit alongside existing on-
site services which the City will continue to provide.
Numeric
First priorities – the next 5 months
• Desk research / frameworks in-practice• Clarify classifications / definitions• Build / verify analogue database• Review robotic opportunities• Promote and involve:
– Website– Newsletter– Presentations
Numeric
Role of this Group in the study
• Advocate the study objectives to colleagues in own sector / country.
• Apply study outputs as required in own country.
• The Commission may follow-up this meeting with a specific request for contact details of suitable experts in each country to help the study source further information.
Numeric
Thank you
www.Numeric.ws
• [email protected]+44 208 667 8508
• [email protected]+352 4301 35786
Top Related