Post on 11-May-2015
Europeana Newspapers WP 4
Aggregation & Indexing Plan
Markus Muhr
Agenda
• Customer Relationship Management
• Aggregation Workflow - Metadata
• Aggregation Workflow - Full-text and Images
• Newspaper Content Browser Options
• Viewing Images
• Delivery to Europeana / Zeitschriftendatenbank
• Aggregation and Indexing Plan
• Questions
Customer Relationship Management
• SugarCRM
• Management of all administrative information
  • Organisations, contacts, datasets, projects, etc.
• Important features for project handling
  • Newspaper collections
  • Cases per specific collection
  • Aggregation and Indexing Plan
  • Automatic reporting
Aggregation Workflow – Metadata
● Scheduling of ingestion
● Datasets ready for harvesting
● Create case in CRM: case # to provider
● Harvesting metadata (OAI-PMH, FTP, ...)
● Enhance metadata (VIAF, Geonames, MACS, ...)
● Indexing in acceptance portal
● E-mail to provider to accept dataset
● Live index = live portal
● Delivery to Europeana
● Enhancing and publishing in Europeana
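The harvesting step can be illustrated with a minimal OAI-PMH sketch. It is not the project's actual harvester: the endpoint URL, the `oai_dc` metadata prefix and the sample response are assumptions made for illustration. It shows how a `ListRecords` request is built and how the resumption token that drives paging is read from a response.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace of the OAI-PMH 2.0 response envelope.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # A resumption token is an exclusive argument: it replaces all others.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, next resumption token or None)."""
    root = ET.fromstring(xml_text)
    identifiers = [el.text for el in root.iter(OAI_NS + "identifier")]
    token_el = root.find(".//" + OAI_NS + "resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


# Hypothetical response page, shortened to the parts the sketch reads.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:issue-1</identifier></header></record>
    <record><header><identifier>oai:example.org:issue-2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

ids, token = parse_page(SAMPLE)
next_url = list_records_url("https://example.org/oai", resumption_token=token)
```

A real harvester would fetch each URL in turn and stop once a page carries no resumption token.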
Aggregation Workflow - Full-text and Images
● Hard-disk delivery by UIBK/CCS
● Hard-disk delivery to ULCC
● Ingestion and alignment of full-text and images with harvested metadata
● JPEG 2000 generation for hosted IIP image server
● Enrichment with named entities from KB
● Indexing into content browser
● Adaptations of image viewer for external image servers
● E-mail to partner
Newspaper Content Browser Options
• Questionnaire to content providers determined how the content would appear in newspaper content browser
  • Option 1 - Images and full-text
  • Option 2 - Snippets of images and full-text
  • Option 3 - Full-text only
  • Option 4 - Metadata only
  • Option 5 - Option 1 via external image server
  • Option 6 - Option 2 via external image server
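The six options can be read as combinations of a few flags. The sketch below is one possible encoding, not the content browser's actual data model: the class and field names are hypothetical, and treating Option 2 as "images and full-text, but snippets only" is an interpretation of the slide wording.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BrowserOption:
    """Hypothetical flags describing how a collection appears in the browser."""
    shows_images: bool
    shows_full_text: bool
    snippets_only: bool          # assumption: applies to Options 2 and 6
    external_image_server: bool  # images served by the provider, not centrally


OPTIONS = {
    1: BrowserOption(True, True, False, False),
    2: BrowserOption(True, True, True, False),
    3: BrowserOption(False, True, False, False),
    4: BrowserOption(False, False, False, False),   # metadata only
    5: BrowserOption(True, True, False, True),      # Option 1 via external server
    6: BrowserOption(True, True, True, True),       # Option 2 via external server
}


def needs_image_hosting(number):
    """True when images are shown and no external image server provides them."""
    option = OPTIONS[number]
    return option.shows_images and not option.external_image_server


hosted = [n for n in OPTIONS if needs_image_hosting(n)]
```

With this encoding, only Options 1 and 2 require centrally hosted images.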
Viewing Images
● The European Library hosts images for Options 1 and 2
● IIP Image Server with JPEG 2000
● Viewing images transformed into JPEG 2000
● Ingestion workflow includes transformation step for TIFFs and JPEGs
● Time-demanding operation
● Image viewer is IIPMooViewer
● Open source projects
● Europeana Regia: http://www.theeuropeanlibrary.org/tel4/virtual/regia
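To show what a viewer asks of an IIP image server, here is a minimal sketch of a request URL builder. The server URL and image path are hypothetical, and the parameter names (`FIF`, `WID`, `RGN`, `CVT`) follow the Internet Imaging Protocol as commonly implemented by IIPImage; they should be checked against the documentation of the server actually deployed.

```python
from urllib.parse import urlencode


def iip_image_url(server_url, image_path, width=None, region=None):
    """Build an IIP request for a JPEG rendering of a stored JPEG 2000 image."""
    params = [("FIF", image_path)]          # path of the image on the server
    if width is not None:
        params.append(("WID", str(width)))  # output width in pixels
    if region is not None:
        # Region as x,y,w,h given as fractions of the full image (0.0 to 1.0).
        params.append(("RGN", ",".join(str(v) for v in region)))
    # CVT goes last: it triggers the export using the settings before it.
    params.append(("CVT", "jpeg"))
    return server_url + "?" + urlencode(params, safe="/,")


# Hypothetical server and file name.
url = iip_image_url(
    "https://images.example.org/fcgi-bin/iipsrv.fcgi",
    "/newspapers/page-0001.jp2",
    width=800,
)
```

Requesting a `region` instead of the whole page is what would allow snippet-style display without transferring the full image.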
Viewing Images
● External image servers for Options 5 and 6
● Current support of external viewers via iframe
● Alignment and highlighting not available
● Improved usage of content browser via integrated image viewer
● Adaptations for each different kind of image server
● Time-demanding task
● Existing viewers that can be easily embedded in the Newspaper Content Browser are preferable
● Technical support at partner libraries is necessary
Delivery to Europeana / Zeitschriftendatenbank
● Metadata from Full and Associate Partners should go into the Newspaper Content Browser, the Europeana portal and the Zeitschriftendatenbank / Union Catalogue of Serials
  ● EDM to Europeana
  ● Dublin Core to Zeitschriftendatenbank
● Europeana Data Model delivery should be finalised soon
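As an illustration of the Dublin Core side of the delivery, the sketch below serialises a record as simple Dublin Core inside an `oai_dc` wrapper. The wrapper choice, the field selection and the sample values are assumptions; the actual delivery profile for the Zeitschriftendatenbank is not described on the slides.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"

# Use readable prefixes instead of auto-generated ns0, ns1, ...
ET.register_namespace("dc", DC_NS)
ET.register_namespace("oai_dc", OAI_DC_NS)


def to_dublin_core(record):
    """Serialise {element name: [values]} as a simple Dublin Core record."""
    root = ET.Element("{%s}dc" % OAI_DC_NS)
    for field, values in record.items():
        for value in values:
            element = ET.SubElement(root, "{%s}%s" % (DC_NS, field))
            element.text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical newspaper title record.
xml_text = to_dublin_core({
    "title": ["Example Gazette"],
    "language": ["de"],
    "type": ["newspaper"],
})
```

Repeated elements are supported by passing several values for one field, which matches how Dublin Core allows every element to repeat.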
Europeana Data Model
Dublin Core
Aggregation and Indexing Plan
● Plan includes aggregation of partners and 11 associated partners
● Q3 is the first quarter with indexing work
● Aggregation and indexing are aligned with deliveries from UIBK/CCS
● Deliveries to Europeana & Zeitschriftendatenbank from Q4 onwards
● Aggregation and indexing are split over multiple quarters for some partners
Aggregation and Indexing Plan – Q3 2013
● Österreichische Nationalbibliothek / Austrian National Library – Option 5
  ● Currently working on first batch of 1.090k full-text pages
● Kansalliskirjasto / National Library of Finland – Option 1 (new)
  ● Currently working on first batch of 132k full-text pages and images
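The page figures in the plan use a compact notation. The helper below is a small sketch for turning them into integers, assuming that the dot is a thousands separator and that `k` means thousand, so that `1.090k` stands for 1,090,000 pages. The function name is hypothetical.

```python
def parse_page_count(text):
    """Convert a figure such as '1.090k' or '132k' to an integer page count."""
    value = text.strip().lower()
    if not value.endswith("k"):
        raise ValueError("expected a figure ending in 'k': %r" % text)
    # Assumption: '.' separates thousands, it is not a decimal point.
    return int(value[:-1].replace(".", "")) * 1000


austria = parse_page_count("1.090k")
finland = parse_page_count("132k")
```

Figures that are not yet known in the plan (written with a question mark) raise a `ValueError` rather than being guessed.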
Aggregation and Indexing Plan – Q4 2013
● Landesbibliothek Dr. Friedrich Teßmann / Teßmann Library – Option 2
  ● 857k full-text pages and thumbnail images
● Österreichische Nationalbibliothek / Austrian National Library – Option 5 and 4
  ● Remaining batches of 1.090k full-text pages
  ● Metadata for 5.691k pages
Aggregation and Indexing Plan – Q4 2013
● Bibliothèque nationale de France / National Library of France – Option 5
  ● First batch of 2.388k full-text pages
● Latvijas Nacionālā bibliotēka / National Library of Latvia – Option 1
  ● 450k full-text pages and images
Aggregation and Indexing Plan – Q4 2013
● Landsbókasafn Íslands - Háskólabókasafn / National and University Library of Iceland – Associated Partner
  ● Metadata for 4.112k pages
● National Library of Spain – Associated Partner
  ● Metadata for 5.831k pages
● Bibliothèque nationale de Luxembourg / National Library of Luxembourg – Associated Partner
  ● Metadata for 620k pages
Aggregation and Indexing Plan – Q1 2014
● Bibliothèque nationale de France / National Library of France – Option 5
  ● Next batch of 2.388k full-text pages
● Eesti Rahvusraamatukogu / Estonian National Library – Option 1
  ● First batch of 594k full-text pages and images
● Milli Kutuphane Baskanligi / National Library of Turkey – Option 4
  ● Metadata for 9k pages
Aggregation and Indexing Plan – Q1 2014
● Staatsbibliothek zu Berlin / Berlin State Library – Option 1
  ● First batch of 248k full-text pages and images
● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
  ● First batch of 1.707k full-text pages and images
● Univerzitet u Beogradu / University Library of Belgrade – Option 1
  ● First batch of 408k full-text pages and images
Aggregation and Indexing Plan – Q1 2014
● National Library of Wales – Associated Partner
  ● Metadata for 1.100k pages
● National and University Library in Zagreb – Associated Partner
  ● Metadata for 300k pages
Aggregation and Indexing Plan – Q1 2014
● St. Cyril and Methodius National Library / The National Library of Bulgaria – Associated Partner
  ● Metadata for 12k pages
● National Library of the Czech Republic – Associated Partner
  ● Metadata for 5.760k pages
Aggregation and Indexing Plan – Q2 2014
● Bibliothèque nationale de France / National Library of France – Option 5
  ● Next batch of 2.388k full-text pages
● Eesti Rahvusraamatukogu / Estonian National Library – Option 1
  ● Next batch of 594k full-text pages and images
Aggregation and Indexing Plan – Q2 2014
● Biblioteka Narodowa / National Library of Poland – Option 2
  ● 83k full-text pages and images
● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
  ● Next batch of 1.707k full-text pages and images
● Koninklijke Bibliotheek / National Library of the Netherlands – Option 5
  ● 1.900k full-text pages
Aggregation and Indexing Plan – Q2 2014
● Narodna in univerzitetna knjižnica / National and University Library of Slovenia – Associated Partner
  ● Metadata for ?k pages
● National Library of Portugal – Associated Partner
  ● Metadata for 400k pages
● National Library of Romania – Associated Partner
  ● Metadata for 442k pages
Aggregation and Indexing Plan – Q3 2014
● Bibliothèque nationale de France / National Library of France – Option 5
  ● Next batch of 2.388k full-text pages
● Eesti Rahvusraamatukogu / Estonian National Library – Option 1
  ● Next batch of 594k full-text pages and images
Aggregation and Indexing Plan – Q3 2014
● Staatsbibliothek zu Berlin / Berlin State Library – Option 1
  ● Next batch of 248k full-text pages and images
● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
  ● Next batch of 1.707k full-text pages and images
Aggregation and Indexing Plan – Q4 2014
● Bibliothèque nationale de France / National Library of France – Option 5
  ● Final batch of 2.388k full-text pages
● Eesti Rahvusraamatukogu / Estonian National Library – Option 1
  ● Final batch of 594k full-text pages and images
Aggregation and Indexing Plan – Q4 2014
● Staatsbibliothek zu Berlin / Berlin State Library – Option 1
  ● Final batch of 248k full-text pages and images
● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1
  ● Final batch of 1.707k full-text pages and images
● Kansalliskirjasto / National Library of Finland – Option 1
  ● Final batch of 132k full-text pages and images
Operations Officers
Anastasia Gasia
Junior Operations Officer
anastasia.gasia@kb.nl
Chiara Latronico
Operations Officer
chiara.latronico@kb.nl
Operations Mailbox: collections@theeuropeanlibrary.org
Thank you for your attention!
Markus Muhr (markus.muhr@kb.nl)
www.europeana-newspapers.eu