CartoHeritage 2011: Georeferencer & MapRank Search

46
International Cartographic Association: 6th International Workshop on Digital Approaches in Cartographic Heritage The Hague, Netherlands, 7-8 April 2011 Petr Přidal [email protected] GEOREFERENCER: COLLABORATIVE ONLINE GEOREFERENCING TOOL FOR SCANNED MAPS Moravian Library Brno: OldMapsOnline.org & TEMAP R&D grant of the Czech Ministry of Culture (DC08P02OUK006)

description

Presentation from the workshop: Digital Approaches in Cartographic Heritage 2011: 6th International Workshop, The Hague, 7-8 April 2011.http://xeee.web.auth.gr/ICA-Heritage/Commission/6th_Workshop/TheHague/Presentations:P. Pridal, Georeferencer: Collaborative online georeferencing tool for scanned maps.P. Pridal, MapRank Search: Intuitive geographical searching in map collections and metadata catalogs.

Transcript of CartoHeritage 2011: Georeferencer & MapRank Search

Page 1: CartoHeritage 2011: Georeferencer & MapRank Search

International Cartographic Association: 6th International Workshop on Digital Approaches in Cartographic HeritageThe Hague, Netherlands, 7-8 April 2011

Petr Př[email protected]

GEOREFERENCER:COLLABORATIVE ONLINE GEOREFERENCING TOOL FOR

SCANNED MAPS

Moravian Library Brno: OldMapsOnline.org & TEMAPR&D grant of the Czech Ministry of Culture (DC08P02OUK006)

Page 2: CartoHeritage 2011: Georeferencer & MapRank Search

Large collections of old maps are scanned and published online in digital repositories. But ...

... once they are georeferenced, they are much more attractive and useful: visualizations, comparing, location context, annotations, geographical search...

INTRODUCTION

MAPS ARE NOT ONLY IMAGES!

Page 3: CartoHeritage 2011: Georeferencer & MapRank Search

• Hand made georeferencing is time consuming process

• The scanned collections are two large (thousands of maps to handle)

• There are more technical approaches how to georeference old maps, not always clear workflow especially for pre-1850 - you need to keep this “how”

• Disk space: georeferenced copy next to the original scan

• GIS software and knowledge necessary

• Workers need to have access to large files (JPEG/TIFF/...)

PROBLEMS

Page 4: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER AS A SOLUTION

• User-friendly online tool: runs in a web browser

• Simply visit a website, log-in and you can start

• Easy and enjoyable process

• Open for online visitors, organized volunteers, crowdsourcing

• No GIS software necessary, no local files to handle

• GIS-ready outputs possible - you can easily create GeoTIFF/GeoJP2 if you have access to the high-resolution images

• Service for EU libraries, archives & citizens

• Developed in a library - for libraries and other culture heritage institutions

GEOREFERENCER: TURN AN IMAGE INTO A MAP.

Page 5: CartoHeritage 2011: Georeferencer & MapRank Search
Page 6: CartoHeritage 2011: Georeferencer & MapRank Search
Page 7: CartoHeritage 2011: Georeferencer & MapRank Search
Page 8: CartoHeritage 2011: Georeferencer & MapRank Search

• Enrichment of metadata (MARC, DC, ISO 19139)

• Engaging visitors

• Attractive 3D visualization (Google Earth)

• GIS services (WMS, WMTS/TMS, ...)

• Statistics & crediting

MOTIVATION FOR GEOREFERENCER

INSTITUTIONS: INDIVIDUALS:

• Georeference any online image on the Internet

• Combine images from different institutions

• Create your own virtual collection

• Conduct research online, compare, analyze, share, ...

Page 9: CartoHeritage 2011: Georeferencer & MapRank Search

INSTITUTIONS: PILOT APPLICATIONS

• Moravian Library, Czech Republic (students)

• Nationaal Archief, Netherlands (organized volunteers)

• National Library of Scotland (online visitors, anybody)

TRY THE GEOREFERENCER PILOT:HTTP://NLS.GEOREFERENCER.COM/

OR THE PUBLIC SERVICE FOR INDIVIDUALS:HTTP://WWW.GEOREFERENCER.ORG/

Page 10: CartoHeritage 2011: Georeferencer & MapRank Search

http://nls.georeferencer.com/

Page 11: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER PILOTSPilot: Nationaal Archief. 13 volunteers. Great feedback!

Page 12: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER PILOTSResult: 650 maps georeferenced in less then one month!

Metadata enrichment.

Page 13: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER PILOTS

Crowdsourcing: Retired volunteers. Excellent local knowledge. Competition & a public winner.

Page 14: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER PILOTS

First public pilot: the National Library of Scotland.

Page 15: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER PILOTSAnybody can participate. It is easy & fun.

http://geo.nls.uk/maps/georeferencer/

Page 16: CartoHeritage 2011: Georeferencer & MapRank Search

• Unique scalable architecture. One centralized service for all libraries and maps. Upgrades and new functionality deployed automatically.

• Runs “in the cloud” - Google AppEngine / Amazon.

• Server side based on open-source GIS stack (Proj4, GDAL, MapSever). Closure Library on the client side.

ADVANCED TECHNOLOGY

MINIMAL EFFORT TO PARTICIPATE.IMAGES ARE USED DIRECTLY FROM EXISTING ONLINE

REPOSITORIES!

Page 17: CartoHeritage 2011: Georeferencer & MapRank Search

NEXT STEPS?

Page 18: CartoHeritage 2011: Georeferencer & MapRank Search

GEOREFERENCER: NEW DESIGN

Page 19: CartoHeritage 2011: Georeferencer & MapRank Search

SPATIAL REFERENCE SYSTEMS

Coordinate systems with map projection, for example OSGB or S-JTSK and in general any reference system

from EPSG database usable in Georeferencer

Page 20: CartoHeritage 2011: Georeferencer & MapRank Search

IMPROVED BASE MAPSOut-of-copyright base maps, agreements with local

authorities for using official mapping data (WMS).Fast and visually appealing version of OpenStreetMap as

an alternative - probably from MapQuest. OpenAerialMap.

Page 21: CartoHeritage 2011: Georeferencer & MapRank Search

HIGH QUALITY GEOCODING

Fast and reliable search in gazeteer database based on OpenStreetMap or GeoNames.org data.

Online service, without license restrictions of Google GeoCoding API - usable both in GeoReferencer a

GeoParser services.

Page 22: CartoHeritage 2011: Georeferencer & MapRank Search

API FOR DATA INTERCHANGEAND 3RD PARTY FUNCTIONALITY

REST API based on JSON for exchange of data between Georeferencer and third-party applications.

JavaScript API for easy integration into library websites.

Page 23: CartoHeritage 2011: Georeferencer & MapRank Search

COLLABORATIVE EDITING

Technical solution of the problem of concurent team editing of map georeference by serveral people in the

same moment - locking and communication a la online chat in a web browser (via HTTP Comet)

Page 24: CartoHeritage 2011: Georeferencer & MapRank Search

Social aspects:Motivation. Team. Competition. Guiding. Rewarding.

CROWDSOURCING

Page 25: CartoHeritage 2011: Georeferencer & MapRank Search

Research and development of algorithms and

application for raster image analysis.

Target: detection of scans of the same map (via

“image hash”) and following automatical assignment of ground

control points.

DUPLICATES + AUTOGEOREFERENCING

Page 26: CartoHeritage 2011: Georeferencer & MapRank Search

Automatic detection of map sheet borders based on the image analysis algorithms.

NEATLINE DETECTION

Page 27: CartoHeritage 2011: Georeferencer & MapRank Search

Database of Czech map series with GIS-ready map indexes.

Integration with Georeferencer: georeferencing with the help of existing map

indexes.

Authorities for cataloguing of map series.

Maybe a new project:OpenMapIndexes.org

GEOREFERENCING OF MAP SERIES

Page 28: CartoHeritage 2011: Georeferencer & MapRank Search

WEBGL EARTH

Google Earth-like visualization of georeferenced maps in a web browser or on mobile devices.

100% open-source, no license restrictions, runs without a need to install any extra software. Under development.

http://www.webglearth.com/

Page 29: CartoHeritage 2011: Georeferencer & MapRank Search

http://www.webglearth.com/

Page 30: CartoHeritage 2011: Georeferencer & MapRank Search

HTTP://NLS.GEOREFERENCER.COM/HTTP://WWW.GEOREFERENCER.ORG/

[email protected]

• Institutions interested in participation

• Share expectations - ideas for improvement?

• Sources of support on international level: development of a service for EU institutions and citizens?

• We are seeking research partners: image processing, ...

COOPERATE. CONTACT US.

Page 31: CartoHeritage 2011: Georeferencer & MapRank Search

International Cartographic Association: 6th International Workshop on Digital Approaches in Cartographic HeritageThe Hague, Netherlands, 7-8 April 2011

Petr Př[email protected]

INTUITIVE GEOGRAPHICAL SEARCHING IN MAP COLLECTIONS AND METADATA CATALOGS

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 32: CartoHeritage 2011: Georeferencer & MapRank Search

• Research, scientific articles, work on Ph.D. thesis (Petr Přidal), Computer Science + Geodesy and Cartography

• Alternative projects tested in OldMapsOnline.org (R&D project in MZK):

• Lucene with GeoTemporal Ext. (LGTE) from DigMap.eu

• Alexandria Digital Library

• TimeMap.net (ECAI.org)

• Google GeoSearch API

• Knowledge of traditional GIS search methods (PostGIS, ...)

GEOGRAPHICAL SEARCHING

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Result: Development of a new geographical search engine for map collections.

Page 33: CartoHeritage 2011: Georeferencer & MapRank Search

Explore library catalog by browsing a map.

Page 34: CartoHeritage 2011: Georeferencer & MapRank Search

• Map-driven interface: Simple. Intuitive. Web 2.0. Users love it!

• Combined queries: Where, When, Who & What

• Results sorted geographically with an advanced spatial ranking algorithm.

• Instant search. Immediate delivery of the results by moving the map or typing.

• It is fast. Even for large databases (million+ records).

CORE FEATURES

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 35: CartoHeritage 2011: Georeferencer & MapRank Search

• Indexing metadata in MARC, DublinCore, ISO 19139 formats, ...

• Batch import or harvesting: OAI-PMH or OGC CWS

• Numerical geographical boundaries necessary! (MARC 034, DublinCore DCMI) - enrichment with Geoparser or Georeferencer tools

• Possible customization of the user interface, integration into products or websites

• For developers: REST API (JSON format), queries following OpenSearch.org

METADATA & CUSTOMIZATION

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 36: CartoHeritage 2011: Georeferencer & MapRank Search

Try it at: http://demo.mapranksearch.com/

Page 37: CartoHeritage 2011: Georeferencer & MapRank Search

Try it at: http://rumsey.mapranksearch.com/

Page 38: CartoHeritage 2011: Georeferencer & MapRank Search

Try it at: http://mapy.mzk.cz/

Page 39: CartoHeritage 2011: Georeferencer & MapRank Search

TOUCH INTERFACE

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 40: CartoHeritage 2011: Georeferencer & MapRank Search

• MapRank ranking: an advanced algorithm calculating similarity of geographical area of query and area covered by every document and combining it with map scale, time range, predefined categories and traditional full text search.

• Data indexing implemented specifically to provide ranked real-time results even for large databases. Traditional R-Tree indexing or Lucene are failing.

• Server component: FastCGI, C++

• Default user interface developed with Closure JavaScript Library + Closure Compiler.

MAPRANK RANKING & SCALABILITY

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 41: CartoHeritage 2011: Georeferencer & MapRank Search

Geoparser

Text parsing and semi-automatic assignment of bounding boxes to map records with detectable geonames (gazetteer), known physical size and map scale.

Georeferencer

Crowdsourced georeferencing. Higher accuracy, more outputs beside geometadata such as visualization, 3D view, comparing of maps, OGC WMS for use in GIS.

ENRICHMENT OF METADATA

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Page 42: CartoHeritage 2011: Georeferencer & MapRank Search
Page 43: CartoHeritage 2011: Georeferencer & MapRank Search
Page 44: CartoHeritage 2011: Georeferencer & MapRank Search

NEXT STEPS?

Page 45: CartoHeritage 2011: Georeferencer & MapRank Search

IMPROVEMENT OF THE SYSTEM

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

• The ranking: new requirements in Switzerland, union catalog. Boosting some maps, etc.

• Improvements related to map series - visualization of map indexes in the search interface

• Faceted search - refine the query: suggested authors

• Advanced search options: alternative sorting, ...

IDEA: FEDERATED MAP SEARCH SYSTEM

Page 46: CartoHeritage 2011: Georeferencer & MapRank Search

CONTACT & LINKS

KLOKAN TECHNOLOGIES - Museo Sans 500

KLOKAN TECHNOLOGIES - Museo Sans 500 ItalicKLOKAN TECHNOLOGIES - Museo Sans 700

KLOKAN TECHNOLOGIES - Museo 300

KLOKAN TECHNOLOGIES - Museo 500KLOKAN TECHNOLOGIES - Museo 700

CMYK100 75 10 0

RGB0 81 151

#005096

Try the applications:

http://www.kartenportal.ch/

http://rumsey.mapranksearch.com/

http://mapy.mzk.cz/

http://www.mapranksearch.com/

[email protected]

More details and enquiries: