Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State and University Library

Post on 29-Aug-2014

1.512 views 2 download

Tags:

description

 

Transcript of Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State and University Library

Controlled Vocabularies and Text Mining –

Use Cases at the Goettingen State and University Library

Ralf StockmannTextGrid Workshops – July 13th 2011

Textmining

Enhanced Context-Search

Multilingual Access

DBPedia, ...

Visualisation

Metadata

OCR/Fulltext

Named Entity Recognition

Catalog Data

Crowd- sourcing

Annotation Tools

Relationship Graphs

Linked Open Data

Ontologies

Scholars

Libraries

Reposi-tories

Textmining

Enhanced Context-Search

Multilingual Access

DBPedia, ...

Visualisation

Metadata

OCR/Fulltext

Named Entity Recognition

Catalog Data

Crowd- sourcing

Annotation Tools

Relationship Graphs

Linked Open Data

Ontologies

Scholars

Libraries

Reposi-tories

Use case #1:

eAqua

Projekt: eAqua

• Partners:– Institut of Computer Science - Computerlinguistic,

Leipzig (Büchler, Eckart, Heyer, Baumgardt)– SUB Göttingen (Stockmann, Kothe, Mahnke)

• Comparing semantic graphs between– Headings of journal articles and– Fulltext of the same articles

Search Term „socialism“ on title elements

„Mephisto“ on fulltext

Textmining

Enhanced Context-Search

Multilingual Access

DBPedia, ...

Visualisation

Metadata

OCR/Fulltext

Named Entity Recognition

Catalog Data

Crowd- sourcing

Annotation Tools

Relationship Graphs

Linked Open Data

Ontologies

Scholars

Libraries

Reposi-tories

Use case #2:

Europeana 4D visualisation

Partner:

Concept

MAP

Concept

MAP TIMELINE

Concept

MAP TIMELINE

• Multiple data layers• Interaction• Animation• Aggregation of data• Connections• Drilldown• Historical/custom

maps• Result table• Splitting Datasets• ...

Refinement

Technological Framework

• OpenLayers• Simile Timeline/Timeplot• GeoNames (Geoparser...)• Explorer Canvas (Google)• GeoServer (OpenStreetmap, Google Maps)• Google Web Toolkit (GWT)• KML (XML)

Data Model

WHAT?

NAME

description

url

MANDATORY

optional

Data Model

WHAT?

WHERE?

NAME

description

url

COORDINATES

address

MANDATORY

optional

KML

Data Model

WHAT?

WHERE? WHEN?

NAME

description

url

COORDINATES

address

TIMESTAMP

range

MANDATORY

optional

Exchange Format: KML (XML)

Questonnaire

Questonnaire

Questonnaire

Questonnaire

Questonnaire

Datasets

• Library catalog• Flickr• IMDB• DBpedia• WikiLeaks

Flickr: „tsunami“

Use your own data in 5 easy steps!

1. Take a look at the .kml specificationhttp://tinyurl.com/e4d-kml

2. Build your own KML dataset3. Upload it to a webserver4. Put the URL into the prototype at http://tinyurl.com/e4d-

demo25. Share your set via the magnetic link!

Ressources

• e4D info website: http://tinyurl.com/e4d-project

• Europeana thoughtLab: http://www.europeana.eu/portal/thoughtlab.html