Finding, Linking and Organizing Resources with Linked Data & Natural Language Processing Paul...

Post on 02-Jan-2016


Finding, Linking and Organizing Resources with Linked Data & Natural Language Processing

Paul Buitelaar

Unit for Natural Language Processing, Digital Enterprise Research Institute - National University of Ireland, Galway

Copyright 2010 Digital Enterprise Research Institute. All rights reserved, Paul Buitelaar

What does that mean?

What is a (re)source? What is a link?

What resources can we link – and how?

How to find and organize resources and links?

Let’s go through an example…

Finding, Linking & Organizing Resources

Linking van Gogh (resources)

Linking van Gogh (links)

Linking van Gogh (objects)

personObj-1

personRepresentationObj-1: Vincent van Gogh

personBirthObj-1: March 30th, 1853; locationObj-1: Zundert

personDeathObj-1: July 29th, 1890; locationObj-2: Auvers-sur-Oise

personObj-2

personRepresentationObj-2: Theo van Gogh

Finding Resources (and Links)

Structured Data

(Proprietary databases, thesauri etc.)

Open-domain databases, thesauri, etc. …

… increasingly turned into ‘Linked Open Data’

Unstructured Data

(Proprietary textual descriptions, images, videos etc.)

Open-domain textual descriptions, images, videos etc. …

… to be turned into & connected with ‘Linked Open Data’

Finding Links in Text

Linking van Gogh - continued

personObj-1

personRepresentationObj-1: Vincent van Gogh

artistObj-1

artistTechniqueObj-1: bold brushstrokes

personRepresentationObj-2: Diego Velazquez

artistObj-2

The Remainder of this Talk

Linked Open Data (LOD)

Some LOD applications & tools

LOD and Natural Language Processing

Linked Open Data

Turning the web of documents into a Web of Data

Uniquely identifying web objects (documents, images, named-entities, facts, …)

Enabling the discovery & interlinking of web objects through semantic metadata

Open access to data
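To make the points above concrete, here is a minimal sketch in plain Python of how web objects might be given unique identifiers and interlinked as subject-predicate-object statements, then written out in N-Triples syntax. The `http://example.org/` namespace, the `birthPlace` property, and the helpers `Literal`, `term` and `to_ntriples` are assumptions made for illustration; they are not taken from the talk. Only the object names (`personObj-1`, `locationObj-1`) come from the van Gogh example.

```python
EX = "http://example.org/"  # hypothetical namespace, for illustration only
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


class Literal(str):
    """Marks a plain string value, as opposed to a URI."""


def term(value):
    """Render one triple component in N-Triples syntax."""
    if isinstance(value, Literal):
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        return f'"{escaped}"'
    return f"<{value}>"


def to_ntriples(triples):
    """Serialize (subject, predicate, object) tuples, one statement per line."""
    return "\n".join(f"{term(s)} {term(p)} {term(o)} ." for s, p, o in triples)


triples = [
    # Each object gets its own URI; labels are attached as literals.
    (EX + "personObj-1", RDFS_LABEL, Literal("Vincent van Gogh")),
    (EX + "personObj-1", EX + "birthPlace", EX + "locationObj-1"),
    (EX + "locationObj-1", RDFS_LABEL, Literal("Zundert")),
    # A link to another dataset's identifier for the same person.
    (EX + "personObj-1", OWL_SAME_AS,
     "http://dbpedia.org/resource/Vincent_van_Gogh"),
]

print(to_ntriples(triples))
```

The last statement is the kind of cross-dataset link that makes the data "linked": a consumer that knows either identifier can discover the other.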

Linked Open Data ‘cloud’

Linked Open Media Data

LOD Applications

Search Engine for the Web of Data SIGMA http://sig.ma (builds on http://sindice.com/)

Contact: Giovanni Tummarello, DERI

Music Recommendation http://dbrec.net

Contact: Alexandre Passant, DERI

Research Collaboration Support, Expert Finding http://saffron.deri.ie/

Contact: Paul Buitelaar, DERI

Search the Web of Data with SIGMA

More Data – but also more issues…

dbrec : the Web of Data recommends…

Mary Black is related to Frances Black …

… and this is why

Saffron : Expert Finding

Expertise Topic Extraction

Publication Browsing

Expert Browsing

Publication Details: Abstract/PDF

Publication Details: Authors/Topics

Expertise Topic Details

Personalization

Personalized Expert Recommendation

Linking Objects in Saffron

Author

Document

Title

PDF

Topic

Topic

Affiliation

Researcher

Picture

Researcher

ExpertiseTopic

Other LOD Application Areas

Linked Open Drug Data (Matthias Samwald, DERI)

http://esw.w3.org/HCLSIG/LODD - W3C WG includes participation by Johnson & Johnson, AstraZeneca

http://esw.w3.org/HCLSIG/LODD/Data

Open Government Data (Richard Cyganiak, DERI)

http://linkeddata.deri.ie/node/72 - includes data sets from USA, UK, Australia, Canada, Sweden, New Zealand

Library Linked Data (Jodi Schneider, DERI) http://www.w3.org/2005/Incubator/lld/

Financial Linked Data (Sean O’Riain, DERI)

Linking Enterprise Data for Business Intelligence

Linked with extracted Financial Facts (amounts) – annotated with semantic metadata (financial meaning) according to eXtensible Business Reporting Language (XBRL)

http://www.monnet-project.eu/

Financial Linked Data

Some LOD Tools

‘RDB2RDF’ - mapping relational DBs to RDF http://www.w3.org/2001/sw/rdb2rdf/ (incl. Survey Report)

‘Silk’ (Freie Universitaet Berlin) - specify links to use in discovering relationships between LOD data items http://www4.wiwiss.fu-berlin.de/bizer/silk/

Semantic Drupal, ‘sparqlviews’ (Lin Clark, DERI) - easy integration of Linked Data in CMS Drupal http://semantic-drupal.com/

http://drupal.org/project/sparql_views

EU Projects http://latc-project.eu/ http://lod2.eu/
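As an illustration of what "mapping relational DBs to RDF" can mean in practice, the following minimal sketch turns each row of a table into a set of triples: the primary key becomes the subject URI and each column becomes a predicate. It is a simplification written for this transcript, not the mapping language or tooling of the W3C RDB2RDF group; the base URI, the `artist` table, and the function `rows_to_triples` are all hypothetical.

```python
import sqlite3

BASE = "http://example.org/db/"  # hypothetical base URI


def rows_to_triples(conn, table, key):
    """Yield (subject, predicate, object) for each non-null cell of `table`.

    `table` and `key` are assumed to be trusted identifiers, since they are
    interpolated directly into the SQL statement.
    """
    cursor = conn.execute(f"SELECT * FROM {table} ORDER BY {key}")
    columns = [description[0] for description in cursor.description]
    for row in cursor:
        record = dict(zip(columns, row))
        subject = f"{BASE}{table}/{record[key]}"
        for column, value in record.items():
            if column == key or value is None:
                continue  # the key is already in the subject; skip NULLs
            yield (subject, f"{BASE}{table}#{column}", value)


# Toy in-memory database, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE artist (id INTEGER PRIMARY KEY, name TEXT, born TEXT)")
conn.executemany(
    "INSERT INTO artist VALUES (?, ?, ?)",
    [(1, "Vincent van Gogh", "1853-03-30"), (2, "Theo van Gogh", None)],
)

triples = list(rows_to_triples(conn, "artist", "id"))
for triple in triples:
    print(triple)
```

NULL cells produce no triple at all, which reflects the usual RDF convention of leaving unknown values unstated rather than storing an explicit empty value.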

Open LOD Issues

How to integrate new LOD into the LOD cloud – with addition of information rather than duplication?

Entity consolidation

dbpedia:JohnSmith owl:sameAs bbcmusic:JohnSmith

Vocabulary alignment

geonames:location owl:sameAs dbpedia:place

How to identify the most fitting LOD resources for a particular application/domain?

Estimate application/domain semantics

Match application/domain semantics with LOD semantics
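The entity consolidation issue above can be sketched as follows: identifiers connected by `owl:sameAs` are grouped into one equivalence class (here with a small union-find), and the facts attached to any member are merged under a single canonical identifier, so that a fact stated in two datasets is added once rather than duplicated. This is only one possible approach, assumed for illustration. The `ex:` properties and values are placeholders, and only the two `JohnSmith` identifiers come from the slide.

```python
from collections import defaultdict

SAME_AS = "owl:sameAs"


def consolidate(triples):
    """Merge facts about identifiers linked by owl:sameAs.

    Returns {canonical_id: {(predicate, object), ...}}. Only subjects are
    canonicalized in this sketch; objects are left untouched.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the lexicographically smaller id as canonical,
            # so the result does not depend on input order.
            if root_b < root_a:
                root_a, root_b = root_b, root_a
            parent[root_b] = root_a

    for s, p, o in triples:
        if p == SAME_AS:
            union(s, o)

    merged = defaultdict(set)
    for s, p, o in triples:
        if p != SAME_AS:
            merged[find(s)].add((p, o))
    return dict(merged)


triples = [
    ("dbpedia:JohnSmith", SAME_AS, "bbcmusic:JohnSmith"),
    ("dbpedia:JohnSmith", "ex:hometown", "ex:PlaceA"),    # placeholder fact
    ("bbcmusic:JohnSmith", "ex:album", "ex:AlbumA"),      # placeholder fact
    ("bbcmusic:JohnSmith", "ex:hometown", "ex:PlaceA"),   # duplicate of the above
]

for entity, facts in consolidate(triples).items():
    print(entity, sorted(facts))
```

Because facts are stored in a set, the `ex:hometown` statement that appears in both sources survives only once after merging.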

Linked Open Data

LOD and Natural Language Processing

Domain/Application Semantics

Linked Open Data for Domain/Application

[Diagram labels: Domain Corpus (X, Y, Z); LOD vocabularies (X, Y, Z); instances Y1, Z1]

LOD instances from domain corpus
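The slides do not say how domain semantics are estimated or matched against LOD vocabularies. As one simple stand-in, the sketch below takes frequent words from a domain corpus as a rough estimate of the domain's semantics and ranks candidate vocabularies by how many of those terms appear among their labels. The corpus sentences, the vocabulary names and labels, and both function names are invented for illustration.

```python
import re
from collections import Counter

STOPWORDS = frozenset({"the", "of", "and", "a", "in", "to", "is", "for", "with"})


def domain_terms(corpus, min_count=2):
    """Return the set of non-stopword tokens occurring at least `min_count` times."""
    words = re.findall(r"[a-z]+", " ".join(corpus).lower())
    counts = Counter(word for word in words if word not in STOPWORDS)
    return {word for word, count in counts.items() if count >= min_count}


def rank_vocabularies(terms, vocabularies):
    """Score each vocabulary by the fraction of domain terms found among its labels."""
    scores = {}
    for name, labels in vocabularies.items():
        label_set = {label.lower() for label in labels}
        scores[name] = len(terms & label_set) / len(terms) if terms else 0.0
    # Highest score first; ties broken by vocabulary name.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical domain corpus and candidate vocabularies.
corpus = [
    "The painter used bold brushstrokes in the painting.",
    "The painting shows the painter and the museum bought the painting.",
]
vocabularies = {
    "vocab-art": ["Painter", "Painting", "Museum"],
    "vocab-music": ["Album", "Artist", "Track"],
}

terms = domain_terms(corpus)
print(rank_vocabularies(terms, vocabularies))
```

A realistic system would replace the word counts with proper term extraction and the exact label match with something more tolerant, but the overall shape (estimate, then match) follows the two steps named in the open issues.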

Acknowledgements & Further Info

DERI colleagues on all things ‘linked open data’, for more info

http://linkeddata.deri.ie/

The Saffron team (in alphabetical order)

Georgeta Bordea, Fergal Monaghan, Krystian Samp

http://saffron.deri.ie/

Grant support

Science Foundation Ireland Grant No. SFI/08/CE/I1380 for Lion-2 http://nlp.deri.ie/

EU FP7 Grant No. 248458 for the Monnet project on Multilingual Ontologies for Networked Knowledge http://www.monnet-project.eu