Linked Data: opportunities and challenges

32
Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.i e Enabling Networked Knowledge Linked Data: opportunities and challenges Dr. Michael Hausenblas, DERI, NUI Galway Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012

description

Presentation about opportunities and challenges concerning Linked Data at the Open Science Data Cloud NSF PIRE Workshop [1] on 18 July 2012 in Edinburgh, UK. [1] http://www.opensciencedatacloud.org/osdc-edinburgh-workshop-71612-71712/

Transcript of Linked Data: opportunities and challenges

Page 1: Linked Data: opportunities and challenges

Copyright 2011 Digital Enterprise Research Institute. All rights reserved.

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Data: opportunities and

challengesDr. Michael Hausenblas, DERI, NUI Galway

Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012

Page 2: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Linked Data 101

Page 3: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Linked Data principles

① Use URIs to identify the “things” in your data

② Use HTTP URIs so people & machines can look them up

③ When a URI is looked up return a description of the thing in a structured format (RDF)

④ Link to related things to provide context

http://www.w3.org/DesignIssues/LinkedData.html

Page 4: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

HTTP URIs

http://sw-app.org/mic.xhtml#i

Page 5: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

HTTP URIs

http://dbpedia.org/resource/Edinburgh

curl -L -H "Accept: application/rdf+xml" http://dbpedia.org/resource/Edinburgh

<?xml version="1.0" encoding="utf-8" ?><rdf:RDF

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"xmlns:owl="http://www.w3.org/2002/07/owl#"xmlns:foaf="http://xmlns.com/foaf/0.1/"xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"xmlns:dbpedia-owl="http://dbpedia.org/ontology/"xmlns:dcterms="http://purl.org/dc/terms/"xmlns:dbpprop="http://dbpedia.org/property/"

xmlns:ns10="http://dbpedia.org/property/start/" > <rdf:Description rdf:about="http://dbpedia.org/resource/Firrhill_High_School"> <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" /> </rdf:Description> <rdf:Description rdf:about="http://dbpedia.org/resource/Murrayfield_Stadium"> <dbpedia-owl:location rdf:resource="http://dbpedia.org/resource/Edinburgh" /> <dbpprop:location rdf:resource="http://dbpedia.org/resource/Edinburgh" /> </rdf:Description> <rdf:Description rdf:about="http://dbpedia.org/resource/Stewart%27s_Melville_College"> <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" /> <dbpprop:city rdf:resource="http://dbpedia.org/resource/Edinburgh" /> </rdf:Description>

Page 6: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

HTTP URIs

http://data.ordnancesurvey.co.uk/id/7000000000017765

curl -L -H "Accept: text/turtle" http://data.ordnancesurvey.co.uk/id/7000000000017765

<http://data.ordnancesurvey.co.uk/doc/7000000000017765> rdf:type foaf:Document, dctype:Text ; foaf:primaryTopic <http://data.ordnancesurvey.co.uk/id/7000000000017765> ; dct:title "Linked Data for The County of Hampshire" ; dct:hasFormat <http://data.ordnancesurvey.co.uk/doc/7000000000017765.rdf> , <http://data.ordnancesurvey.co.uk/doc/7000000000017765.html> , <http://data.ordnancesurvey.co.uk/doc/7000000000017765.json> , <http://data.ordnancesurvey.co.uk/doc/7000000000017765.ttl> .

<http://data.ordnancesurvey.co.uk/id/7000000000017636> rdfs:label "Tadley" ; skos:prefLabel "Tadley" .

<http://data.ordnancesurvey.co.uk/id/7000000000017510> rdfs:label "Newton Valence" ; skos:prefLabel "Newton Valence" .

<http://data.ordnancesurvey.co.uk/id/7000000000017817> rdfs:label "Ashmansworth" ; skos:prefLabel "Ashmansworth" .

Page 7: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

HTTP URIs

http://bio2rdf.org/genbank:AC008393

curl -L -H "Accept: text/turtle” http://bio2rdf.org/genbank:AC008393

@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .@prefix owl: <http://www.w3.org/2002/07/owl#> .@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix dc: http://purl.org/dc/elements/1.1/ .

<http://bio2rdf.org/genbank:AC008393> a <http://bio2rdf.org/genbank_resource:Sequence> ;rdfs:label "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence

[genbank:AC008393]" ;owl:sameAs <http://bio2rdf.org/genbank:ac008393> ;dc:title "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence" ;dc:modified "26-FEB-2002" ;<http://bio2rdf.org/bio2rdf_resource:length> "166847" ;<http://bio2rdf.org/bio2rdf_resource:linkedToFrom>

<http://bio2rdf.org/iproclass:A8MPR0> , <http://bio2rdf.org/iproclass:B5MCC3> .

Page 8: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

HTTP URIs

http://bnb.data.bl.uk/doc/resource/009468944

curl -L -H "Accept: text/turtle” http://bnb.data.bl.uk/doc/resource/009468944

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .@prefix dct: <http://purl.org/dc/terms/> .@prefix blterms: <http://www.bl.uk/schemas/bibliographic/blterms#> .@prefix elements: <http://iflastandards.info/ns/isbd/elements/> .@prefix bibo: <http://purl.org/ontology/bibo/> .@prefix owl: <http://www.w3.org/2002/07/owl#> .@prefix foaf: <http://xmlns.com/foaf/0.1/> .@prefix linked-data: <http://purl.org/linked-data/api/vocab#> .@prefix void: <http://rdfs.org/ns/void#> .

<http://bnb.data.bl.uk/id/resource/009468944> dct:language <http://lexvo.org/id/iso639-3/eng> ; rdfs:seeAlso <http://www4.wiwiss.fu-berlin.de/bookmashup/books/0859761541> ; elements:P1053 "vii,147p."@en ; rdfs:label "William Wallace / Andrew Fisher" ; blterms:bnb "GB8714157" ; dct:creator <http://bnb.data.bl.uk/id/person/FisherAndrew1935-> ; bibo:isbn10 "0859761541" ; dct:title "William Wallace" ; rdf:type bibo:Book , dct:BibliographicResource ; dct:subject <http://bnb.data.bl.uk/id/concept/ddc/e19/941.1020924> .

Page 9: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Linked Open Data

Page 10: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

20082007

20082008

20082009

20092010

10

Linked Open Data cloud

Page 11: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Over 300 open data sets with 40 billion facts, interlinked by 500 million typed links.

http://lod-cloud.net/

Linked Open Data cloud

Page 12: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Open Data cloud stats

triples distribution

links distribution

http://lod-cloud.net/state/

Page 13: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

… cost and benefits

Page 14: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Linked Data life cycles

Page 15: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Data life cycles

LOD cloud

5stardata.info

Neologism

Schema.org

Google Refine

D2RQ

FYN

LATC DSI

LATC 24/7 data-gov.ie

http://linked-data-life-cycles.info

Page 16: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Modeling

LOD cloud

5stardata.info

Neologism

Schema.org

Google Refine

D2RQ

FYN

LATC DSI

LATC 24/7 data-gov.ie

Page 17: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Neologism

http://neologism.deri.ie/

Page 18: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Neologism

http://vocab.data.gov/

Page 19: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Schema.org – Linked Data

Page 20: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Publishing

LOD cloud

5stardata.info

Neologism

Schema.org

Google Refine

D2RQ

FYN

LATC DSI

LATC 24/7 data-gov.ie

Page 21: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Google Refine extension

http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/

Page 22: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

RDB2RDF – D2RQ

http://d2rq.org/

Page 23: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Discovery

LOD cloud

5stardata.info

Neologism

Schema.org

Google Refine

D2RQ

FYN

LATC DSI

LATC 24/7 data-gov.ie

Page 24: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Follow-Your-Nose

24

Page 25: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

http://dsi.lod-cloud.net/

Dataset discovery

Page 26: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Integration

LOD cloud

5stardata.info

Neologism

Schema.org

Google Refine

D2RQ

FYN

LATC DSI

LATC 24/7 data-gov.ie

Page 27: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Why linking?

Central Contractor Registration (CCR)

Geonames

http://webofdata.wordpress.com/2011/05/22/why-we-link/

Page 28: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Effort distribution

Third Party Effort

Consumer‘s Effort

Publisher‘s Effort

Fix Overall Data Integration

Effort

Page 29: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

LATC – Interlinking Platform

http://latc-project.eu/platform

Page 30: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

http://www4.wiwiss.fu-berlin.de/latc/toollibrary/screencast.html

Page 31: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Conclusion

Use the LOD cloud as test-bed (experiments) Benefit from LOD cloud in apps (context) Contribute to make your data more valuable

Op

po

rtu

nit

ies

Large-scale processing of Linked Data Distributed/federated SPARQL queries Quality of links and the data C

hal

len

ges

Page 32: Linked Data: opportunities and challenges

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Resources

Tutorials, technologies, specifications: http://linkeddatabook.com http://lod-cloud.net http://linkeddata.org http://linkeddata-specs.info http://schema.rdfs.org

Videos: http://ted.com/talks/tim_berners_lee_on_the_next_web.html - Tim Berners-Lee’s TED

talk http://www.youtube.com/watch?v=GKfJ5onP5SQ - Linked Data (and the Web of Data) http://www.youtube.com/watch?v=4x_xzT5eF5Q - What is Linked Data? http://vimeo.com/36752317 - Linked Open Data (by Europeana)