Linked Data

download Linked Data

If you can't read please download the document

Transcript of Linked Data

  1. 1. Linking Open Data
      • Danny Ayers
      • SemAst2009
  2. 2. Obligatory Children Slide
  3. 3. Credits:
      • Many slides derived from those of
      • Tom Heath, Michael Hausenblas, Chris Bizer, Richard Cyganiak, Olaf Hartig
  4. 4. Quote
      • Linked Data is the Semantic Web done right, and the Web done right.
      • - Tim Berners-Lee
  5. 5.
    • Introduce the concept of Linked Data
    • Highlight why you would want to publish Linked Data on the Web
    • Introduce the principles and best practices of publishing Linked Data on the Web
    • Demonstrate the creation and consumption of Linked Data
    • Answer your burning Linked Data publishing questions
    Objectives
  6. 6. Overview
    • From a Web of Documents to a Web of Data
      • Web APIs, Microformats, and Linked Data
    • Linked Data Deployment on the Web
      • What data is out there?
    • Applications
      • What is being done with the data?
  7. 7. Classic Data : in silos Image: Bob Jagensdorf, http://flickr.com/photos/darwinbell/, CC-BY
  8. 8. The Classic Web
    • Single global information space
    • URLs as
      • globally unique IDs
      • retrieval mechanism
    • HTML as shared content format
    • Hyperlinks
    • Shortcomings
    • Content is not well structured
    • You can not ask expressive queries
    • You can not process content within applications
    B C HTML HTML HTML WebBrowsers SearchEngines hyper- links A
  9. 9. What do we actually want? Use the Web like asingle globaldatabase.
  10. 10. Solution
    • Web APIs
    • Microformats
    • Linked Data
    Publish structured data directly on the Web Different Approaches
  11. 11. Web APIs
  12. 12. Mashups
    • Positive
    • APIs expose structured data
    • APIs enable new applications
    • Negative
    • Usually proprietary interfaces
    • Mashups are based only on fixed set of sources
    • You can't usually set hyperlinks between data objects
    Mashup Up Web API A Web API B Web API C Web API D
  13. 13. Web APIs slice the Web into separate data silos Image: Bob Jagensdorf, http://flickr.com/photos/darwinbell/, CC-BY
  14. 14. What do we want? Use the Web as a single global databaseusing what we know works on the Web
  15. 15. Connecting Worlds One Web! Data Documents
  16. 16. What works on the Web?
    • Uniform Identifiers (URIs)
    • Common Interface Protocol (HTTP)
    • Standard Representation Formats (HTML, Atom/RSS etc)
  17. 17. What works on the Web?
    • The Hyperlink
  18. 18. Evolving the Link
    • Home Page
    page.html home.html
  19. 19. Evolving the Link
    • rel=home>
    • Home Page
    page.html home.html home
  20. 20. Microformats
    • Embed structured data into HTML pages.
    • hCard, hCalender, hReview, XFN,
    • Compatible with the idea of the Web as single information space.
    • Shortcomings
      • Only a fixed set of microformats exist
      • No direct way to connect data items
    bdigitalMay 20 -22
  21. 21. Evolving the Link
      • rdf:about= http://example.org/page >
      • rdf:resource= http://example.org/home />
    page home x:home
  22. 22. Linked Data B C Thing typed links A D E typed links typed links typed links Thing Thing Thing Thing Thing Thing Thing Thing Thing
    • Use Semantic Web technologies to
    • publish structured data on the Web,
    • set links between data from one data sourceto data within other data sources .
  23. 23. RDF Relations
      • rdf:about= http://example.org/page >
      • rdf:resource= http://example.org/home />
    x:home subject object http://example.org /page ... http://example.org /home ...
  24. 24. Identifiers
    • Identification by Description
    • Non-HTTP URIs
    • HTTP URIs
  25. 25. Linked Data Principles
    • Use URIs as names for things
    • Use HTTP URIs so that people can look up those names.
    • When someone looks up a URI, provide useful information.
    • Include links to other URIs. so that they can discover more things.
    • Tim Berners-Lee 2007
    • http://www.w3.org/DesignIssues/LinkedData.html
  26. 26. The RDF Data Model Richard Cyganiak dbpedia:Berlin foaf:name foaf:based_near foaf:Person rdf:type pd:cygri
  27. 27. Data objects are identified with HTTP URIs pd:cygri Richard Cyganiak dbpedia:Berlin foaf:name foaf:based_near foaf:Person rdf:type pd:cygri= http://richard.cyganiak.de/foaf.rdf#cygri dbpedia:Berlin= http://dbpedia.org/resource/Berlin
  28. 28. Follow Your Nose... Richard Cyganiak dbpedia:Berlin foaf:name foaf:based_near foaf:Person rdf:type pd:cygri dp:Cities_in_Germany 3.405.259 dp:population skos:subject
  29. 29. Dereferencing URIs over the Web Richard Cyganiak dbpedia:Berlin foaf:name foaf:based_near foaf:Person rdf:type dbpedia:Hamburg dbpedia:Muenchen skos:subject skos:subject pd:cygri dp:Cities_in_Germany 3.405.259 dp:population skos:subject
  30. 30. Universal Connector rdfs:seeAlso
  31. 31. Universal Connector rdfs:seeAlso foaf:Person rdf:type
  32. 32. Take Care! owl:sameAs
  33. 33. The Disco Hyperdata Browser
  34. 35. Semantic Web Specifications
    • HTTP + URIs
    • RDF + RDFs
    • OWL
    • SKOS
    • SPARQL
    • RDFa
    • GRDDL
    • Turtle/N3
    Coming soon: POWDER, OWL2, RIF
  35. 36. What of Web Services? Web APIs Are Just Web Sites - Paul Downey Linked Data is exposed as (very simple) RESTful services
  36. 37. What of RDF Stores? A triplestore is just a cache of a chunk of the Semantic Web Convenient for data merging, querying (SPARQL) and inference See also: Semantic Web Client Library
  37. 38. What of the Real World? source:http://danbri.org/words/2008/04/15/300 based on timbl 1994
  38. 39. W3C Linking Open Data Project
    • Community effort to
      • publish existing open license datasets as Linked Data on the Web
      • interlink things between different data sources
  39. 40. LOD Datasets on the Web:May 2007
  40. 41. LOD Datasets on the Web:August 2007
  41. 42. LOD Datasets on the Web: February 2008
  42. 43. LOD Datasets on the Web: September 2008
  43. 44. LOD Datasets on the Web:March 2009
  44. 45. Spotlight: Geonames
    • over 8 milliongeographicallocations
    • featurehierarchy
  45. 46. Spotlight: DBpedia
    • extracts structured data from Wikipedia.
    • covers over 2.2 million concepts from various domains.
  46. 47. Example RDF Links
    • RDF links from DBpedia to other data sources
    • RDF link from a FOAF profile to DBpedia
    owl:sameAs . foaf:topic_interest . owl:sameAs .
  47. 48.
    • Universities and Research Institutes
      • Massachusetts Institute of Technology (USA)
      • University of Southampton (UK)
      • Freie Universitt Berlin (DE)
      • DERI (IRE)
      • KMi, Open University (UK)
      • University of London (UK)
      • Universitt Hannover (DE)
      • University of Pennsylvania (USA)
      • Universitt Leipzig (DE)
      • Universitt Karlsruhe (DE)
      • Joanneum (AT)
      • University of Toronto (CA)
    Organizations publishing Linked Data
    • Companies
      • BBC (UK)
      • OpenLink (UK)
      • Zitgist (USA)
      • Talis (UK)
      • Garlik (UK)
      • Mondeca (FR)
      • Cyc Foundation (USA)
  48. 49. Applications B C Thing typed links A D E typed links typed links typed links Thing Thing Thing Thing Thing Thing Thing Thing Thing SearchEngines Linked Data Mashups Linked Data Browsers
    • What can I do with this?
  49. 50. Linked Data Browsers
    • Tabulator Browser (MIT, USA)
    • Marbles (FU Berlin, DE)
    • OpenLink RDF Browser (OpenLink, UK)
    • Zitgist RDF Browser (Zitgist, USA)
    • Disco Hyperdata Browser (FU Berlin, DE)
    • Fenfire (DERI, Irland)
  50. 51. Tabulator
  51. 52. Linked Data Mashups
    • Domain-specific applications
    • - using Linked Data from the Web
  52. 53. DBpedia Mobile
    • Geospatial entry point into the Web of Data
    • Starts with DBpedia, Revyu and Flickr data
  53. 54. Web of Data Search Engines
    • Falcons (IWS, China)
    • Sindice (DERI, Ireland)
    • MicroSearch (Yahoo, Spain)
    • Watson (Open University, UK)
    • SWSE (DERI, Ireland)
    • Swoogle (UMBC, USA)
  54. 55. Falcons
  55. 56. Why publish Linked Data on the Web?
    • Your data becomes part of a single global data space (the Web of Data = Semantic Web)
    • People can use various data browsers to explore your data
    • Your data is crawled by Semantic Web search engines and can be used by independent applications
    • People start setting links to your data, which allows more people to find and use your data
  56. 57. Why publish Linked Data on the Web?
    • Linked Data builds on the classic architecture of the Web
    • Linked Data is more generic then Web APIs or Microformats
      • Builds on standards in contrast to proprietary Web APIs
      • Enables applications that work against an unbound set of data sources and incorporate new data sources as they become available on the Web
  57. 58. Linked Data Principles
    • Use URIs as names for things
    • Use HTTP URIs so that people can look up those names.
    • When someone looks up a URI, provide useful information.
    • Include links to other URIs. so that they can discover more things.
    • Tim Berners-Lee 2007
    • http://www.w3.org/DesignIssues/LinkedData.html
  58. 59. Example : Source Data
  59. 60. Example : First Pass @prefix rdf: . @prefix rdfs: . @prefix owl: . @prefix x: . x:Megrez rdf:type x:A3_dwarf ; x:magnitude "3.1" ; x:partOf [ rdf:type x:Constellation ; x:hasName "Ursa Major"] . (RDF Turtle/N3 Syntax)
  60. 61. Example : Vocabulary @prefix rdf: . @prefix rdfs: . @prefix x: . x:Star rdf:type rdfs:Class . x:Constellation rdf:typerdfs:Class . x:A3_Dwarf rdfs:subClassOf x:Star . x:magnitude rdf:type rdf:Property . x:partOfrdf:type rdf:Property . x:hasNamerdf:type rdf:Property .
  61. 62. But...
    • Use URIs as names for things
    • Use HTTP URIs so that people can look up those names.
    • When someone looks up a URI, provide useful information.
    • Include links to other URIs. so that they can discover more things.
    Existing vocabularies and resource URIs are likely to be well-linked already reuse is good!
  62. 63. Example : Better Linkage @prefix rdf: . @prefix rdfs: . @prefix x: . @prefix astro: . @prefix foaf: . @prefix dolce: . @prefix dbpedia: . dbpedia:Delta_Ursae_Majoris rdf:type x:A3_Dwarf ; foaf:name "Megrez"; x:magnitude "3.1" ; dolce:part-of dbpedia:Ursa_Major ; rdfs:seeAlso . dbpedia:Ursa_Major rdf:type astro:Constellation .
  63. 64. Mass-Publication Techniques
    • Generally either:
    • bulk conversion
    • live mapping from existing databases
  64. 65. New Vocabularies : voiD Vocabulary of Interlinked Datasets Classes:Dataset | Linkset | TechnicalFeature | Properties:statItem | feature | subset | target | sparqlEndpoint | linkPredicate | exampleResource | vocabulary | subjectsTarget | objectsTarget | dataDump | uriLookupEndpoint | uriRegexPattern |
  65. 66. New Vocabularies : Scovo Statistical Core Vocabulary
  66. 67. "It is important to look as it as an interconnection bus, and it hasn't started working in earnest until one person's data is being used by some other unplanned use. This unexpected reuse is the measure." -Tim Berners-Lee, re. State of the Semantic Web Finishing Quotes
  67. 68. "Engineer for serendipity." -Roy T. Fielding, re. REST Finishing Quotes
  68. 69. http://hyperdata.org/astronomy/ Related Links etc.