Practical Cross-Dataset Queries with SPARQL (Introduction)

Practical Cross-Dataset Queries on the Web of Data. Tutorial @ WWW2012, Lyon, France. Richard Cyganiak, Knud Möller, Anja Jentzsch, Andreas Schultz, Robert Isele, Pablo Mendes

description

Introduction slide deck for the tutorial on “Practical Cross-Dataset Queries for the Web of Data” we presented at WWW2012. The tutorial homepage, including other presentations, is here: http://latc-project.eu/events/www2012-tutorial-cross-dataset-queries

Transcript of Practical Cross-Dataset Queries with SPARQL (Introduction)

Page 1: Practical Cross-Dataset Queries with SPARQL (Introduction)

Practical Cross-Dataset Queries on the Web of Data

Tutorial @ WWW2012, Lyon, France

Richard Cyganiak, Knud Möller, Anja Jentzsch, Andreas Schultz, Robert Isele, Pablo Mendes

Page 2: Practical Cross-Dataset Queries with SPARQL (Introduction)

The Web is becoming a platform for data exchange.

• Microdata, Schema.org, web APIs, Linked Data Cloud, Open Data movement, …

• Often need to combine local and remote data from several heterogeneous sources

• Scripting and mash-ups. This works, but can we do better?

Page 3: Practical Cross-Dataset Queries with SPARQL (Introduction)

SPARQL as a query language for the Web

• Data from all of these data sources can be converted to RDF using off-the-shelf tools, or the sources are already RDF.

• SPARQL is W3C's standard query language for RDF

• SPARQL 1.1 just out, great new features for working with heterogeneous data

Page 4: Practical Cross-Dataset Queries with SPARQL (Introduction)

Caveats

• We will focus on ad-hoc queries.

• This is not just about what works, but also about what doesn't work.

Page 5: Practical Cross-Dataset Queries with SPARQL (Introduction)

How to get data into RDF format

• Relational: R2RML standard; D2RQ, Virtuoso RDF Views, Revelytix Spyder

• Excel, CSV: RDF Extension for Google Refine, XLWrap

• XML: XSPARQL

• JSON: JSON-LD

• Microformats, Microdata: Apache Any23

• Collect data from many web pages: LDSpider
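To make the idea of "getting data into RDF" concrete, here is a minimal sketch of a CSV to N-Triples conversion using only the Python standard library. It does not use any of the tools listed above; the `http://example.org/` base IRI, the `vocab/` predicate scheme, and the function names are assumptions made for illustration.

```python
import csv
import io
from urllib.parse import quote


def escape_literal(value: str) -> str:
    """Escape a string for use inside a double-quoted N-Triples literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n")
                 .replace("\r", "\\r"))


def csv_to_ntriples(csv_text: str, base: str, key_column: str) -> list[str]:
    """Emit one subject per CSV row and one triple per non-empty column.

    Assumption: subjects are minted as <base + key> and predicates as
    <base + "vocab/" + column name>. Real mapping tools are configurable.
    """
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{base}{quote(row[key_column])}>"
        for column, value in row.items():
            if column is None or column == key_column or not value:
                continue
            predicate = f"<{base}vocab/{quote(column)}>"
            lines.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return lines


sample = "id,name,city\n1,Alice,Lyon\n2,Bob,Berlin\n"
for triple in csv_to_ntriples(sample, "http://example.org/", "id"):
    print(triple)
```

All values become plain string literals here; a real conversion would usually also assign datatypes and reuse existing vocabularies.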

Page 6: Practical Cross-Dataset Queries with SPARQL (Introduction)

SPARQL: The big picture

Page 7: Practical Cross-Dataset Queries with SPARQL (Introduction)

Scenario: Remote SPARQL endpoint

[Diagram: SPARQL client, connected via the SPARQL Protocol to a SPARQL engine with an RDF store]
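As a rough illustration of this scenario, the sketch below builds a SPARQL Protocol request by hand: the query goes into the `query` URL parameter of an HTTP GET, and the `Accept` header asks for JSON results. The endpoint URL `http://example.org/sparql` and the function name are placeholders, and the code assumes the endpoint URL has no query string of its own.

```python
import urllib.parse
import urllib.request


def build_sparql_request(endpoint: str, query: str) -> urllib.request.Request:
    """Build (but do not send) a SPARQL Protocol query request via HTTP GET."""
    params = urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        f"{endpoint}?{params}",
        headers={"Accept": "application/sparql-results+json"},
    )


QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
request = build_sparql_request("http://example.org/sparql", QUERY)
print(request.full_url)

# Sending it requires a reachable endpoint, for example:
#   import json
#   with urllib.request.urlopen(request) as response:
#       results = json.load(response)
```

In practice a SPARQL client library or command line tool handles these details; the point is only that the remote scenario is ordinary HTTP.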

Page 8: Practical Cross-Dataset Queries with SPARQL (Introduction)

Scenario: Local SPARQL store

[Diagram: SPARQL client, SPARQL engine, RDF store]

Page 9: Practical Cross-Dataset Queries with SPARQL (Introduction)

Scenario: Local SPARQL engine, load data from files on the fly, no store

[Diagram: SPARQL client, SPARQL engine, local RDF file, remote RDF file, non-RDF file with conversion]

Page 10: Practical Cross-Dataset Queries with SPARQL (Introduction)

Scenario: CONSTRUCT the input data

[Diagram: SPARQL client, SPARQL engine, SPARQL CONSTRUCT queries, SPARQL engines with RDF store and local RDF file]
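What a CONSTRUCT query does can be sketched with a toy in-memory matcher: find all bindings for the WHERE patterns, then fill a template with each binding to produce new triples. This is not a SPARQL engine and not the tutorial's own material; triples are plain string tuples, variables are strings starting with `?`, and the `ex:`, `foaf:` and `rdfs:` names are illustrative only.

```python
def match(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`, else return None."""
    result = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable must keep the same value everywhere it occurs.
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return result


def construct(graph, where, template):
    """Toy CONSTRUCT: join the `where` patterns, then instantiate `template`."""
    bindings = [{}]
    for pattern in where:
        bindings = [extended
                    for binding in bindings
                    for triple in graph
                    if (extended := match(pattern, triple, binding)) is not None]
    output = set()
    for binding in bindings:
        for tpl in template:
            # Skip template triples that still contain an unbound variable.
            if all(not t.startswith("?") or t in binding for t in tpl):
                output.add(tuple(binding.get(t, t) for t in tpl))
    return output


graph = {
    ("ex:alice", "foaf:name", "Alice"),
    ("ex:bob", "foaf:name", "Bob"),
    ("ex:alice", "foaf:knows", "ex:bob"),
}
where = [("?person", "foaf:name", "?name")]
template = [("?person", "rdfs:label", "?name")]
for triple in sorted(construct(graph, where, template)):
    print(triple)
```

The output graph contains only the rewritten triples, which is why CONSTRUCT is a convenient way to prepare input data for a later query.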

Page 11: Practical Cross-Dataset Queries with SPARQL (Introduction)

Scenario: Federated Query

[Diagram: SPARQL client, SPARQL engine, Basic Federated Query, SPARQL engine with RDF store, local RDF file]
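In SPARQL 1.1 Federated Query, the part of a query that should run on another endpoint is wrapped in a `SERVICE <endpoint> { ... }` block. The sketch below only assembles such a query as a string so its shape is visible; the helper name, the endpoint `http://example.org/sparql`, and the patterns are hypothetical.

```python
def federated_query(variables, local_pattern, endpoint, remote_pattern):
    """Assemble a SELECT query whose second pattern runs at a remote endpoint."""
    projection = " ".join(variables)
    return (
        f"SELECT {projection} WHERE {{\n"
        f"  {local_pattern}\n"
        f"  SERVICE <{endpoint}> {{\n"
        f"    {remote_pattern}\n"
        f"  }}\n"
        f"}}"
    )


query = federated_query(
    ["?city", "?label"],
    "?city <http://www.w3.org/2002/07/owl#sameAs> ?remote .",
    "http://example.org/sparql",  # placeholder endpoint
    "?remote <http://www.w3.org/2000/01/rdf-schema#label> ?label .",
)
print(query)
```

The local engine evaluates the first pattern against its own data, sends the `SERVICE` block to the named endpoint, and joins the two result sets on the shared variable `?remote`.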

Page 12: Practical Cross-Dataset Queries with SPARQL (Introduction)

… or any combination of these.

Page 13: Practical Cross-Dataset Queries with SPARQL (Introduction)

Agenda – Morning

• Linked Data Basics

• SPARQL Basics

• 10:30–11:00 Coffee

• Federated queries with SPARQL

• Hands-on session 1

• 12:30–13:30 Lunch

Page 14: Practical Cross-Dataset Queries with SPARQL (Introduction)

Agenda – Afternoon

• 12:30–13:30 Lunch

• Schema mapping with SPARQL CONSTRUCT

• Instance matching with Silk

• Finding RDF datasets

• 15:00–15:30 Coffee

• Visualizing SPARQL query results

• Hands-on session 2

• 17:00 Adjourn

Page 15: Practical Cross-Dataset Queries with SPARQL (Introduction)

Hands-on sessions

• USB sticks with data, queries, and instructions

• Install Apache Jena command line tools

• Need a browser with a JavaScript console (recommended: Firefox+Firebug or Chrome)

Page 16: Practical Cross-Dataset Queries with SPARQL (Introduction)

Music

Page 17: Practical Cross-Dataset Queries with SPARQL (Introduction)

Presenters

• Richard Cyganiak, DERI

• Knud Möller, Talis

• Anja Jentzsch, FU Berlin

• Andreas Schultz, FU Berlin

• Robert Isele, FU Berlin

• Pablo Mendes, FU Berlin

• (Christophe Guéret, VUA)

• (Michael Hausenblas, DERI)

Page 18: Practical Cross-Dataset Queries with SPARQL (Introduction)

Please interrupt and ask questions!