Open belgium 2015 - open tourism

Post on 14-Jul-2015

496 views 2 download

Transcript of Open belgium 2015 - open tourism

Open Tourism! The importance of enriching your online content with semantic annotations.

#openbelgium15 #opentourism

Open Tourism! AN OPEN KNOWLEDGE BELGIUM WORKING GROUP

@rafke #opentourism

WHAT

WHAT

WHATCOULDBE?

WHATSHOULDWE DO?

Joined forces with ‘sustainable mobile tourism guides’, Thomas More - iMinds

Open Tourism! Enriching your online content with semantic annotations.

Anastasia Dimou

@natadimou #opentourism

http://dbpedia.org/resource/Belfry_of_ghent

Data model - Schemas - Vocabularies Identify an Entity

http://dbpedia.org/resource/Belfry_of_ghent

Data model - Schemas - Vocabularies Identify an Entity

http://dbpedia.org/resource/Belfry_of_ghent

Describe the attributes of an Entity

Belfry construction_started 1313Belfry construction_finished 1380

Data model - Schemas - Vocabularies Identify an Entity

http://dbpedia.org/resource/Belfry_of_ghent

Describe the attributes of an Entity

Belfry construction_started 1313Belfry construction_finished 1380

Describe the relationship of an Entity with other EntitiesBelfry located_in Gent

Reasons to semantically enrich your data

• discoverability / searchabilityinformation recognized by major search providers

• indexing

• cross-referenced data

• (structured) data interoperability / integration / reuse

• automation

Semantically Enrich your Web presence Web of Documents

Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsdata catalogues (raw data with metadata)

Linked Open Data

Inline Markup of HTML Web pagesnested metadata within HTML Web pages content (RDFa, Microdata)

(+) Search Engine Optimization (SEO) and Indexing(+) Single infrastructure(-) manual annotation by editors(-) hard to maintain(-) adjust existing Content Management System (CMS)

plugin OR custom code

1-time adjustment/development cost + maintenance cost (perhaps extra adjustment/development cost)

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsdata catalogues (raw data with metadata)

Linked Open Data

Embedded metadata within HTML Web pagesstructured data islands embedded in HTML Web pages (JSON-LD)

(+) Search Engine Optimization (SEO) and Indexing(+) Single infrastructure(+) easily deployable(-) manual incorporation by admins

perhaps 1-time adjustment/development cost + maintenance cost (perhaps extra adjustment/development cost)

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsdata catalogues (raw data with metadata)

Linked Open Data

Raw metadata complementary to the Web pagesstructured data complementary to the Web pages (raw JSON-LD)

(+) easily deployable(-) manual annotation by experts(-) shortage of crawlers

at no cost

Semantically Enrich your Web presence

Many Web sites are generated from structured data, which is often stored in databases.

and in general, you have more data to share…

Semantically Enrich your Web presence Web of Documents

Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Open Dataraw data

(+) simple solution(+) at no cost

data as Web APIs(+) easily deployable(+) publishing data in Web readable formats(+) low cost - low maintenance cost

(-) reusable data only from people who know this data exists(-) not synchronized with the Web site content

Semantically Enrich your Web presence Web of Documents

Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Open Data with metadata - Data catalogues(data) with metadata → data catalogues

(+) catalogue integration with existing CMSs(+) increased Searchability / Discoverability(+) Federated structure: easily set up instances with common search

1-time cost to set up the infrastructure + maintenance costintegration with CMS:

1-time cost + maintenance/synchronization cost

Semantically Enrich your Web presence Web of Documents

Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Linked Open Data semantically annotated data

domain modelingmap raw data to their semantic representationpublishing Linked Open Data

(+) Answering to queries(+) data integration / interlinking

1-time (human resources) cost for the domain modeling1-time cost to set up the infrastructure (mapping and publishing)maintenance cost

Reasons to semantically enrich your data

• discoverability / searchabilityinformation recognized by major search providers

• indexing

• cross-referenced data

• (structured) data interoperability / integration / reuse

• automation

Open Tourism! How to add semantic annotations?

Laurens De Vocht

@Laurens_d_v #opentourism

Without Metadata With Metadata

Web Documents HTML HTML with rich snippets

HTML with schema.org

Raw Data File Dump, API DCAT, VOID (description)

CKAN (catalog)

Linked Data RDF (data model)

file dumps, dereferencing (retrieval)

endpoint, API (queries)

Adding Semantic Annotations

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Adding Metadata to Web Documents

<div>

<a href="http://www.example.com/events/spinaltap">Spinal Tap</a>

<img src="spinal_tap.jpg" />

After their highly-publicized search for a new drummer,

Spinal Tap kicks off their latest comeback tour with a San

Francisco show.

When: Oct 15, 7:00PM—9:00PM

Where: Warfield Theatre, 982 Market St, San Francisco, CA

<div xmlns:v="http://rdf.data-vocabulary.org/#" typeof="v:Event">

<a href="http://www.example.com/events/spinaltap" rel="v:url"

property="v:summary">Spinal Tap</a>

<img src="spinal_tap.jpg" rel="v:photo" />

<span property="v:description">After their highly-publicized search for a new drummer,

Spinal Tap kicks off their latest comeback tour with a San Francisco show. </span>

When:

<span property="v:startDate" content="2015-10-15T19:00-08:00">Oct 15,

7:00PM</span>

<span property="v:endDate" content="2015-10-15T21:00-08:00">9:00PM</span>

Where:

<span rel="v:location">

<span typeof="v:Organization">

<span property="v:name">Warfield Theatre</span>,

<span rel="v:address">

<span typeof="v:Address">

<span property="v:street-address">982 Market St</span>,

<span property="v:locality">San Francisco</span>,

<span property="v:region">CA</span>

</span>

</span>...

what search enginesdisplay to us

what search engines read

Adding Metadata to Web Documentshttp://rdf.data-vocabulary.org/a 'lightweight' alternative tohttp://schema.org

schema.org:has broader range of typesmore documentationbroad support by different search engines

it has types for Event, Organization, Person, Product, Review, AggregateRating, Offer and hundreds of others. full list -> http://schema.org/docs/full.html

Schemas - Vocabularies - Schema.org

collection of schemas to markup HTML pages

Leveraging the CMS infrastructure

source: builtwith.com

Wordpress, Drupal and Joomla cover over 50% of the internet website CMS.

Wordpress: Adding Rich Snippets Example

e.g. use Google SEO Pressor Plugin(https://wordpress.org/plugins/google-seo-author-snippets/)

Wordpress: Adding Schema.org Example

e.g. use Add Metadata Tags Plugin(https://wordpress.org/plugins/add-meta-tags/)

Other CMS offer plugins with similar functionality

e.g. Joomla: J4Schema(http://extensions.joomla.org/extensions/extension/site-management/seo-a-metadata/j4schema)

e.g. Drupal 7 Schema.org (https://www.drupal.org/node/1194024)

Alternative to CMS plugins

Custom Scripts

Manual Annotations

Annotation Templates

Does it really work? How to be sure?

Try with your HTML snippet or URL -> https://developers.google.com/structured-data/testing-tool/

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

What's next?

Raw

Data

CMS

?

publishing raw data directly

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as Web APIsraw data with metadata (data catalogues)

Linked Open Data

Publishing Data Directly

e.g. http://datahub.io/dataset/outbound-travel-by-new-zealanders

Publish link to raw data (file dump) from website to platform like datahub.io

How can others find it?

Add metadata and register it!

catalog with metadata of dataset

(Meta)Data Catalog

CKAN is one of the most common platforms for maintaining a data catalog

e.g. Flanders Open Data portal: opendataforum.info is built on top of CKAN.e.g. Also datahub.io makes use of it.

(Meta)Data Description of DataDCAT VOID

:ds1 a dcat:Dataset ;

dcat:distribution :dist1 .

:dist1 a dcat:Download ;

dcat:accessURL <http://example.org/dist1.csv>;

dcat:format [ rdfs:label "CSV" ].

integrate with catalogs,such as CKAN.http://www.w3.org/TR/vocab-dcat/

express additional information such as categories, links with other datasets, hierarchical relationships etchttp://www.w3.org/TR/void/

:DBpedia rdf:type void:Dataset ;

foaf:homepage <http://dbpedia.org/> .

:DBLP rdf:type void:Dataset ;

foaf:homepage <http://www4.wiwiss.fu-berlin.de/dblp/all> ;

dcterms:subject <http://dbpedia.org/resource/Computer_science> ;

:DBpedia void:subset :DBpedia2DBLP .

:DBpedia2DBLP rdf:type void:Linkset ;

void:target :DBpedia ;

void:target :DBLP .what machines see (more)

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues )

Linked Open Data

How to query and link datasets?

Raw

Data

CMS

querying, linking?

?

?

The catalog indicates supported ways of accessing data.

API

Querying

Website

Dereferencing

Metadata

http://datahub.io/dataset/tourpedia

Adding Semantics and Representing Data as Linked Data

When?

http://rml.io http://www.w3.org/TR/r2rml/

How?

CSV, HTML,

JSON, XMLRDB Transformation

http://d2rq.org/

Layer on top of RDB

http://any23.apache.org/

Web documents to triples

Raw

Data

CMS

Other

Raw DataCatalog

Web of Documents

Web of Data

Semantically Enrich your Web presence

Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document

Web of DataOpen Data

raw data AND data as APIsraw data with metadata (data catalogues)

Linked Open Data

Open Tourism! Barriers and Solutions to Open Tourism Data

Panel discussion

@rafke @mindwraps #opentourism

Panel discussion

Join our discussion on http://bit.ly/opendisc

Marc

@mportier

Anastasia

@natadimou

Laurens

@Laurens_d_v

Raf

@rafke

Veronique

@VeroniqueCosse

Wouter

@mindwraps

join Open Tourism working group!

#opentourism

raf{dot}buyle{at}okfn{dot}be

A world where knowledge creates power for the many, not the few.

A world where data frees us — to make informed choices about how we live, what we buy and who gets our vote.

A world where information and insights are accessible — and apparent — to everyone.

This is the world we choose

#openbelgium15 #opentourism