Post on 14-Jul-2015
Open Tourism! The importance of enriching your online content with semantic annotations.
#openbelgium15 #opentourism
Open Tourism! AN OPEN KNOWLEDGE BELGIUM WORKING GROUP
@rafke #opentourism
WHAT
…
WHAT
WHATCOULDBE?
WHATSHOULDWE DO?
Joined forces with ‘sustainable mobile tourism guides’, Thomas More - iMinds
Open Tourism! Enriching your online content with semantic annotations.
Anastasia Dimou
@natadimou #opentourism
http://dbpedia.org/resource/Belfry_of_ghent
Data model - Schemas - Vocabularies Identify an Entity
http://dbpedia.org/resource/Belfry_of_ghent
Data model - Schemas - Vocabularies Identify an Entity
http://dbpedia.org/resource/Belfry_of_ghent
Describe the attributes of an Entity
Belfry construction_started 1313Belfry construction_finished 1380
Data model - Schemas - Vocabularies Identify an Entity
http://dbpedia.org/resource/Belfry_of_ghent
Describe the attributes of an Entity
Belfry construction_started 1313Belfry construction_finished 1380
Describe the relationship of an Entity with other EntitiesBelfry located_in Gent
Reasons to semantically enrich your data
• discoverability / searchabilityinformation recognized by major search providers
• indexing
• cross-referenced data
• (structured) data interoperability / integration / reuse
• automation
Semantically Enrich your Web presence Web of Documents
Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsdata catalogues (raw data with metadata)
Linked Open Data
Inline Markup of HTML Web pagesnested metadata within HTML Web pages content (RDFa, Microdata)
(+) Search Engine Optimization (SEO) and Indexing(+) Single infrastructure(-) manual annotation by editors(-) hard to maintain(-) adjust existing Content Management System (CMS)
plugin OR custom code
1-time adjustment/development cost + maintenance cost (perhaps extra adjustment/development cost)
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsdata catalogues (raw data with metadata)
Linked Open Data
Embedded metadata within HTML Web pagesstructured data islands embedded in HTML Web pages (JSON-LD)
(+) Search Engine Optimization (SEO) and Indexing(+) Single infrastructure(+) easily deployable(-) manual incorporation by admins
perhaps 1-time adjustment/development cost + maintenance cost (perhaps extra adjustment/development cost)
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsdata catalogues (raw data with metadata)
Linked Open Data
Raw metadata complementary to the Web pagesstructured data complementary to the Web pages (raw JSON-LD)
(+) easily deployable(-) manual annotation by experts(-) shortage of crawlers
at no cost
Semantically Enrich your Web presence
Many Web sites are generated from structured data, which is often stored in databases.
and in general, you have more data to share…
Semantically Enrich your Web presence Web of Documents
Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Open Dataraw data
(+) simple solution(+) at no cost
data as Web APIs(+) easily deployable(+) publishing data in Web readable formats(+) low cost - low maintenance cost
(-) reusable data only from people who know this data exists(-) not synchronized with the Web site content
Semantically Enrich your Web presence Web of Documents
Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Open Data with metadata - Data catalogues(data) with metadata → data catalogues
(+) catalogue integration with existing CMSs(+) increased Searchability / Discoverability(+) Federated structure: easily set up instances with common search
1-time cost to set up the infrastructure + maintenance costintegration with CMS:
1-time cost + maintenance/synchronization cost
Semantically Enrich your Web presence Web of Documents
Inline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Linked Open Data semantically annotated data
domain modelingmap raw data to their semantic representationpublishing Linked Open Data
(+) Answering to queries(+) data integration / interlinking
1-time (human resources) cost for the domain modeling1-time cost to set up the infrastructure (mapping and publishing)maintenance cost
Reasons to semantically enrich your data
• discoverability / searchabilityinformation recognized by major search providers
• indexing
• cross-referenced data
• (structured) data interoperability / integration / reuse
• automation
Open Tourism! How to add semantic annotations?
Laurens De Vocht
@Laurens_d_v #opentourism
Without Metadata With Metadata
Web Documents HTML HTML with rich snippets
HTML with schema.org
Raw Data File Dump, API DCAT, VOID (description)
CKAN (catalog)
Linked Data RDF (data model)
file dumps, dereferencing (retrieval)
endpoint, API (queries)
Adding Semantic Annotations
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Adding Metadata to Web Documents
<div>
<a href="http://www.example.com/events/spinaltap">Spinal Tap</a>
<img src="spinal_tap.jpg" />
After their highly-publicized search for a new drummer,
Spinal Tap kicks off their latest comeback tour with a San
Francisco show.
When: Oct 15, 7:00PM—9:00PM
Where: Warfield Theatre, 982 Market St, San Francisco, CA
<div xmlns:v="http://rdf.data-vocabulary.org/#" typeof="v:Event">
<a href="http://www.example.com/events/spinaltap" rel="v:url"
property="v:summary">Spinal Tap</a>
<img src="spinal_tap.jpg" rel="v:photo" />
<span property="v:description">After their highly-publicized search for a new drummer,
Spinal Tap kicks off their latest comeback tour with a San Francisco show. </span>
When:
<span property="v:startDate" content="2015-10-15T19:00-08:00">Oct 15,
7:00PM</span>
<span property="v:endDate" content="2015-10-15T21:00-08:00">9:00PM</span>
Where:
<span rel="v:location">
<span typeof="v:Organization">
<span property="v:name">Warfield Theatre</span>,
<span rel="v:address">
<span typeof="v:Address">
<span property="v:street-address">982 Market St</span>,
<span property="v:locality">San Francisco</span>,
<span property="v:region">CA</span>
</span>
</span>...
what search enginesdisplay to us
what search engines read
Adding Metadata to Web Documentshttp://rdf.data-vocabulary.org/a 'lightweight' alternative tohttp://schema.org
schema.org:has broader range of typesmore documentationbroad support by different search engines
it has types for Event, Organization, Person, Product, Review, AggregateRating, Offer and hundreds of others. full list -> http://schema.org/docs/full.html
Schemas - Vocabularies - Schema.org
collection of schemas to markup HTML pages
Leveraging the CMS infrastructure
source: builtwith.com
Wordpress, Drupal and Joomla cover over 50% of the internet website CMS.
Wordpress: Adding Rich Snippets Example
e.g. use Google SEO Pressor Plugin(https://wordpress.org/plugins/google-seo-author-snippets/)
Wordpress: Adding Schema.org Example
e.g. use Add Metadata Tags Plugin(https://wordpress.org/plugins/add-meta-tags/)
Other CMS offer plugins with similar functionality
e.g. Joomla: J4Schema(http://extensions.joomla.org/extensions/extension/site-management/seo-a-metadata/j4schema)
e.g. Drupal 7 Schema.org (https://www.drupal.org/node/1194024)
Alternative to CMS plugins
Custom Scripts
Manual Annotations
Annotation Templates
Does it really work? How to be sure?
Try with your HTML snippet or URL -> https://developers.google.com/structured-data/testing-tool/
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
What's next?
Raw
Data
CMS
?
publishing raw data directly
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as Web APIsraw data with metadata (data catalogues)
Linked Open Data
Publishing Data Directly
e.g. http://datahub.io/dataset/outbound-travel-by-new-zealanders
Publish link to raw data (file dump) from website to platform like datahub.io
How can others find it?
Add metadata and register it!
catalog with metadata of dataset
(Meta)Data Catalog
CKAN is one of the most common platforms for maintaining a data catalog
e.g. Flanders Open Data portal: opendataforum.info is built on top of CKAN.e.g. Also datahub.io makes use of it.
(Meta)Data Description of DataDCAT VOID
:ds1 a dcat:Dataset ;
dcat:distribution :dist1 .
:dist1 a dcat:Download ;
dcat:accessURL <http://example.org/dist1.csv>;
dcat:format [ rdfs:label "CSV" ].
integrate with catalogs,such as CKAN.http://www.w3.org/TR/vocab-dcat/
express additional information such as categories, links with other datasets, hierarchical relationships etchttp://www.w3.org/TR/void/
:DBpedia rdf:type void:Dataset ;
foaf:homepage <http://dbpedia.org/> .
:DBLP rdf:type void:Dataset ;
foaf:homepage <http://www4.wiwiss.fu-berlin.de/dblp/all> ;
dcterms:subject <http://dbpedia.org/resource/Computer_science> ;
:DBpedia void:subset :DBpedia2DBLP .
:DBpedia2DBLP rdf:type void:Linkset ;
void:target :DBpedia ;
void:target :DBLP .what machines see (more)
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues )
Linked Open Data
How to query and link datasets?
Raw
Data
CMS
querying, linking?
?
?
The catalog indicates supported ways of accessing data.
API
Querying
Website
Dereferencing
Metadata
http://datahub.io/dataset/tourpedia
Adding Semantics and Representing Data as Linked Data
When?
http://rml.io http://www.w3.org/TR/r2rml/
How?
CSV, HTML,
JSON, XMLRDB Transformation
http://d2rq.org/
Layer on top of RDB
http://any23.apache.org/
Web documents to triples
Raw
Data
CMS
Other
Raw DataCatalog
Web of Documents
Web of Data
Semantically Enrich your Web presence
Web of DocumentsInline markup of HTML documentsEmbedded metadata within HTML documentsRaw metadata accompanying the HTML document
Web of DataOpen Data
raw data AND data as APIsraw data with metadata (data catalogues)
Linked Open Data
Open Tourism! Barriers and Solutions to Open Tourism Data
Panel discussion
@rafke @mindwraps #opentourism
Marc
@mportier
Anastasia
@natadimou
Laurens
@Laurens_d_v
Raf
@rafke
Veronique
@VeroniqueCosse
Wouter
@mindwraps
join Open Tourism working group!
#opentourism
raf{dot}buyle{at}okfn{dot}be
A world where knowledge creates power for the many, not the few.
A world where data frees us — to make informed choices about how we live, what we buy and who gets our vote.
A world where information and insights are accessible — and apparent — to everyone.
This is the world we choose
#openbelgium15 #opentourism