Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

49
Linked Data at globo.com Semantic Team [email protected] Ícaro Medeiros [email protected] globo.com Web of Linked Entities 2013 WWW 2013 Monday, May 13, 13

Transcript of Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Page 1: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Linked Data at globo.com

Semantic [email protected]

Ícaro [email protected]

globo.comWeb of Linked Entities 2013WWW 2013

Monday, May 13, 13

Page 2: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Who we are

Monday, May 13, 13

Page 3: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

BROADCAST MOVIES PAY TV INTERNET

EVENTS MUSIC

PUBLISHING

NEW VENTURES NEWSPAPERRADIO NETWORK

Monday, May 13, 13

Page 4: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

31.4MM unique visitors/month*

* source: Ibope, 04/13

globo.com

Monday, May 13, 13

Page 5: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Linked Data atglobo.com

Monday, May 13, 13

Page 6: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Semantic Webteam

Monday, May 13, 13

Page 7: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Organize and distribute all content produced by Organizações Globo

Mission

Monday, May 13, 13

Page 8: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Ontologies

Monday, May 13, 13

Page 9: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Base

Current scenario

news sports gossip tv

Monday, May 13, 13

Page 10: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Upper

Person Place Organization

Music

Politics

Programme Education

Sports

Future scenario

Monday, May 13, 13

Page 11: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Annotation tool

Monday, May 13, 13

Page 12: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Technologies

Monday, May 13, 13

Page 13: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Embedded into our existing CMSs

Web CMS

Video publishing system (developed in-house)

Common UX for content producers

Interface adapts itself to ontology

Annotations stored in Virtuoso triple store

Monday, May 13, 13

Page 14: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Monday, May 13, 13

Page 15: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Interface follows the ontology

Fields

Search ranges

Suggest as you type

Automatic entity extraction

Monday, May 13, 13

Page 16: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Automatic pagegeneration

Monday, May 13, 13

Page 17: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 18: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 19: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 20: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 21: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 22: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

globoesporte.com

Monday, May 13, 13

Page 23: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Brainiak:Linked data RESTful API

Monday, May 13, 13

Page 24: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

triple store

Legacy architecture

process queue

suggest

annotation

entity extractor

search engineAPI

suggest

annotation

entity extractor

entity extractor

suggest

annotation

annotation

suggest entity extractor

CMS

CDA

Monday, May 13, 13

Page 25: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

New architecture

triple store

search engine

BRAINIAK

API

Monday, May 13, 13

Page 26: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Linked data made simple!

Goal

Monday, May 13, 13

Page 27: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Authorization and authentication

Reduce need of writing SPARQL queries

Single point of access to the triplestore

Requirements

Data management quality (deduplication, validation, integration with external datasets, etc)

Monday, May 13, 13

Page 28: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

RESTful hypermedia API

Requirements

Performance enhancement - caching

Monday, May 13, 13

Page 29: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Technologies

Monday, May 13, 13

Page 30: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Main concepts

Monday, May 13, 13

Page 31: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

ContextIsolated dataspace (graph)

Sports

Context

Monday, May 13, 13

Page 32: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

SchemaDefinition of a data type(i.e. definition of a class)

Sports

Team

Context

Schema

Monday, May 13, 13

Page 33: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Sports

Team

Schema

Instance

Collection

Barcelona

Instance

CollectionTeam

Context

Monday, May 13, 13

Page 34: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Hypermedia API

Monday, May 13, 13

Page 35: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Content negotiation

Decoupling server and client side

Access Simplicity (don’t remember URLs, navigate)

Hypermedia

Application is treated as a state machine

Monday, May 13, 13

Page 36: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

self

/sports/Team/Barcelona

/sports/Team

inCollection

item

createdeletereplace

Relations

Monday, May 13, 13

Page 37: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Services

Monday, May 13, 13

Page 38: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Context List contexts

Collections List collections (of a context)

Services

Schemas Retrieve schema of a class (collection)

Monday, May 13, 13

Page 39: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

List instances (same type)

Retrieve instance

Instances

Create instance

Update instance

Delete instanceMonday, May 13, 13

Page 40: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

{ "@context": { "rdfs": "http://www.w3.org/2000/01/rdf-schema#" "sports": "http://semantica.globo.com/sports/", "upper": "http://semantica.globo.com/upper/", }, "@id": "http://semantica.globo.com/sports/Team/Barcelona", "@type": "sports:Team", "rdfs:label": "Barcelona", "upper:fullName": "Futbol Club Barcelona", "upper:acronym": "BARCELONA"}

/sports/Team/BarcelonaGET

Monday, May 13, 13

Page 41: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Filtering instances

Monday, May 13, 13

Page 42: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Filtering instances

SELECT * FROM <http://semantica.globo.com/sports/>WHERE { ?s a sports:Team . ?s rdfs:label “Barcelona” .}

Monday, May 13, 13

Page 43: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Filtering instances

Monday, May 13, 13

Page 44: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

/sports/Team?p=rdfs:label&?o=Barcelona

Filtering instances

Monday, May 13, 13

Page 45: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Dealing with legacy and external graphs

/sports/&graph_uri=dbpedia:sports

&class_uri=dbpedia-ont:Team

Monday, May 13, 13

Page 46: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Brainiak will be open sourced next Monday at

1st Globo Semantic Day

Join now for free:

bit.ly/semantic_day_globo

Follow us on github:github.com/globocom

Monday, May 13, 13

Page 47: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

How we seethe future

Monday, May 13, 13

Page 48: Linked data at globo.com - Web of Linked Entities (WoLE 2013) - WWW 2013

Inference-based navigation

SEO (automatic schema.org)

Richer content (e.g. timelines), frequent automatic updates

Better annotation suggestion (DBpedia Spotlight)

Linked with open data (DBPedia, dados.gov.br)

Monday, May 13, 13