BOTTARI: How to offer innovative services listening to the Social Media

Post on 27-Jan-2015

The presentation about BOTTARI that I gave at the Senso Comune workshop.

BOTTARI: How to offer innovative services listening to the Social Media

Emanuele Della Valle

DEI - Politecnico di Milano

emanuele.dellavalle@polimi.it

http://emanueledellavalle.org

Agenda

• Introduction

• Architecture

• Components

• Query Processing

• Evaluation
  – Efficacy
  – Efficiency

• Conclusions

27.02.2012 - Workshop di Senso Comune, Trento, Italy

Introduction

A stream of live opinions

• Micro-posts carry the live opinion of social media users about topics and points of interest.

Introduction

Seoul’s stream of live opinions

• In 2011, an average of three million tweets per day was posted in Seoul.

• A large number of these micro-posts are about restaurants, bars, cafés and many other semi-public points of interest (POIs) in the city.

• Consider the Insadong district, with its more than 200 restaurants.

[Figure: number of tweets per day related to Insadong’s restaurants; restaurants ordered by decreasing frequency of tweets]

Introduction

A commercial/social opportunity?

• If we trust this collective opinion, can novel commercial and social services be built on it?

• The case of BOTTARI

Introduction

What have you seen?

http://www.youtube.com/watch?v=XGOKe_lhSks

• An augmented reality application for personalized recommendation of restaurants in Insadong (Seoul)

Introduction

Yet another ?

• Yes and no!

• Same use case

• BOTTARI listens to the social media

Architecture

Android Client

[Architecture diagram. Labels: out, Query Rewriter, Query Evaluator, Machine Learning, Streaming Processor, Geo Filtering, Geo-spatial KB, Social Media Analyzer]

Architecture

Geo-Spatial KB

[Architecture diagram. Labels: out, Query Rewriter, Query Evaluator, Machine Learning, Semantic Stream Processor, Geo Filtering, Geo-spatial KB, Social Media Analyzer]

Architecture

PUSH Segment – Data Initiated

[Architecture diagram. Labels: out, Query Rewriter, Query Evaluator, Machine Learning, Semantic Stream Processor, Geo Filtering, Geo-spatial KB, Social Media Analyzer, PUSH: Data Initiated]

Architecture

PULL Segment – Query Initiated

[Architecture diagram. Labels: out, Query Rewriter, Query Evaluator, Machine Learning, Semantic Stream Processor, Geo Filtering, Geo-spatial KB, Social Media Analyzer, PULL: Query Initiated]

Architecture

The Entire BOTTARI System

[Architecture diagram. Labels: out, Query Rewriter, Query Evaluator, Machine Learning, Semantic Stream Processor, Geo Filtering, Geo-spatial KB, Social Media Analyzer, PULL: Query Initiated, PUSH: Data Initiated]

Components

SOR - Geo-Spatial KB

Components

Social Media Analyzer

[Flowchart. Labels: Micro-post; Morphologically Analyzable? (Yes / No); Rule based Analysis; Auto generated rules; Learned documents; SVMs; Syllable Kernel; Sentiment of the tweet]

• Precision tests:
  – Auto-generated rules ≈ 70%
  – Manually-coded rules ≈ 90%
  – Syllable kernel ≈ 50~60%

• Our target > 85%
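As a rough illustration of how a precision figure like those above could be computed, the following Python sketch compares predicted sentiment labels with hand-labelled ones. The function name `precision`, the label strings and the sample data are assumptions made for this example; the slides do not say how the tests were actually run.

```python
from typing import Sequence


def precision(predicted: Sequence[str], gold: Sequence[str], label: str = "positive") -> float:
    """Fraction of items predicted as `label` whose gold annotation is also `label`."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    # Gold labels of every item the classifier flagged as `label`.
    flagged = [g for p, g in zip(predicted, gold) if p == label]
    if not flagged:
        return 0.0  # nothing was flagged; reported as 0 by convention in this sketch
    return sum(1 for g in flagged if g == label) / len(flagged)


# Hypothetical sample: four micro-posts, three flagged as positive, two of them correctly.
predicted = ["positive", "positive", "negative", "positive"]
gold = ["positive", "negative", "negative", "positive"]
score = precision(predicted, gold)
meets_target = score > 0.85  # compared against the target stated on the slide
```

On this made-up sample two of the three posts flagged as positive are correct, so the classifier would miss the stated target.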

Components

Semantic Stream Processor

Components

Machine Learning Component

• SUNS (Statistical Unit Node Sets)
  – A machine learning framework for inductive materialization
    • Detects interesting data patterns
    • Predicts RDF triples, i.e., which restaurant a user will tweet positively about
  – Characteristics
    • Capability to deal with sparse, high-dimensional and incomplete data
    • Multivariate latent space based approach
    • Modularized approach for easily integrating contextual information
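SUNS itself is not specified on this slide. Purely to illustrate what a latent space approach to predicting unobserved triples can mean, the Python sketch below scores every (user, restaurant) pair with a rank-1 approximation of a small "tweeted positively about" matrix, computed by power iteration. This is a generic low-rank technique swapped in for illustration, not the SUNS algorithm; the matrix and all names are made up.

```python
import math
from typing import List


def rank_one_scores(likes: List[List[float]], iterations: int = 50) -> List[List[float]]:
    """Score every (user, restaurant) pair with a rank-1 approximation of `likes`.

    Power iteration finds the dominant right singular vector `v` of the matrix;
    `u = likes * v` then gives the matching per-user weights.
    """
    n_users, n_items = len(likes), len(likes[0])
    v = [1.0] * n_items
    for _ in range(iterations):
        u = [sum(row[j] * v[j] for j in range(n_items)) for row in likes]
        v = [sum(likes[i][j] * u[i] for i in range(n_users)) for j in range(n_items)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:
            break  # all-zero matrix: every score stays 0
        v = [x / norm for x in v]
    u = [sum(row[j] * v[j] for j in range(n_items)) for row in likes]
    return [[u_i * v_j for v_j in v] for u_i in u]


# Hypothetical data: rows are users, columns are restaurants.
# 1.0 means "tweeted positively about", 0.0 means "no such tweet observed".
likes = [
    [1.0, 1.0, 0.0],
    [1.0, 1.0, 1.0],
    [0.0, 1.0, 1.0],
]
scores = rank_one_scores(likes)
# scores[0][2] is the score of a triple that was never observed (user 0, restaurant 2).
```

The unobserved pair receives a non-zero score because other users who share tastes with user 0 did tweet positively about restaurant 2; a threshold on such scores plays the role that the probability bound plays in a query.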

Query Processing

An Ontology-based Approach

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments]

Query Processing

An Example Query

SELECT DISTINCT ?poi ?name ?lat ?long ?numPos ?prob
WHERE {
  ?poi a ns:NamedPlace ;
       ns:name ?name ;
       geo:lat ?lat ;
       geo:long ?long .
  # GEO-SPATIAL FILTER
  FILTER (f:within_distance(37.5, 126.9, ?lat, ?long, 200))
  FILTER (f:dest_point_viewing(37.5, 126.9, ?lat, ?long, 90, 200))
  # RECOMMENDATION
  {
    :someUser sioc:creator_of ?tweet .
    ?tweet twd:talksAboutPositively ?poi .
    WITH PROBABILITY ?prob
    ENSURE PROBABILITY [0.5..1)
  }
  # REPUTATION
  ?poi twd:numberOfPositiveTweets ?numPos .
}
ORDER BY DESC(?numPos), ?prob, f:distance(37.5, 126.9, ?lat, ?long)
LIMIT 10
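The filter functions used in the query are not defined on the slide. As a sketch only, assuming that `f:within_distance(lat1, long1, lat2, long2, d)` means "the two points are at most d metres apart" (both the unit and the semantics are assumptions), a great-circle check could look like this in Python. `f:dest_point_viewing` is not covered here.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def distance_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two WGS84 points (haversine formula)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def within_distance(user_lat: float, user_lon: float,
                    poi_lat: float, poi_lon: float, max_m: float) -> bool:
    """True if the POI lies at most `max_m` metres from the user (assumed semantics)."""
    return distance_m(user_lat, user_lon, poi_lat, poi_lon) <= max_m


# The user position is the one in the query; the two POI positions are hypothetical.
near = within_distance(37.5, 126.9, 37.501, 126.9, 200)  # about 0.001 degrees of latitude away
far = within_distance(37.5, 126.9, 37.51, 126.9, 200)    # about 0.01 degrees of latitude away
```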

Query Processing

The Input Received is Rewritten

The issued query is rewritten into three queries, one for each run-time component.

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments]

Query Processing

Each Component Answers

Generates personalized recommendations

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments]

Query Processing

Each Component Answers

Gets trends in restaurants’ social reputation

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments; additional label: HTTP]

Query Processing

Each Component Answers

Gets restaurants close to user location

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments]

Query Processing

Results Are Joined and Returned

Joins results and returns them

[Architecture diagram with the PULL (Query Initiated) and PUSH (Data Initiated) segments]
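The rewrite-and-join flow on the preceding slides can be sketched in Python as follows. The three dictionaries stand in for the answers of the three run-time components; their shapes, the POI identifiers and the function name `join_and_rank` are hypothetical. The ordering mirrors the ORDER BY clause of the example query as written (more positive tweets first, then probability, then distance), and the limit mirrors LIMIT 10.

```python
from typing import Dict, List, NamedTuple


class Recommendation(NamedTuple):
    poi: str
    num_pos: int       # number of positive tweets (reputation)
    prob: float        # predicted probability (recommendation)
    distance_m: float  # distance from the user (geo-spatial filter)


def join_and_rank(nearby: Dict[str, float],
                  reputation: Dict[str, int],
                  recommended: Dict[str, float],
                  limit: int = 10) -> List[Recommendation]:
    """Inner-join the three partial answers on the POI and rank the result."""
    rows = [
        Recommendation(poi, reputation[poi], recommended[poi], dist)
        for poi, dist in nearby.items()
        if poi in reputation and poi in recommended
    ]
    # DESC(?numPos), then ?prob ascending, then distance ascending.
    rows.sort(key=lambda r: (-r.num_pos, r.prob, r.distance_m))
    return rows[:limit]


# Hypothetical partial answers, keyed by POI identifier.
nearby = {"poi:a": 120.0, "poi:b": 80.0, "poi:c": 150.0}
reputation = {"poi:a": 12, "poi:b": 30, "poi:d": 99}
recommended = {"poi:a": 0.7, "poi:b": 0.9, "poi:c": 0.6}
result = join_and_rank(nearby, reputation, recommended)
```

Only POIs present in all three answers survive the join, which is why `poi:c` (no reputation data) and `poi:d` (not nearby) drop out of the result.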

Evaluation

Efficacy – High Accuracy

Evaluation

Efficacy – Remembering 90 Days is Enough

Evaluation

Efficiency – High Throughput

Conclusions

• End-user application

• Attractive and functional interface

• Real-world dynamic data

• Fully based on Semantic Web technologies
  – RDF as common data format between heterogeneous components
  – SPARQL as query language

• Rigorously evaluated
  – Effective
  – High throughput for handling dynamic data

• Commercial Potential

Credits

BOTTARI is joint work with

• Tony Lee, Seonho Kim (SALTLUX)
  – Android Application
  – Social Media Analyzer
  – Tweets and restaurant descriptions for Insadong

• Marco Balduini (PoliMi)
  – Semantic Stream Processor

• Irene Celino, Daniele Dell’Aglio (CEFRIEL)
  – LarKC workflows
  – Ontology

• Volker Tresp, Yi Huang (SIEMENS)
  – Machine Learning component

Thank You! Questions?

Much More to Come! Keep an eye on

http://www.streamreasoning.org

BOTTARI: How to offer innovative services listening to the Social Media

Emanuele Della Valle

DEI - Politecnico di Milano

emanuele.dellavalle@polimi.it

http://emanueledellavalle.org