Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data

Post on 16-Feb-2017

132 views 0 download

Transcript of Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data

Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data

Laurens De VochtChristian Beecks, Ruben Verborgh, Erik Mannens, Thomas Seidl, Rik Van de Walle

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

BA

?

How to consistently improve and tailor existing pathfinding approaches? [pathfinding]

How well do heuristics effect user expectations so users are able to discover feeling confident about the story facts relevance? [serendipity]

Is semantic distance between facts a good criterion for optimizing the paths forming a story? [user judgments]

6

- trivial

randomness -

familiarity

surprise

sense making

+ discovery

Serendipity

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

8

Original Core Algorithm A* based

A*

h =Jaccard

Distance

w = Common

Node Degree

Optimizations

9

Improved Algorithm Wraps Core

Algorithm

h

w

Domain Delineation

Iterative Refinementto increase semantic relatedness

10

Heuristics [h]

Jaccard

NormalizedDBpedia Distance

Confidence

11

Weights [w]

Jaccard

Jiang-ConrathDistance (JCW)

Common NodeDegree (CND)

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

13

Semantic Distance0.62 via Physics 0.45 via Hume

EinsteinNewton

Physics

Hume

:influences

:discipline

:birthPlace :deathPlace

Semantic Distances

14

Normalized Web Search Distancee.g. Google Distance, Bing Distance…

15

Motivating Example

16

17

18

Semantic Distances (continued)

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

20

Serendipity – Semantic Distance

21

Serendipity – User Judgments

22

Serendipity – User Judgments

Least agreement (high standard deviation): Carl Linnaeus and Albert Einstein [JCWJaccard]Carl Linnaeus and Baruch Spinoza are Expert, Intellectual and Scholar Baruch Spinoza’s and Albert Einstein’s are both Pantheists Intellectuals and Jewish Philosophers

Most relevant and consistent: Charles Darwin and Carl Linnaeus [CNDJaccard]Copley Medal’s the award of Alfred Russel Wallace and Charles Darwin Alfred Russel Wallace’s and Charles Darwin’s awards are Royal Medal and Copley Medal Alfred Russel Wallace and Charles Darwin are known for their Natural selection Carl Linnaeus and Alfred Russel Wallace have as subject ‘Fellows of the Royal Society’ Carl Linnaeus and Alfred Russel Wallace are Biologists and Colleagues

BA

Introduction

Pathfinding

Semantic Distance

Evaluation

Conclusions & Next Steps

24

Conclusions

Reducing the number of arbitrary resources/facts revealed for a story.

Dbpedia example: telling a story with better link estimation, in cases where the original algorithm did not make optimal choices of links.

The most consistent output was generated with the Jaccard distance used both as weight and heuristic; or as heuristic in combination with the Jiang-Conrath distance as weight.

The most arbitrary facts occur in a story when using the combined node degree as weight with the Jaccard distance as heuristic, both in the optimized and the original algorithm.

User judgments confirm the findings for the Jiang-Conrath weight, original algorithm and for the Jaccard distance used as weight and heuristic in terms of discovery.

25

Validate the correlation between the effect of the link estimation on the arbitrariness as perceived by users and computational semantic relatedness measures such as SemRank.

Measure the scalability of the approach by implementing the algorithms: (i) solely on the client, (ii) completely on the sever, and (iii) in a distributed client/server architecture.

Next Steps

Additional questions?@laurens_d_vlaurens.devocht@ugent.behttp://slideshare.net/laurensdv