Project - Universidade NOVA de Lisboactp.di.fct.unl.pt/~jmag/ws/Project.pdfNews story topic Story...

Post on 17-Jun-2020

2 views 0 download

Transcript of Project - Universidade NOVA de Lisboactp.di.fct.unl.pt/~jmag/ws/Project.pdfNews story topic Story...

ProjectWeb Search

1

Searching trending topics

• Answering some queries with a static list of documents does not provide the full picture.

• But… • The Web is highly dynamic.

• Information occurs in cascades.

• In this project we will target the problem of searchingdeveloping stories in Web data.

2

How to sumarize search results?

3

Setting

• The user submits a story topic and the corresponding storysegments.

• The system must to return the sequence of documents thatbest match the submitted story topic segments.

S1 S2 S2

...

News story topic

Story segments

T1

Story segments candidate media

S4

Social-media images and video

4

Applications

• Search results presented as Wikipage

• Search results provide a summary

• Provide illustrations for news

• Infer the storyline from UGC

• Learn the diferent story branches

• Detect new developments in an event-plot

• Discovery of event specific tags

5

Everyone wants it!!!

Omar Alonso, Vasileios Kandylas, Serge-Eric Tremblay: Automatic Story Evolution Wikification from Social Data. ICWSM 2018: 713-714

6

Project-based learning

Topic filter

Visualanalysis

Temporal analysis

Data storycreation

Semanticvisual analysis

7

Step 1: Static retrieval (25%)

• Text retrieval with BoW and named entities.

• Image retrieval with automatic tags.

• Searching by similarity for pseudo relevance feedback.

S1 S2 S2

...

News story topic

Story segments

T1

Story segments candidate media

S4

Social-media images and video

8

Step 2: Graph representations (25%)

• In this step you ought to use graph representations of yourdata.

• These graph representations will allow you to navigate yourdata and search for the optimal sequence of documents.

S1 S2 S2

...

News story topic

Story segments

T1

Story segments candidate media

S4

Social-media images and video

9

Step 3: Rich embeddings (50%)

• The goal is to use semantic embeddings to find more relevant information.

• Such multimodal embeddings can capture relevantinteractions between text and visual data.

• This will enable the discoveryof richer information to createthe search results.

10

Project grading

• Scoring:• Implement. correctness 30%

• Results analysis 30%

• Critical discussion 40%

• Report: • Maximum of 8 pages.

• No cover page.

• Must include graphs, tables, etc.

• Report organization:• Introduction

• Algorithms

• Implementation

• Evaluation• Dataset description

• Baselines

• Results analysis

• Critical discussion

• References

11

Q&A?

12