Project - Universidade NOVA de Lisboa
ctp.di.fct.unl.pt/~jmag/ws/Project.pdf


Project Web Search


Searching trending topics

• Answering some queries with a static list of documents does not provide the full picture.

• But…
  • The Web is highly dynamic.
  • Information occurs in cascades.

• In this project we will target the problem of searching developing stories in Web data.


How to summarize search results?


Setting

• The user submits a story topic and the corresponding story segments.

• The system must return the sequence of documents that best match the submitted story topic segments.

[Figure: news story topic (T1); story segments (S1 S2 S2 ... S4); story segments candidate media (social-media images and video)]


Applications

• Search results presented as a wiki page

• Search results provide a summary

• Provide illustrations for news

• Infer the storyline from user-generated content (UGC)

• Learn the different story branches

• Detect new developments in an event-plot

• Discover event-specific tags


Everyone wants it!!!

Omar Alonso, Vasileios Kandylas, Serge-Eric Tremblay: Automatic Story Evolution Wikification from Social Data. ICWSM 2018: 713-714


Project-based learning

Topic filter

Visual analysis

Temporal analysis

Data story creation

Semantic visual analysis


Step 1: Static retrieval (25%)

• Text retrieval with BoW and named entities.

• Image retrieval with automatic tags.

• Searching by similarity for pseudo-relevance feedback.
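The static retrieval step can be sketched roughly as follows. This is a minimal illustration, not the project's prescribed method: it assumes a TF-IDF bag-of-words model with cosine similarity, and a Rocchio-style query expansion as one possible form of pseudo-relevance feedback. The function names, the parameters `k`, `alpha` and `beta`, and the toy documents are all hypothetical. Named entities or automatic image tags could, under the same assumption, be indexed as extra tokens of each document.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (bag of words)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (term -> weight) per document, plus the IDF table."""
    counts = [Counter(tokenize(d)) for d in docs]
    n = len(docs)
    df = Counter(term for c in counts for term in c)
    idf = {t: math.log((1 + n) / (1 + df_t)) + 1.0 for t, df_t in df.items()}
    return [{t: tf * idf[t] for t, tf in c.items()} for c in counts], idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank(query_vec, doc_vecs):
    """Document indices sorted by decreasing similarity (ties broken by index)."""
    scored = [(cosine(query_vec, v), i) for i, v in enumerate(doc_vecs)]
    return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]


def search_with_prf(query, docs, k=1, alpha=1.0, beta=0.5):
    """First-pass ranking, then a second pass with a Rocchio-style expanded query.

    The top-k documents of the first pass are assumed relevant (pseudo feedback).
    """
    doc_vecs, idf = tfidf_vectors(docs)
    q_vec = {t: tf * idf.get(t, 0.0) for t, tf in Counter(tokenize(query)).items()}
    first_pass = rank(q_vec, doc_vecs)
    expanded = {t: alpha * w for t, w in q_vec.items()}
    for i in first_pass[:k]:
        for t, w in doc_vecs[i].items():
            expanded[t] = expanded.get(t, 0.0) + beta * w / k
    return first_pass, rank(expanded, doc_vecs)


# Toy collection (invented for illustration only).
docs = [
    "Wildfire spreads near the coastal town",
    "Firefighters evacuate residents as flames approach town",
    "Football final ends in a draw",
    "Wildfire flames force evacuation of residents",
]
first_pass, reranked = search_with_prf("wildfire town", docs)
print(first_pass, reranked)
```

The second ranking uses terms borrowed from the top first-pass result, so documents that share vocabulary with it can move up even when they do not contain the original query terms.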

[Figure: news story topic (T1); story segments (S1 S2 S2 ... S4); story segments candidate media (social-media images and video)]


Step 2: Graph representations (25%)

• In this step you ought to use graph representations of your data.

• These graph representations will allow you to navigate your data and search for the optimal sequence of documents.
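One way to read "search for the optimal sequence of documents" is as a best-path problem on a layered graph. The sketch below makes that assumption explicit: each story segment contributes one layer of candidate documents, each candidate carries a relevance score, and each edge between consecutive layers carries a weight such as pairwise similarity. A Viterbi-style dynamic program then picks one candidate per segment. The graph layout, the scores, and the names `best_sequence`, `relevance` and `edges` are hypothetical and not taken from the project statement.

```python
def best_sequence(relevance, transition):
    """Pick one candidate per segment so that relevance plus edge weights is maximal.

    relevance: list of dicts, one per segment, mapping candidate id -> relevance score.
               Every layer is assumed to be non-empty.
    transition: function (previous_candidate, candidate) -> edge weight.
    Returns (total_score, [candidate id for each segment]).
    """
    # best[d] = (score of the best path ending at candidate d, that path)
    best = {d: (score, [d]) for d, score in relevance[0].items()}
    for layer in relevance[1:]:
        new_best = {}
        for d, score in layer.items():
            # Best predecessor for d, taking the edge weight into account.
            prev_score, prev_path = max(
                ((s + transition(p, d), path) for p, (s, path) in best.items()),
                key=lambda item: item[0],
            )
            new_best[d] = (prev_score + score, prev_path + [d])
        best = new_best
    return max(best.values(), key=lambda item: item[0])


# Toy graph (invented values): three segments with two, two and one candidates.
relevance = [
    {"a1": 0.9, "a2": 0.8},
    {"b1": 0.7, "b2": 0.6},
    {"c1": 0.5},
]
edges = {
    ("a1", "b1"): 0.1, ("a1", "b2"): 0.9,
    ("a2", "b1"): 0.2, ("a2", "b2"): 0.1,
    ("b1", "c1"): 0.3, ("b2", "c1"): 0.4,
}
score, path = best_sequence(relevance, lambda p, d: edges.get((p, d), 0.0))
print(score, path)
```

In this toy graph, choosing the most relevant candidate for each segment independently would select a1, b1, c1, whereas the path search prefers b2 for the second segment because it connects more strongly to its neighbours.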

[Figure: news story topic (T1); story segments (S1 S2 S2 ... S4); story segments candidate media (social-media images and video)]


Step 3: Rich embeddings (50%)

• The goal is to use semantic embeddings to find more relevant information.

• Such multimodal embeddings can capture relevant interactions between text and visual data.

• This will enable the discovery of richer information to create the search results.
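As a rough illustration of how embeddings could be used for retrieval, the sketch below ranks candidate media for a story segment by cosine similarity in a shared vector space. It assumes that some multimodal encoder has already mapped both the segment text and each image or video into the same space. The three-dimensional vectors, the file names, and the function `rank_media` are hand-written placeholders, not outputs of any real model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def rank_media(segment_vec, media_vecs):
    """Return media ids sorted by decreasing similarity to the segment embedding."""
    return sorted(
        media_vecs,
        key=lambda name: cosine(segment_vec, media_vecs[name]),
        reverse=True,
    )


# Placeholder embeddings (assumed to come from a shared text-image encoder).
segment_embedding = [0.9, 0.1, 0.0]
media_embeddings = {
    "img_flames.jpg": [0.8, 0.2, 0.1],
    "img_stadium.jpg": [0.0, 0.1, 0.9],
    "video_evacuation.mp4": [0.5, 0.5, 0.0],
}
ranking = rank_media(segment_embedding, media_embeddings)
print(ranking)
```

Because text and media live in the same space under this assumption, the same similarity function serves both text-to-image and image-to-image search.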


Project grading

• Scoring:
  • Implementation correctness 30%
  • Results analysis 30%
  • Critical discussion 40%

• Report:
  • Maximum of 8 pages.
  • No cover page.
  • Must include graphs, tables, etc.

• Report organization:
  • Introduction
  • Algorithms
  • Implementation
  • Evaluation
    • Dataset description
    • Baselines
    • Results analysis
  • Critical discussion
  • References


Q&A?
