Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia...

19
Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger

Transcript of Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia...

Page 1: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Relevance feedback using query-logs

Gaurav Pandey

Supervisors:Prof. Gerhard Weikum

Julia Luxenburger

Page 2: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Motivation

Results

Search Engine

Query

“One size fits all”

Page 3: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Motivation

Results

Search Engine

User infoQuery

Page 4: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Motivation

Python

Page 5: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Motivation

PythonCGI code Debugging programming

Page 6: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Usage of Query Logs

Clickthrough data

•Past queries

• Documents clicked

Page 7: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Usage of Query Logs

Query

Clicked Documents

History Instance

Page 8: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Query Reformulation

Result Query: “python information CGI code examples program code debugging bug removal programming”

But,

p(python/query)=? p(CGI)/query)=? p(code)/query)=?

…………………………..

Page 9: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Language ModelNormally(without using history), w:term

d: document q:query

Importance of term w in current query

Considers only the current query

But, not history instances

Page 10: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Language ModelNormally(without using history), w:term

d: document q:query

Importance of term w in current query

Now, using history:

Importance of term w in current query + history instances

?

Page 11: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Language Model+History

Importance of the term w at one instance in the history

Importance of term w in history instances

History query: “CGI code”

Documents: “CGI examples”,

“program code”

History query: “CGI code”

Documents: “CGI examples”, “program

code”

Page 12: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Equal Weighting

Works,but can be improved

Page 13: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Discriminative Weighting

Choose different for every history instance.. How?

Page 14: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Overlap

if a history query has common terms with the current query then

λi= 1,

Else if there is no common term

λi=0

Example:

Current query “python information”

History query:”python code” λi= 1

History query:”world cup” λi= 0

Page 15: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Soft overlap

if a history query has common terms with the current query then

λi= a,

Else if there is no common term

λi=b (a>b)

Example:

Current query “python information”

History query:”python code” λi= 8

History query:”world cup” λi= 2

Page 16: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Decrease with time

Use uniformly decreasing values

If there are n history instances,1 =n2 =n-13 =n-2……n-1 =2n =1

Page 17: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Decrease with time

Use geometrically decreasing values

If there are n history instances,1 =n2 =n/23 =n/3……n-1 =n/(n-1)n =1

Page 18: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Experiment Comparison of the 4 techniques Equal weighting Basic model (without history) Use similar techniques for: Probabilistic model Vector space model

Page 19: Relevance feedback using query-logs Gaurav Pandey Supervisors: Prof. Gerhard Weikum Julia Luxenburger.

Thanks