Domain Dependent Query Reformulation for Web Search


DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH

Date : 2013/06/17
Author : Van Dang, Giridhar Kumaran, Adam Troy
Source : CIKM’12
Advisor : Dr. Jia-Ling Koh
Speaker : Yi-Hsuan Yeh


Outline

− Introduction
− Method
− Experiment
− Conclusions


Introduction (1/5)

Query reformulation techniques aim to modify user queries to make them more effective for retrieval.

Identify candidate terms that are similar to the words in the query, then expand or substitute the original query terms.


Introduction (2/5)

Existing work :
− Spelling correction
− Stemming, which expands the query with morphological variants
− Looking for candidates that are semantically related to the query terms

However, none of these techniques treat the task as domain dependent.


Introduction (3/5)- Motivation

An expansion that helps for general web queries can hurt within a specific domain: for commerce queries, expanding ‘flat screen tv’ to ‘flat screen tv television’ drastically hurts NDCG.


Introduction (4/5)- Goal

Provide different candidate terms for queries in different domains.

Build different reformulation systems that provide substantially better retrieval effectiveness than a single system handling all queries.


Introduction (5/5)- Framework

Query logs → Pseudo parallel corpus → Translation model → Filter out bad candidates (classifier) → Reformulation model (domain-specific) → New queries


Outline

− Introduction
− Method
  − Pseudo parallel corpus
  − Translation model
  − Reformulation model
  − Candidate query generation
− Experiment
− Conclusions


Pseudo parallel corpus

Parallel pair : <Query, Translation>
1. Consecutive query pairs
2. Query and clicked document title
3. Query suggestion via Random Walk

(Diagram: two queries q1 and q2 linked through a clicked document D1.)
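All three sources above yield <query, translation> pairs. A minimal sketch of assembling such a corpus from a query log, assuming hypothetical `sessions` and `clicks` structures standing in for the raw log data:

```python
def build_pseudo_parallel_corpus(sessions, clicks):
    """Collect <query, translation> pairs from a query log.

    sessions: list of query lists, each in chronological order (hypothetical)
    clicks:   list of (query, clicked_document_title) pairs (hypothetical)
    """
    pairs = []
    # 1. Consecutive query pairs issued within the same session
    for session in sessions:
        for q, q_next in zip(session, session[1:]):
            if q != q_next:
                pairs.append((q, q_next))
    # 2. Query paired with the title of a clicked document
    for query, title in clicks:
        pairs.append((query, title))
    # 3. Query suggestions obtained via a random walk on the click graph
    #    would be appended here in the same <query, translation> form.
    return pairs
```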


Translation model (1/3)

Aim to provide multiple semantically related candidates as “translations” for any given query term.

The translation probability indicates similarity between the term and candidate.


Translation model (2/3)

Expectation Maximization algorithm

Identify the most probable alignments for all parallel pairs.

S : source sentence
T : correct translation
a = {a1, a2, …, am} : an alignment between S and T
Tr(tj | s(aj)) : translation probability
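The slide names only the EM algorithm and this alignment notation; a minimal IBM Model 1 style sketch for estimating Tr(t | s) from the pseudo parallel pairs, assuming each pair is already tokenized into word lists, could look like this:

```python
from collections import defaultdict

def train_translation_model(pairs, iterations=10):
    """EM estimation of Tr(t | s) from (source_words, target_words) pairs."""
    target_vocab = {t for _, tgt in pairs for t in tgt}
    tr = defaultdict(lambda: 1.0 / len(target_vocab))  # uniform start
    for _ in range(iterations):
        count = defaultdict(float)  # expected counts c(t, s)
        total = defaultdict(float)  # expected counts c(s)
        for src, tgt in pairs:
            src = ["NULL"] + src    # NULL lets target words stay unaligned
            for t in tgt:
                # E-step: spread t's alignment probability over source words
                norm = sum(tr[(t, s)] for s in src)
                for s in src:
                    p = tr[(t, s)] / norm
                    count[(t, s)] += p
                    total[s] += p
        # M-step: re-estimate the translation probabilities
        for (t, s), c in count.items():
            tr[(t, s)] = c / total[s]
    return tr
```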


Translation model (3/3)- Enhanced by learning

Train a boosted decision tree classifier that decides whether a <term, candidate> pair is desirable.
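The slides do not list the classifier's feature set, so the sketch below is only illustrative: it uses scikit-learn's gradient boosted trees with made-up features (translation probability, edit distance, co-occurrence count) standing in for whatever the real pipeline computes per <term, candidate> pair.

```python
from sklearn.ensemble import GradientBoostingClassifier

# Hypothetical feature vectors for <term, candidate> pairs:
# [translation probability, character edit distance, co-occurrence count]
X_train = [
    [0.42, 8, 130],  # e.g. ("tv", "television"), labeled desirable
    [0.03, 7, 4],    # e.g. ("tv", "schedule"),  labeled undesirable
]
y_train = [1, 0]

# Boosted decision trees that keep or reject candidate "translations"
clf = GradientBoostingClassifier(n_estimators=100, max_depth=3)
clf.fit(X_train, y_train)

keep = clf.predict([[0.35, 3, 80]])  # 1 = keep the candidate, 0 = filter out
```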


Reformulation model (1/4)

Given a query q = w1, ..., wi−1, wi, wi+1, ..., wn and a candidate s for the query term wi, the reformulation model determines whether or not to accept this candidate.

Consider:
1. How similar s is to wi
2. How well s fits the context of the query


Reformulation model (2/4)


Reformulation model (3/4)

G : L1, L2, R1, R2

C(s) : the set of words that occur in the context of s
Collection : all original and “translation” queries

The probability of seeing w (query term) in the context of s (candidate)

The probability of seeing w in the entire collection
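The slide describes these two quantities in words only; a hedged maximum-likelihood reading (an assumption, not necessarily the paper's exact estimators) would be:

```latex
P(w \mid C(s)) = \frac{\mathrm{count}(w,\, C(s))}{\sum_{w'} \mathrm{count}(w',\, C(s))}
\qquad
P(w \mid \mathrm{Collection}) = \frac{\mathrm{count}(w,\, \mathrm{Collection})}{\sum_{w'} \mathrm{count}(w',\, \mathrm{Collection})}
```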


Reformulation model (4/4)- Domain-specific model

The domain-specific model is estimated from the domain-specific log and smoothed with estimates from the generic log.
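A hedged sketch of the smoothing described above, assuming simple linear interpolation and assuming the reported parameter β is the mixing weight between the two logs:

```latex
P(w \mid C(s)) = \beta \, P_{\mathrm{domain}}(w \mid C(s)) \;+\; (1 - \beta) \, P_{\mathrm{generic}}(w \mid C(s))
```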


Candidate query generation

Each candidate sij for the query term wi is accepted if and only if:

Expand the original queries with these accepted candidates.

θ = 0.9
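Treating the acceptance test as an opaque predicate, the expansion step might look like the sketch below, where accept is a hypothetical stand-in for the slide's thresholded criterion (θ = 0.9):

```python
def expand_query(query_terms, candidates, accept):
    """Expand a query with accepted candidate terms.

    query_terms: original terms w1 ... wn
    candidates:  dict mapping each wi to its candidate list [si1, si2, ...]
    accept:      hypothetical predicate implementing the slide's
                 acceptance test with threshold theta = 0.9
    """
    expanded = list(query_terms)
    for i, w in enumerate(query_terms):
        for s in candidates.get(w, []):
            if accept(query_terms, i, s):  # keep only accepted candidates
                expanded.append(s)
    return " ".join(expanded)

# e.g. expand_query(["flat", "screen", "tv"], {"tv": ["television"]}, accept)
```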


Outline

− Introduction
− Method
− Experiment
− Conclusions


Experiment (1/6)

Query set
− Web query log
− Health
− Commerce

Reformulation approach
− Baseline: noalter (without reformulation)
− Generic (domain independent)
− Health
− Commerce

Model parameters: λ = 0.9, β = 0.9, and θ = 0.9.


Experiment (2/6)- Translation model filtering: effectiveness


Experiment (3/6)- Generic vs. domain specific


Experiment (4/6)- Reason for improvement

1. The domain system eliminates ineffective general candidates provided by the generic model. (Reject)

2. The domain system provides additional domain-specific candidates. (Add)


Experiment (5/6)- Domain-specific training data


Experiment (6/6)- Failure analysis

1. The domain model misses several good candidates.

2. The domain system introduces bad candidates that are not provided by the generic system.


Outline

− Introduction
− Method
− Experiment
− Conclusions


Conclusions (1/2)

Demonstrate the advantage of domain-dependent query reformulation over the domain-independent approach.

Using the same reformulation technique, both reformulation systems for the health and commerce domains outperform the generic system that learns from the same log.


Conclusions (2/2)- Future work

1. Higher-order contextual n-gram models

2. Consider the relationship among candidates

3. Train the boosted decision tree classifier with various features