Personalizing Search


Jaime Teevan, MIT

Susan T. Dumais, MSR

and Eric Horvitz, MSR

Query: "pia workshop"

[Screenshot: web results for the query, with the result relevant to this particular user marked as the relevant result]

Outline

Approaches to personalization
The PS algorithm
Evaluation
Results
Future work

Approaches to Personalization

Content of user profile
Long-term interests: Liu, et al. [14], Compass Filter [13]
Short-term interests: Query refinement [2,12,15], Watson [4]

How user profile is developed
Explicit: Relevance feedback [19], query refinement [2,12,15]
Implicit: Query history [20, 22], browsing history [16, 23]

PS: a very rich, implicitly built user profile

PS Search Engine

[Diagram: the query is issued to the PS search engine, which scores results against the user's personal index. The index is sketched as clusters of related terms (e.g., "dog cat monkey banana food", "baby infant child boy girl", "forest hiking walking gorp", "csail mit artificial research robot", "web search retrieval ir hunt"), and each result on the search results page receives a personalized score (e.g., 1.3).]
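Architecturally, this is client-side re-ranking: the query goes to the ordinary web search engine, and the returned results are re-scored against the user's personal index before display. A minimal sketch of that loop is below; fetch_web_results and score_against_profile are placeholder names, not the actual PS components.

```python
def rerank(query, fetch_web_results, score_against_profile):
    """Client-side personalization: fetch normal web results for the query,
    then reorder them by a score computed from the user's personal index."""
    results = fetch_web_results(query)   # e.g., the top ~50 results with snippets
    return sorted(results, key=score_against_profile, reverse=True)
```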

Calculating a Document's Score

Based on standard tf.idf:

Score = Σi tfi * wi

With corpus ("World") statistics alone, the term weight is the usual idf:

wi = log(N / ni)

where N is the number of documents in the corpus and ni is the number of documents containing term i.

PS instead uses the relevance-feedback term weight†, treating the user's personal index (the "Client") as the set of relevant documents:

wi = log [ (ri + 0.5)(N' - ni' - R + ri + 0.5) / ((ni' - ri + 0.5)(R - ri + 0.5)) ]

where R is the number of documents in the personal index, ri is the number of those containing term i, and the World counts are augmented with the Client's: N' = N + R, ni' = ni + ri.

† From Sparck Jones, Walker and Robertson, 1998 [21].

[Diagram: the example result "web search retrieval ir hunt" receives the score 1.3 as the sum of per-term tfi * wi contributions.]
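The score is thus a sum of term frequencies times relevance-feedback weights, with the user's personal index supplying the "relevant" counts. A minimal sketch under those assumptions is below; the function names and the document-frequency dictionaries (corpus_df, user_df) are illustrative, not part of the PS implementation.

```python
import math
from collections import Counter

def term_weight(N, n_i, R, r_i):
    """Relevance-feedback term weight (Sparck Jones, Walker & Robertson, 1998).

    N, n_i describe the corpus ("World"); R, r_i describe the user's personal
    index ("Client"). As on the slide, the corpus counts are augmented with
    the personal index: N' = N + R, n_i' = n_i + r_i.
    """
    N_prime = N + R
    n_prime = n_i + r_i
    numerator = (r_i + 0.5) * (N_prime - n_prime - R + r_i + 0.5)
    denominator = (n_prime - r_i + 0.5) * (R - r_i + 0.5)
    return math.log(numerator / denominator)

def personalized_score(doc_terms, corpus_df, N, user_df, R):
    """Score = sum over the document's terms of tf_i * w_i."""
    term_freqs = Counter(doc_terms)
    return sum(tf_i * term_weight(N, corpus_df.get(t, 0), R, user_df.get(t, 0))
               for t, tf_i in term_freqs.items())
```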

Finding the Parameter Values

Corpus representation (N, ni): How common is the term in general? Web vs. result set

User representation (R, ri): How well does it represent the user's interest? All vs. recent vs. Web vs. queries vs. none

Document representation: What terms to sum over? Full document vs. snippet
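For illustration only, the parameter grid on this slide could be enumerated as follows; the option names simply mirror the slide, and this simplified grid yields 20 combinations rather than the 67 explored in the study, which included further variants not listed here.

```python
from itertools import product

# Parameter dimensions from the slide (labels are illustrative identifiers).
CORPUS_REPRESENTATION = ["web", "result_set"]                        # source of N, ni
USER_REPRESENTATION = ["all", "recent", "web", "queries", "none"]    # source of R, ri
DOCUMENT_REPRESENTATION = ["full_document", "snippet"]               # terms to sum over

combinations = list(product(CORPUS_REPRESENTATION,
                            USER_REPRESENTATION,
                            DOCUMENT_REPRESENTATION))
print(len(combinations), "combinations in this simplified grid")     # 20
```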

Building a Test Bed

15 evaluators x ~10 queries each (131 queries total)

Personally meaningful queries: selected from a list, or queries issued earlier (kept in a diary)

Each evaluator rated 50 results per query: highly relevant / relevant / irrelevant

Index of personal information built for each evaluator

Evaluating Personalized Search

Measure algorithm quality with DCG:

DCG(i) = Gain(i), if i = 1
DCG(i) = DCG(i-1) + Gain(i)/log(i), otherwise

Look at one parameter at a time: 67 different parameter combinations! Hold the other parameters constant and vary one.

Look at the best parameter combination and compare with various baselines.
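A direct transcription of that recurrence, assuming base-2 logarithms and gains of 2 / 1 / 0 for highly relevant / relevant / irrelevant results (the slide leaves both the log base and the gain values unspecified):

```python
import math

def dcg(gains):
    """DCG(i) = Gain(i) if i = 1, else DCG(i-1) + Gain(i)/log(i)."""
    total = 0.0
    for i, gain in enumerate(gains, start=1):
        # The first position is a special case because log(1) = 0.
        total += gain if i == 1 else gain / math.log2(i)
    return total

# Example: ranked ratings mapped to gains (highly relevant=2, relevant=1, irrelevant=0).
print(dcg([2, 1, 0, 1]))   # 2 + 1/1 + 0 + 1/2 = 3.5
```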

Analysis of Parameters

[Bar chart: DCG (roughly 0.27 to 0.35) for each option within each parameter dimension. Corpus: Full text, Web, Snippet. User: None, Query, Web, Recent, All. Document: Snippet, Full text.]

PS Improves Text Retrieval

[Bar chart, DCG: no model 0.37, relevance feedback 0.41, Personalized Search 0.46.]

Text Features Not Enough

[Bar chart, DCG: no model 0.37, relevance feedback 0.41, PS 0.46, Web ranking 0.56.]

Take Advantage of Web Ranking

[Bar chart, DCG: no model 0.37, relevance feedback 0.41, PS 0.46, Web 0.56, PS+Web combination 0.58.]
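The slides do not say how the PS and Web rankings are merged for the PS+Web combination; the sketch below shows one simple possibility, a weighted blend of the two rank positions, purely as an illustration rather than the authors' method.

```python
def combine_rankings(web_ranked, ps_score, alpha=0.5):
    """Blend the Web rank with the rank induced by the personalized score.
    `web_ranked` is the result list in the search engine's order (results are
    assumed hashable, e.g., URL strings); `ps_score` maps a result to its
    personalized score; a lower blended rank is better."""
    ps_order = sorted(web_ranked, key=ps_score, reverse=True)
    ps_rank = {r: i for i, r in enumerate(ps_order)}
    web_rank = {r: i for i, r in enumerate(web_ranked)}
    return sorted(web_ranked,
                  key=lambda r: alpha * web_rank[r] + (1 - alpha) * ps_rank[r])
```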

Summary

Personalization of Web search
Result re-ranking
User's documents as relevance feedback

Rich representations important
Rich user profile particularly important
Efficiency hacks possible
Need to incorporate features beyond text

Further Exploration

Improved non-text components
Usage data
Personalized PageRank

Learn parameters
Based on individual
Based on query
Based on results

UIs for user control

User Interface Issues

Make personalization transparent
Give the user control over personalization

Slider between Web and personalized results
Allows for background computation

Exacerbates the problem of re-finding: results change as the user model changes
Thesis research: Re:Search Engine

Thank you!

teevan@csail.mit.edu

sdumais@microsoft.com

horvitz@microsoft.com

Much Room for Improvement

Group ranking: best improves on Web by 23%; with more people, less improvement

Personal ranking: best improves on Web by 38%; remains constant as more people are added

[Line chart: DCG (roughly 0.8 to 1.05) vs. number of people (1 to 6) for personalized and group ranking; the gap between the curves is the potential for personalization.]

Evaluating Personalized Search

Query selection: chose from 10 pre-selected queries, or a previously issued query

Pre-selected: cancer, Microsoft, traffic, …
Evaluator-specific (e.g., Joe, Mary): bison frise, Red Sox, airlines, …; Las Vegas, rice, McDonalds, …

53 pre-selected (2-9 per query); total: 137

Making PS Practical

Learn most about personalization by deploying a system

Best algorithm reasonably efficient
Merging server and client

Query expansion: get more relevant results into the set to be re-ranked

Design snippets for personalization
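As a sketch of the query-expansion idea above, one could append a few high-weight profile terms to the query before it is sent to the search engine; the selection heuristic here is an assumption, not the authors' method.

```python
def expand_query(query, profile_weights, k=2):
    """Append the k highest-weight user-profile terms not already in the query.
    `profile_weights` maps terms to weights (e.g., the wi defined earlier)."""
    query_terms = set(query.lower().split())
    extras = [t for t, _ in sorted(profile_weights.items(), key=lambda kv: -kv[1])
              if t not in query_terms][:k]
    return query if not extras else query + " " + " ".join(extras)

# Example with made-up weights
print(expand_query("pia workshop", {"retrieval": 2.7, "ir": 1.6, "search": 1.3}))
```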