Building a Click Model - From Idea to...
Transcript of Building a Click Model - From Idea to...
![Page 1: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/1.jpg)
Building a Click ModelFrom Idea to Implementation
Yiqun LIUDepartment of Computer Science and Technology
Tsinghua University
![Page 2: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/2.jpg)
•A simple solution: user click = relevance voting•Works well for most navigational queries (e.g. SIGIR15)
•Problem: position bias•Users tend to click more on higher-‐ranked results
User Behavior & Search Result Ranking
Courtesy of http://hubdesignsmagazine.com/2011/03/27/its-good-to-be-on-the-first-page-of-google/
![Page 3: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/3.jpg)
Result Examination: Click models
•Examination Hypothesis (Richardson et al., 2007)
•Estimating examination probabilities•Cascade model: •Dependent click model (DCM):
•User browsingmodel (UBM):
•Other models: DBN, CCM, TCM...
![Page 4: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/4.jpg)
From Idea to Implementation
•Click model assumptions• Ideal case (homogeneous results): Position is the only factor that affects behavior•Problem: results have different presentation styles• Ideal case (cascade assumption): Results are examined from top to bottom•Problem: results are often skipped or revisited
![Page 5: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/5.jpg)
Research Questions
•Presentation bias problem: a click model incorporating vertical results•Non-‐sequential examination problem: a click model incorporating revisiting behaviors
![Page 6: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/6.jpg)
Heterogeneous Search Results
•Vertical results are everywhere (over 80% SERPs)
Encyclo-pediaVertical
![Page 7: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/7.jpg)
Vertical-‐aware Click Model
Rank 1st Rank 3rd Rank 5th
•Attractiveness Effect (Wang et al., 2013; Liu et al., 2015)•Certain verticals draw much attention from users
![Page 8: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/8.jpg)
Vertical-‐aware Click Model
•Cut-‐off Effect (Liu et al., 2015)•After users have viewed on-‐topic verticals, they are more likely to decrease their visual attention on theorganic results which are below verticals.RelevantVertical Textual Encyclo-
pedia Image-only Application-download News
OrganicVertical 30.13% 16.70% 8.44% 13.04% 22.61%
Diff -12.95% -51.74%* -75.62%** -62.32%** -34.68%
OrganicVertical 26.30% 19.27% 10.33% 6.21% 38.69%
Diff 4.09% -23.76% -59.10%* -75.44%* 53.09%
Position = 334.61%
Position = 525.27%
![Page 9: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/9.jpg)
Vertical-‐aware Click Model
•Restart effect (Wang et al., 2013)•Most users (70%) will restart from the beginning to continue
![Page 10: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/10.jpg)
Vertical-‐aware Click Model
•Model Construction (based on UBM)
𝑃 𝐶# = 1 𝐸# = 0 = 0𝑃 𝐶# = 1 𝐸# = 1 = 𝑃(𝐴# = 1|𝐸# = 1)𝑃 𝐸# = 1 𝐹 = 0,𝐶.:#0. = 𝛾#,#023𝑃 𝐴# = 1|𝐸# = 1,𝐹 = 0 = 𝛼5,#
𝑃 𝐹 = 1 = 𝜙78,28
𝑃 𝐸# = 1 𝐹 = 1,𝐶.:#0. = 𝛾#,#023 + 𝜃5,#𝑃 𝐴# = 1|𝐸# = 1,𝐹 = 1 = 𝛼5,# + 𝛽5,#
𝑃 𝐵 = 1|𝐹 = 0 = 0𝑃 𝐵 = 1|𝐹 = 1 = 𝜎78,28
Effect on Examination
Effect on Click-‐through
Restart effect
Original UBM
Users examine vertical results at first
Simplified case: difficult to quantify the effect when not all results are affected
![Page 11: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/11.jpg)
Vertical-‐aware Click Model
•Experimental results•About 300,000 queries and 11,000,000 sessions collected from a major Chinese search engine
Perplexity UBM VCM VCM ImprovementText vertical 1.2266 1.2139 +5.58%
Multimedia vertical 1.3735 1.3071 +17.78%Application vertical 1.1908 1.1601 +16.09%
UBM VCM VCM ImprovementText vertical -‐2.9093 -‐2.7968 +11.90%
Multimedia vertical -‐4.1142 -‐3.8638 +28.44%Application vertical -‐2.2671 -‐2.1427 +13.24%
Click/skip perplexity
Log-‐likelihood
![Page 12: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/12.jpg)
Possible Future Directions
•How to rank items within vertical blocks?•An example vertical block composed of an image and several hyperlinks (e.g. news, Q&A portal results, etc. )
•Position bias: only for the top result•Attractiveness effect: not so significant
Title�
ImageItem1�
Item2-5�
0�
1� 2�
3�13.53%
9.74%
39.11%
9.08% 9.74% 10.73% 8.09%
0%
10%
20%
30%
40%
50%
60%
70%
Title Image Item1 Item2 Item3 Item4 Item5
news
![Page 13: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/13.jpg)
Research Questions
•Presentation bias problem: a click model incorporating vertical results•Non-‐sequential examination problem: a click model incorporating revisiting behaviors
![Page 14: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/14.jpg)
Sequential order of Examination/Clicking
•Sequential examination happens a lot•Cascade assumption: Users tend to examine results from top to bottom
•Mean time of arrival v.s. result ranking position
Joachims et.al, Eye-tracking analysis of user behavior in www search. SIGIR 2005
![Page 15: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/15.jpg)
Non-‐sequential Examination Problem
• Revisiting behavior cannot be ignored•Chinese search engine (Sogou): 27.9% sessions•Non-‐Chinese search engine (Yandex): 30.4% sessions
Danqing Xu, Yiqun Liu, et al. Incorporating Revisiting Behaviors into Click Models. WSDM 2012
![Page 16: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/16.jpg)
Non-‐sequential Examination Problem
•Depth-‐first strategy (cascade assumption)•Users examine search result lists from top to bottom•Accepted by most existing click models
1
23
45678
1 2 3 6 2S E
1 6 2
Examine(unobserved)Click(observed)
Reorganize data with cascade assumption
1 62
Problem#1: not the true last clickProblem#2: sequential information is missing
![Page 17: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/17.jpg)
Non-‐sequential Examination Problem
•Why is it difficult to retain sequential information1 52
1 2 3 4 5S E
1 23
1 2 3 4 2S E
1 2 3 2S E
1 3 2S E
LongRevisit
ShortRevisit
Skip and revisit
Cascade assumption
Retaining Sequential Information
![Page 18: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/18.jpg)
Non-‐sequential Examination Problem
•Any rules in users’ non-‐sequential examination?•How often do users change the direction of examination between clicks?click examine
1
2
1
2
3
1
2
1
2
3
4
click examine
![Page 19: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/19.jpg)
Non-‐sequential Examination Problem
•Any rules in users’ non-‐sequential examination?•How far do users’ eye gazes jump after examining the current clicked result? click examine
1
2
1
2
3
1
2
1
2
3
4
5
![Page 20: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/20.jpg)
Non-‐sequential Examination Problem
•Locally Unidirectional Examination•Users tend to examine search results in a single direction without changes between clicks
![Page 21: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/21.jpg)
Non-‐sequential Examination Problem
•Non First-‐order Examination•Users may skip a few results and examine a result at some distance away from the current one after clicks
![Page 22: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/22.jpg)
Non-‐sequential Examination Problem
•Locally Unidirectional Examination Assumption •Between adjacent clicks, user’s examination direction is strictly consistentwith click direction (↑ or ↓).•Compared with position based models: Allow non-‐sequential click/examination sequence•Compared with click sequence based models: Reducelots of examination sequences
•Non First-‐order Examination Assumption• Skipped results may not be examined: Cascade model and DCM do not work, should try UBM
![Page 23: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/23.jpg)
Non-‐sequential Examination Problem
•Partially Sequential Click Model (PSCM)
![Page 24: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/24.jpg)
Non-‐sequential Examination Problem
•Experimental results: Perplexity•Compared with UBM (Sogou: +30.1%, Yandex: +27.4%)•Compared with DBN (Sogou: +31.6%, Yandex: +27.9%)
•
![Page 25: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/25.jpg)
Non-‐sequential Examination Problem
•Experimental results: relevance estimation•Query-‐URL Relevance Generated from Click Models•Annotated by Sogou’s professional assessors
![Page 26: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/26.jpg)
Non-‐sequential Examination Problem
•Experimental results: user preference test•200 queries and 22 human assessors•Which ranking lists is preferred by real users
![Page 27: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/27.jpg)
Take-‐Home Messages
•Presentation bias problem: a click model incorporating vertical results•Attractiveness effect; Cut-‐off effect: for organic results; Restart effect: different examination sequence
•Non-‐sequential examination problem: a click model incorporating revisiting behaviors•Non-‐sequential examination happens a lot : Locally unidirectional and non First-‐order patterns
User studies Assumptions Better Click models
![Page 28: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/28.jpg)
Reference
• (Presentation bias problem) Chao Wang, Yiqun Liu, Min Zhang, ShaopingMa, MeihongZheng, Jing Qian, Kuo Zhang. Incorporating Vertical Results into Search Click Models. The 36th ACM SIGIR conference (SIGIR 2013)• (Non-‐sequential examination problem) Chao Wang, Yiqun Liu, MengWang, Ke Zhou, Jian-‐Yun Nie, ShaopingMa, Incorporating Non-‐sequential Behavior into Click Models. The 38th ACM SIGIR conference (SIGIR2015)• (Vertical Examination problem) Liu Zeyang, Yiqun Liu, Ke Zhou, Min Zhang, ShaopingMa, Influence of Vertical Result in Web Search Examination. The 38th ACM SIGIR conference (SIGIR2015) • Codes are available on https://github.com/THUIR/PSCMModel
![Page 29: Building a Click Model - From Idea to Implementationclickmodels.weebly.com/uploads/5/2/2/5/52257029/sigir2015-tutorial... · Text"vertical 1.2266 1.2139 +5.58% Multimedia"vertical](https://reader035.fdocuments.in/reader035/viewer/2022070915/5fb5eea2e2bb1a674e1f4d79/html5/thumbnails/29.jpg)
Dataset is available for academic use: Eye fixations, mouse movement features, clicks, relevance annotation, examination feedback, …
http://www.thuir.cn/group/~YQLiu/
Thank you