Page 1: Latent Dirichlet Allocation and Its Application in ...csse.szu.edu.cn/staff/panwk/recommendation/OCCF/LDA.pdf · Latent Dirichlet Allocation. JMLR 2003. • Thomas L. Griffiths and

Latent Dirichlet Allocation and Its Application in Recommender Systems

Weike Pan

Thanks to Ms. Qing Zhang

College of Computer Science and Software Engineering, Shenzhen University

Introduction

Topic modeling

Introduction

Token vs. term

• Note that word-instance = token and term = word.

• For example, in a document “my name is peter and my nationality is usa”, there are two tokens of the term “my”.
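The distinction can be checked on the example document with a few lines of Python. This is only an illustrative sketch: whitespace splitting stands in for a real tokenizer, and the variable names are not from the slides.

```python
from collections import Counter

doc = "my name is peter and my nationality is usa"

# Split on whitespace: each resulting word-instance is a token.
tokens = doc.split()

# Count how many tokens each distinct term has.
term_counts = Counter(tokens)

num_tokens = len(tokens)        # number of word-instances
num_terms = len(term_counts)    # number of distinct words

print(num_tokens, num_terms)    # 9 7
print(term_counts["my"])        # 2
```

The document has 9 tokens but only 7 terms, because the terms "my" and "is" each occur twice.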

Introduction

Notations

Modeling

Graphical model

Modeling

Generation
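As a rough illustration, the standard generative process of smoothed LDA (as described by Blei et al. and by Griffiths and Steyvers) can be sketched in plain Python. The symbol names `alpha`, `beta`, `theta` and `phi`, the symmetric priors, the fixed document length and the function names are assumptions made for this sketch and may differ from the notation used in the slides.

```python
import random

def sample_dirichlet(alphas, rng):
    """Draw from a Dirichlet by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [x / total for x in draws]

def generate_corpus(num_docs, doc_len, num_topics, vocab_size,
                    alpha=0.5, beta=0.5, seed=0):
    """Generate documents as lists of term ids in [0, vocab_size)."""
    rng = random.Random(seed)
    # One term distribution phi_k per topic, drawn from Dirichlet(beta).
    phi = [sample_dirichlet([beta] * vocab_size, rng)
           for _ in range(num_topics)]
    docs = []
    for _ in range(num_docs):
        # One topic distribution theta_d per document, drawn from Dirichlet(alpha).
        theta = sample_dirichlet([alpha] * num_topics, rng)
        doc = []
        for _ in range(doc_len):
            # Pick a topic for this token, then a term from that topic.
            z = rng.choices(range(num_topics), weights=theta)[0]
            w = rng.choices(range(vocab_size), weights=phi[z])[0]
            doc.append(w)
        docs.append(doc)
    return docs

docs = generate_corpus(num_docs=3, doc_len=5, num_topics=2, vocab_size=4)
```

Each document is a list of term ids. Inference runs in the opposite direction: given only the documents, it recovers the hidden topic assignments.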

Modeling

Objective function

Approximate Inference

• Exact inference

• Approximate inference

– Variational method

– Collapsed Gibbs sampling (the approach adopted in these slides)

• Collapsed Gibbs Sampling

– A Markov chain Monte Carlo (MCMC) algorithm

– Main idea:

• For the current token w

• Calculate the probability that w belongs to each topic, given the current topic assignments of all other tokens

• Sample a new topic for w according to these probabilities
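The main idea above can be sketched in plain Python as follows. This is a minimal sketch rather than the exact algorithm from the slides: it assumes symmetric priors `alpha` and `beta` and uses the widely used update from Griffiths and Steyvers, in which the probability of topic k for a token of term w in document d is proportional to (n_dk + alpha) * (n_kw + beta) / (n_k + V * beta), with the current token excluded from the counts. All function and variable names are hypothetical.

```python
import random

def collapsed_gibbs_lda(docs, num_topics, vocab_size,
                        alpha=0.5, beta=0.1, num_iters=50, seed=0):
    """docs: list of documents, each a list of term ids in [0, vocab_size)."""
    rng = random.Random(seed)
    n_dk = [[0] * num_topics for _ in docs]               # tokens in doc d with topic k
    n_kw = [[0] * vocab_size for _ in range(num_topics)]  # tokens of term w with topic k
    n_k = [0] * num_topics                                # tokens with topic k
    z = []                                                # topic of every token

    # Initialize every token with a random topic.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
        z.append(z_d)

    for _ in range(num_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from the counts.
                k = z[d][i]
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                # Unnormalized probability of each topic for this token.
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][w] + beta)
                    / (n_k[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                # Sample a new topic in proportion to these weights.
                k = rng.choices(range(num_topics), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1
    return z, n_dk, n_kw, n_k

# Toy corpus with 6 terms; the data is made up for illustration.
docs = [[0, 1, 0, 1, 2], [3, 4, 3, 4, 5], [0, 1, 2, 2], [3, 5, 4, 4]]
z, n_dk, n_kw, n_k = collapsed_gibbs_lda(docs, num_topics=2, vocab_size=6)
```

After sampling, the count tables `n_dk` and `n_kw` can be smoothed with `alpha` and `beta` and normalized to estimate the document-topic and topic-term distributions.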

Algorithm

Algorithm (Collapsed Gibbs Sampling)

Application in Recommender Systems

• In recommender systems (in particular, in one-class collaborative filtering), we can treat users as documents and items as terms, and model the users’ behaviors using LDA.

• Note that the algorithm on the previous pages can be used without modification.

• ...
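The user-as-document mapping can be sketched in Python as follows. The interaction data, the identifiers and the numeric values of `theta_u1` and `phi` are hypothetical. Scoring an item as the sum over topics of theta[k] * phi[k][item] is one common way to turn a fitted LDA model into recommendations, assumed here for illustration rather than taken from the slides.

```python
from collections import defaultdict

# Hypothetical one-class feedback: (user, item) pairs such as clicks or purchases.
interactions = [("u1", "i1"), ("u1", "i2"), ("u2", "i2"),
                ("u2", "i3"), ("u3", "i1")]

# Map each item to an integer id; item ids play the role of term ids.
item_ids = {}
for _, item in interactions:
    item_ids.setdefault(item, len(item_ids))

# Each user becomes a "document" whose tokens are the items interacted with.
user_docs = defaultdict(list)
for user, item in interactions:
    user_docs[user].append(item_ids[item])

def score_items(theta_u, phi):
    """Score each item for one user as sum_k theta_u[k] * phi[k][i]."""
    num_items = len(phi[0])
    return [sum(theta_u[k] * phi[k][i] for k in range(len(theta_u)))
            for i in range(num_items)]

# Made-up distributions standing in for the output of LDA inference.
theta_u1 = [0.8, 0.2]                        # user u1's topic distribution
phi = [[0.6, 0.3, 0.1], [0.1, 0.2, 0.7]]     # item distribution of each topic

scores = score_items(theta_u1, phi)

# Recommend only items the user has not interacted with, best score first.
seen = set(user_docs["u1"])
ranked = sorted((i for i in range(len(scores)) if i not in seen),
                key=lambda i: -scores[i])
```

The lists in `user_docs` have the same shape as tokenized documents, which is why a topic-model sampler needs no changes to run on them.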

References

• David M. Blei, Andrew Y. Ng and Michael I. Jordan. Latent Dirichlet Allocation. JMLR 2003.

• Thomas L. Griffiths and Mark Steyvers. Finding Scientific Topics. PNAS 2004.

• David M. Blei. Probabilistic Topic Models. CACM 2012.

• Haijun Zhang, Zhoujun Li, Yan Chen, Xiaoming Zhang and Senzhang Wang. Exploit Latent Dirichlet Allocation for One-Class Collaborative Filtering. CIKM 2014.