
Topic models

Source: “Topic models”, David Blei, MLSS ‘09

Topic modeling - Motivation

Discover topics from a corpus

Model connections between topics

Model the evolution of topics over time

Image annotation

Extensions*

• Malleable: can be quickly extended to data with tags (side information), class labels, etc.

• The (approximate) inference methods can be readily translated in many cases

• Most datasets can be converted to ‘bag-of-words’ format using a codebook representation, and LDA-style models can then be readily applied (they can work with continuous observations too); see the sketch after this list

*YMMV
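
As a hedged illustration of the codebook idea above (not from the slides): continuous feature vectors can be vector-quantized with k-means, so each descriptor becomes a "word" (its nearest cluster index) and each item becomes a bag of such words. All names and sizes here are illustrative.

```python
# Sketch: turning continuous features into bag-of-words via a k-means
# codebook, so LDA-style models can be applied. Assumes numpy and
# scikit-learn; sizes and names are illustrative.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
# e.g. 100 "documents" (images), each with 50 continuous descriptors of dim 16
docs = [rng.normal(size=(50, 16)) for _ in range(100)]

# Learn a codebook of V "visual words" on all descriptors pooled together
V = 256
codebook = KMeans(n_clusters=V, n_init=10, random_state=0).fit(np.vstack(docs))

def to_bow(doc):
    words = codebook.predict(doc)           # nearest-codeword index per descriptor
    return np.bincount(words, minlength=V)  # bag-of-words counts

X = np.stack([to_bow(d) for d in docs])     # corpus in document-term form
```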

Connection to ML research

Latent Dirichlet Allocation

LDA

Probabilistic modeling

Intuition behind LDA

Generative model
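
A minimal numpy sketch of the generative process this slide describes (hyperparameters and sizes are illustrative, not from the talk): each topic is a distribution over words; each document draws topic proportions \theta, and each word draws a topic z, then a word from that topic.

```python
# Minimal sketch of LDA's generative process (numpy only).
# K topics, V vocabulary words; alpha, eta and all sizes are illustrative.
import numpy as np

rng = np.random.default_rng(0)
K, V, n_docs, doc_len = 5, 1000, 20, 100
alpha, eta = 0.1, 0.01

# Topics: K distributions over the vocabulary
beta = rng.dirichlet([eta] * V, size=K)

docs = []
for _ in range(n_docs):
    theta = rng.dirichlet([alpha] * K)         # per-document topic proportions
    z = rng.choice(K, size=doc_len, p=theta)   # topic assignment for each word slot
    w = [rng.choice(V, p=beta[k]) for k in z]  # word drawn from the assigned topic
    docs.append(w)
```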

The posterior distribution

Graphical models (Aside)

LDA model

Dirichlet distribution

Dirichlet Examples

Darker implies lower magnitude

\alpha < 1 leads to sparser topics
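
A quick numpy illustration of that effect (the printed values are representative of a typical run):

```python
# Symmetric Dirichlet samples: alpha < 1 pushes mass onto few components
# (sparse), alpha > 1 spreads mass nearly uniformly.
import numpy as np

rng = np.random.default_rng(0)
print(rng.dirichlet([0.1] * 5))   # sparse: most mass on one or two components
print(rng.dirichlet([10.0] * 5))  # close to uniform across the 5 components
```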

LDA

Inference in LDA

Example inference

Topics vs words

Explore and browse document collections

Why does LDA “work”?

LDA is modular, general, useful

Approximate inference

• An excellent reference is “On smoothing and inference for topic models” Asuncion et al. (2009).

Posterior distribution for LDA

The only parameters we need to estimate are \alpha, \beta
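
Concretely (standard LDA notation, with \beta a parameter as stated above), the per-document posterior is:

```latex
p(\theta, z \mid w, \alpha, \beta)
  = \frac{p(\theta \mid \alpha) \prod_{n=1}^{N} p(z_n \mid \theta)\, p(w_n \mid z_n, \beta)}
         {\int p(\theta \mid \alpha) \prod_{n=1}^{N} \sum_{z_n} p(z_n \mid \theta)\, p(w_n \mid z_n, \beta)\, d\theta}
```

The denominator, the marginal likelihood p(w | \alpha, \beta), couples \theta and z and is intractable to compute exactly, which is why approximate inference is needed.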

Posterior distribution

Posterior distribution for LDA

• Can integrate out either \theta or z, but not both

• Marginalize \theta => z ~ Polya(\alpha)
• The Polya distribution is also known as the Dirichlet compound multinomial (models “burstiness”)
• Most algorithms marginalize out \theta
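
Spelled out, marginalizing \theta gives the Polya distribution over a document's N topic assignments (standard result; n_k is the number of words assigned to topic k):

```latex
p(z \mid \alpha) = \int p(z \mid \theta)\, p(\theta \mid \alpha)\, d\theta
  = \frac{\Gamma\!\left(\sum_k \alpha_k\right)}{\Gamma\!\left(N + \sum_k \alpha_k\right)}
    \prod_{k=1}^{K} \frac{\Gamma(n_k + \alpha_k)}{\Gamma(\alpha_k)}
```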

MAP inference

• Integrate out z
• Treat \theta as a random variable
• Can use the EM algorithm
• Updates very similar to those of PLSA (except for additional regularization terms)

Collapsed Gibbs sampling
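
A minimal sketch of one sweep of the collapsed Gibbs sampler (the standard Griffiths–Steyvers update; the array names and count bookkeeping are illustrative):

```python
# One sweep of collapsed Gibbs sampling for LDA. Count arrays:
#   n_dk[d, k]: words in doc d assigned to topic k
#   n_kw[k, w]: times word w is assigned to topic k
#   n_k[k]:     total words assigned to topic k
import numpy as np

def gibbs_sweep(docs, z, n_dk, n_kw, n_k, alpha, eta, rng):
    K, V = n_kw.shape
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            k_old = z[d][i]
            # Remove the current assignment from the counts
            n_dk[d, k_old] -= 1; n_kw[k_old, w] -= 1; n_k[k_old] -= 1
            # Conditional for z_{di} given all other assignments
            p = (n_kw[:, w] + eta) * (n_dk[d] + alpha) / (n_k + V * eta)
            k_new = rng.choice(K, p=p / p.sum())
            # Add the new assignment back
            z[d][i] = k_new
            n_dk[d, k_new] += 1; n_kw[k_new, w] += 1; n_k[k_new] += 1
```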

Variational inference

Can be thought of as an extension of EM where we compute expectations w.r.t. a “variational distribution” instead of the true posterior
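
The quantity being optimized is the standard evidence lower bound (ELBO); maximizing it over q is equivalent to minimizing the KL divergence from q to the true posterior:

```latex
\log p(w \mid \alpha, \beta)
  \ge \mathbb{E}_q\!\left[\log p(\theta, z, w \mid \alpha, \beta)\right]
    - \mathbb{E}_q\!\left[\log q(\theta, z)\right]
```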

Mean field variational inference

MFVI and conditional exponential families

Variational inference for LDA
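
With the fully factorized (mean-field) family q(\theta, z) = q(\theta | \gamma) \prod_n q(z_n | \phi_n), the standard coordinate-ascent updates from the LDA paper are:

```latex
\phi_{nk} \propto \beta_{k, w_n} \exp\big(\Psi(\gamma_k)\big),
\qquad
\gamma_k = \alpha_k + \sum_{n=1}^{N} \phi_{nk}
```

Here \Psi is the digamma function; the two updates are iterated to convergence for each document.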

Collapsed variational inference

• MFVI: \theta, z assumed to be independent
• \theta can be marginalized out exactly
• A variational inference algorithm operating on the same “collapsed space” as CGS
• Strictly better lower bound than VB
• Can be thought of as a “soft” CGS where we propagate uncertainty by using probabilities rather than samples

Estimating the topics

Inference comparison

Comparison of updates

“On smoothing and inference for topic models” Asuncion et al. (2009).

Updates compared: MAP, VB, CVB0, and CGS.
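
Schematically, the four updates being compared (adapted from Asuncion et al. (2009), using the talk's V for vocabulary size; N_{wk}, N_{kj}, N_k are word-topic, document-topic, and topic counts, the superscript ¬ij excludes the current token, and for VB/CVB0 the counts are expected counts):

```latex
\text{MAP:}\quad \gamma_{wjk} \propto
  \frac{(N_{wk} + \eta - 1)\,(N_{kj} + \alpha - 1)}{N_k + V\eta - V}
\\[6pt]
\text{VB:}\quad \gamma_{wjk} \propto
  \frac{\exp\Psi(N_{wk} + \eta)\,\exp\Psi(N_{kj} + \alpha)}{\exp\Psi(N_k + V\eta)}
\\[6pt]
\text{CVB0:}\quad \gamma_{ijk} \propto
  \frac{(N_{wk}^{\neg ij} + \eta)\,(N_{kj}^{\neg ij} + \alpha)}{N_k^{\neg ij} + V\eta}
\\[6pt]
\text{CGS:}\quad p(z_{ij} = k \mid z^{\neg ij}, w) \propto
  \frac{(N_{wk}^{\neg ij} + \eta)\,(N_{kj}^{\neg ij} + \alpha)}{N_k^{\neg ij} + V\eta}
```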

Choice of inference algorithm

• Depends on vocabulary size (V) and the number of words per document (say N_i)
• Collapsed algorithms: not parallelizable
• CGS: needs to draw multiple samples of topic assignments for multiple occurrences of the same word (slow when N_i >> V)
• MAP: fast, but performs poorly when N_i << V
• CVB0: good tradeoff between computational complexity and perplexity

Supervised and relational topic models

Supervised LDA
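
For reference, sLDA (Blei & McAuliffe, 2007) generates each document's response downstream from the empirical topic frequencies of its words; in the simplest (Gaussian) case:

```latex
y_d \mid z_{d,1:N}, \eta, \sigma^2 \sim
  \mathcal{N}\!\left(\eta^\top \bar{z}_d,\; \sigma^2\right),
\qquad
\bar{z}_d = \frac{1}{N} \sum_{n=1}^{N} z_{d,n}
```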

Variational inference in sLDA

ML estimation

Prediction

Example: Movie reviews

Diverse response types with GLMs
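
The Gaussian response generalizes to any exponential-family response via a GLM whose natural parameter is \eta^\top \bar{z} (sketched here in the usual GLM form, following the sLDA formulation):

```latex
p(y \mid \zeta, \delta) = h(y, \delta)\,
  \exp\!\left(\frac{\zeta\, y - A(\zeta)}{\delta}\right),
\qquad
\zeta = \eta^\top \bar{z}
```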

Example: Multi-class classification

Supervised topic models

Upstream vs downstream models

Upstream: conditional models
Downstream: the response variable is generated from the actually observed z’s rather than from \theta, which is E(z)

Relational topic models
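
In the RTM (Chang & Blei, 2009), a binary link between documents d and d' is generated from their topic assignments; one common form uses a logistic function of the element-wise product of empirical topic frequencies:

```latex
p(y_{d,d'} = 1 \mid z_d, z_{d'}) =
  \sigma\!\left(\eta^\top (\bar{z}_d \circ \bar{z}_{d'}) + \nu\right)
```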

Predictive performance of one type given the other

Predicting links from documents

Things we didn’t address

• Model selection: non-parametric Bayesian approaches
• Hyperparameter tuning
• Evaluation can be a bit tricky for LDA (comparing approximate bounds), but traditional metrics can be used in the supervised versions

Thank you!