Practical Collapsed Stochastic Variational Inference
-
Upload
arnim-bleier -
Category
Education
-
view
212 -
download
1
description
Transcript of Practical Collapsed Stochastic Variational Inference
Arnim Bleier, [email protected] CSS Workshop, GESIS, Köln, 16.12.2013
Practical Collapsed Stochastic Variational Inference
Classic Collapsed LDA Inference
nkwdi= nkwdi
� zdik
nkwdi= nkwdi
+ zdik
zdik / (ndk + ↵⇡k)nkwdi
+ �
nk. + V �
ndk = ndk � zdik
ndk = ndk + zdik
repeat for each document d do for each word i in d do
end for end for until stopping criterion is met.
nkwdi= nkwdi
� zdik
nkwdi= nkwdi
+ zdik
zdik / (ndk + ↵⇡k)nkwdi
+ �
nk. + V �
ndk = ndk � zdik
ndk = ndk + zdik
repeat for each document d do for each word i in d do
end for end for until stopping criterion is met.
⇡k = ⇡̄k
k�1Y
l=1
(1� ⇡̄l)
vk = � +TX
l=k+1,d
E[ (ndl � 1)]
uk = 1 +X
d
E[ (ndk � 1)]
Sato, I., Kurihara, K., Nakagawa, H.: Practical Collapsed Variational Bayes Inference for Hierarchical Dirichlet Process. KDD (2012)
⇡̄k =uk
uk + vk; with
Stochastic Practical Collapsed Variational Inference for the HDP
Stochastic Practical Collapsed Variational Inference for the HDP
nkwdi= nkwdi
� zdik
nkwdi= nkwdi
+ zdik
zdik / (ndk + ↵⇡k)nkwdi
+ �
nk. + V �
ndk = ndk � zdik
repeat for each document d do for each word i in d do
ndk (1� ⇢dt )ndk + ⇢dtNdzkndk = ndk + zdiknkw (1� ⇢ct)nkw + ⇢ctNzk [wdi = w]
⇡k = ⇡̄k
k�1Y
l=1
(1� ⇡̄l)
vk = � +TX
l=k+1,d
E[ (ndl � 1)]
uk = 1 +X
d
E[ (ndk � 1)]
⇡̄k =uk
uk + vk; with
Bleier, A.: Practical Collapsed Stochastic Variational Inference for the HDP. NIPS Topic Models Workshop (2013)
uk (1� ⇢ht )uk + ⇢ht (1 +DE[ (ndk � 1)])
vk (1� ⇢ht )vk + ⇢ht (� +DTX
l=k+1
E[ (ndl � 1)])
end for end for until stopping criterion is met.
Stochastic Variational Inference for LDA
nkwdi= nkwdi
� zdik
nkwdi= nkwdi
+ zdik
zdik / (ndk + ↵⇡k)nkwdi
+ �
nk. + V �
ndk = ndk � zdik
repeat for each document d do for each word i in d do
ndk (1� ⇢dt )ndk + ⇢dtNdzkndk = ndk + zdiknkw (1� ⇢ct)nkw + ⇢ctNzk [wdi = w]
end for end for until stopping criterion is met.
Foulds, J., Boyles, L., Smyth, P., Welling, M.: Stochastic Collapsed Variational Bayesian Inference for LDA. KDD (2013)
Evaluation
Predictive performance of the algorithm on the Associated Press (TREC-1) data. For the evaluation we compared the perplexity versus the number of documents seen for PCSVB0, SCVB0 and PCVB0. We trained the model on 80% of the documents. All held out documents were split; 70% of the tokens in each held out document were used to estimate the document parameters, the remaining 30% were used to compute the perplexity. Step-size schedule: Configuration: corpus-wide schedule : s = 10, = 1000 document schedule : s = 1, = 10 hyperparameter & stick weights : s = 5, = 100
⇢t =s
(⌧ + t)0.9
⇢ct ⌧⇢dt ⌧
⌧⇢ht