SlideShare una empresa de Scribd logo
1 de 4
Descargar para leer sin conexión
Expectation-Propagation for Latent Dirichlet Allocation
Tomonari MASADA @ Nagasaki University
November 22, 2018
1
This manuscript contains a derivation of the update formulae of expectation-propagation (EP) presented
in the following paper:
Thomas Minka and John Lafferty.
Expectation-Propagation for the Generative Aspect Model.
in Proc. of the 18th Conference on Uncertainty in Artificial Intelligence, pp. 352-359, 2002.1
2
A joint distribution for document d in LDA can be written as below.
p(x, θd|α) = p(θd|α)p(x|θd)
= p(θd|α)
nd
i=1
p(xdi|θd) = p(θd|α)
nd
i=1 k
θdkφkxdi
= p(θd|α)
w k
θdkφkw
ndw
, (1)
where p(θd|α) is a Dirichlet prior distribution.
We approximate p(w|θd) = k θdkφkw by tw(θd) = sw k θβwk
dk .
Then we obtain an approximated joint distribution as follows:
p(x, θd|α) ≈ p(θd|α)
w
tw(θd)ndw
= p(θd|α)
w
sw
k
θβwk
dk
ndw
=
Γ( k αk)
k Γ(αk) w
sndw
w
k
θ
αk−1+ w βwkndw
dk (2)
Let γdk = αk + w βwkndw. That is,
p(xd, θd|α) ≈
Γ( k αk)
k Γ(αk) w
sndw
w
k
θγdk−1
dk . (3)
An approximated posterior q(θd) can be obtained as follows:
q(θd) =
Γ( k αk)
k Γ(αk) w sndw
w k θγdk−1
dk
Γ( k αk)
k Γ(αk) w sndw
w k θγdk−1
dk dθd
= k θγdk−1
dk
k θγdk−1
dk dθd
=
Γ( k γdk)
k Γ(γdk)
k
θγdk−1
dk . (4)
1http://research.microsoft.com/en-us/um/people/minka/papers/aspect/
1
3
For each word token in document d, we remove its contribution to w tw(θd)ndw
= w sw k θβwk
dk
ndw
.
When the token is a token of word w, this corresponds to a division by sw k θβwk
dk .
Note that we here consider a removal of word token, not a removal of word type.
Then we obtain an ‘old’ posterior after this division as follows:
qw
(θd) =
Γ k(γdk − βwk)
k Γ(γdk − βwk)
k
θγdk−βwk−1
dk =
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk , (5)
where γ
w
dk ≡ γdk − βwk.
By combining p(w|θd) = k θdkΦkw with qw
(θd), we obtain a something similar to joint distribution
as follows:
(
k
θdkφkw) ·
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk .
Therefore, by integrating θd out, we obtain a something similar to evidence as follows:
k
θdkφkw
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk
−1
dk dθd = k φkwγ
w
dk
k γ
w
dk
.
Consequently, we obtain a new posterior as follows:
˜q(θd) = k γ
w
dk
k φkwγ
w
dk k
θdkφkw
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk
=
ˆγd
w
k φkwγ
w
dk k
θdkφkw
Γ( ˆγd
w
)
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk , (6)
where ˆγd
w
≡ k γ
w
dk .
4
Based on Eq. (6), we calculate E˜q[θdk] and E˜q[θ2
dk].
E˜q[θdk] = θdk ˜q(θd)dθd
= θdk
ˆγd
w
k φkwγ
w
dk k
θdkφkw
Γ( ˆγd
w
)
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk dθd
=
ˆγd
w
k φkwγ
w
dk
·
Γ( ˆγd
w
)
k Γ(γ
w
dk )
θdk
k
θdkφkw
k
θ
γ
w
dk −1
dk dθd
=
ˆγd
w
k φkwγ
w
dk
·
Γ( ˆγd
w
)
k Γ(γ
w
dk )
θ2
dkφkw
k
θ
γ
w
dk −1
dk dθd +
k =k
θdkθdk φk w
k
θ
γ
w
dk −1
dk dθd
=
ˆγd
w
k φkwγ
w
dk
·
Γ( ˆγd
w
)
k Γ(γ
w
dk )
φkw θ2
dk
k
θ
γ
w
dk −1
dk dθd +
k =k
φk w θdkθdk
k
θ
γ
w
dk −1
dk dθd
=
ˆγd
w
k φkwγ
w
dk
·
Γ( ˆγd
w
)
k Γ(γ
w
dk )
φkw θ
γ
w
dk +1
dk
l=k
θ
γ
w
dl −1
dl dθd +
k =k
φk w θ
γ
w
dk
dk θ
γ
w
dk
dk
l=k∧l=k
θ
γ
w
dl −1
dl dθd
It should be noted that k Γ(γdk)
Γ( k γdk) = k θγdk−1
dk dθd holds for any γd = (γd1, . . . , γdK). This is how
we obtain the normalizing constant of Dirichlet distribution. Therefore, θ
γ
w
dk +1
dk l=k θ
γ
w
dl −1
dl dθd =
2
Γ(γ
w
dk +2) l=k Γ(γ
w
dl )
Γ( ˆγd
w+2)
. Further, θ
γ
w
dk
dk θ
γ
w
dk
dk l=k∧l=k θ
γ
w
dl −1
dl dθd =
Γ(γ
w
dk +1)Γ(γ
w
dk
+1) l=k∧l=k Γ(γ
w
dl )
Γ( ˆγd
w+2)
. With
these results, we can continue the formula rearrangement as follows:
∴ E˜q[θdk] =
ˆγd
w
k φkwγ
w
dk
·
Γ( ˆγd
w
)
k Γ(γ
w
dk )
φkw
Γ(γ
w
dk + 2) l=k Γ(γ
w
dl )
Γ( ˆγd
w
+ 2)
+
k =k
φk w
Γ(γ
w
dk + 1)Γ(γ
w
dk + 1) l=k∧l=k Γ(γ
w
dl )
Γ( ˆγd
w
+ 2)
=
ˆγd
w
k φkwγ
w
dk
φkw
Γ( ˆγd
w
)
k Γ(γ
w
dk )
·
Γ(γ
w
dk + 2) l=k Γ(γ
w
dl )
Γ( ˆγd
w
+ 2)
+
k =k
φk w
Γ( ˆγd
w
)
k Γ(γ
w
dk )
·
Γ(γ
w
dk + 1)Γ(γ
w
dk + 1) l=k∧l=k Γ(γ
w
dl )
Γ( ˆγd
w
+ 2)
=
ˆγd
w
k φkwγ
w
dk
φkw
γ
w
dk (γ
w
dk + 1)
ˆγd
w
( ˆγd
w
+ 1)
+
k =k
φk w
γ
w
dk γ
w
dk
ˆγd
w
( ˆγd
w
+ 1)
=
ˆγd
w
k φkwγ
w
dk
·
γ
w
dk
ˆγd
w
·
φkw + k φkwγ
w
dk
ˆγd
w
+ 1
(7)
E˜q[θ2
dk] = θ2
dk ˜q(θd)dθd
= θ2
dk
ˆγd
w
k φkwγ
w
dk k
θdkφkw
Γ( ˆγd
w
)
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk dθd
=
ˆγd
w
k φkwγ
w
dk
φkw
γ
w
dk (γ
w
dk + 1)(γ
w
dk + 2)
ˆγd
w
( ˆγd
w
+ 1)( ˆγd
w
+ 2)
+
k =k
φk w
γ
w
dk (γ
w
dk + 1)γ
w
dk
ˆγd
w
( ˆγd
w
+ 1)( ˆγd
w
+ 2)
=
ˆγd
w
k φkwγ
w
dk
·
γ
w
dk
ˆγd
w
·
γ
w
dk + 1
ˆγd
w
+ 1
·
2φkw + k φkwγ
w
dk
ˆγd
w
+ 2
. (8)
We determine a Dirichlet distribution Dir(γd) so that the mean
γdk
ˆγd
and the average second moment2
1
K k
γdk(γdk+1)
ˆγd(ˆγd+1) are equal to those of ˜q(θd), where ˆγd = k γdk. This procedure is called moment
matching.
To be precise, we obtain γd by solving the following equations:
γdk
ˆγd
= E˜q[θdk] for each k, and (9)
1
K
k
γdk(γdk + 1)
ˆγd(ˆγd + 1)
=
1
K
k
E˜q[θ2
dk] . (10)
2http://www.ucl.ac.uk/statistics/research/pdfs/135.zip
3
We can obtain γdk as below.
γdk = E˜q[θdk]ˆγd
k
E˜q[θdk]ˆγd(E˜q[θdk]ˆγd + 1)
ˆγd(ˆγd + 1)
=
k
E˜q[θ2
dk]
k
E˜q[θdk](E˜q[θdk]ˆγd + 1) =
k
E˜q[θ2
dk](ˆγd + 1)
k
E˜q[θdk]2
ˆγd +
k
E˜q[θdk] =
k
E˜q[θ2
dk]ˆγd +
k
E˜q[θ2
dk]
k
E˜q[θdk]2
− E˜q[θ2
dk] ˆγd =
k
E˜q[θ2
dk] − E˜q[θdk]
∴ ˆγd = k(E˜q[θ2
dk] − E˜q[θdk])
k(E˜q[θdk]2 − E˜q[θ2
dk])
(11)
∴ γdk = k(E˜q[θ2
dk] − E˜q[θdk])
k(E˜q[θdk]2 − E˜q[θ2
dk])
· E˜q[θdk] (12)
5
We now have a Dirichlet distribution that approxiates the posterior ˜q(θd) in Eq. (6), i.e.,
˜q(θd) = k γ
w
dk
k φkwγ
w
dk k
θdkφkw
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk .
To obtain a density function of Dirichlet distribution from Eq. (6), we replace the term ( k θdkφkw)
by tw(θd) = sw k θ
βwk
dk . We set the resulting function equal to the density function of Dir(γd) and obtain
the following equation:
k γ
w
dk
k φkwγ
w
dk
sw
k
θ
βwk
dk
Γ( k γ
w
dk )
k Γ(γ
w
dk ) k
θ
γ
w
dk −1
dk =
Γ( k γdk)
k Γ(γdk)
k
θ
γdk−1
dk . (13)
Consequently, sw and βwk are determined as follows:
βwk = γdk − γ
w
dk , (14)
sw = k φkwγ
w
dk
k γ
w
dk
Γ( k γdk)
k Γ(γdk)
k Γ(γ
w
dk )
Γ( k γ
w
dk )
. (15)
4

Más contenido relacionado

La actualidad más candente

Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu HinoSuurist
 
Generarlized operations on fuzzy graphs
Generarlized operations on fuzzy graphsGenerarlized operations on fuzzy graphs
Generarlized operations on fuzzy graphsAlexander Decker
 
A non-stiff boundary integral method for internal waves
A non-stiff boundary integral method for internal wavesA non-stiff boundary integral method for internal waves
A non-stiff boundary integral method for internal wavesAlex (Oleksiy) Varfolomiyev
 
Tetsunao Matsuta
Tetsunao MatsutaTetsunao Matsuta
Tetsunao MatsutaSuurist
 
Hiroyuki Sato
Hiroyuki SatoHiroyuki Sato
Hiroyuki SatoSuurist
 
Appendix of downlink coverage probability in heterogeneous cellular networks ...
Appendix of downlink coverage probability in heterogeneous cellular networks ...Appendix of downlink coverage probability in heterogeneous cellular networks ...
Appendix of downlink coverage probability in heterogeneous cellular networks ...Cora Li
 
2013추계학술대회 인쇄용
2013추계학술대회 인쇄용2013추계학술대회 인쇄용
2013추계학술대회 인쇄용Byung Kook Ha
 
Introduction to CNN with Application to Object Recognition
Introduction to CNN with Application to Object RecognitionIntroduction to CNN with Application to Object Recognition
Introduction to CNN with Application to Object RecognitionArtifacia
 
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-ssusere0a682
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)Mark Chang
 
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...Hansol Kang
 
Gaps between the theory and practice of large-scale matrix-based network comp...
Gaps between the theory and practice of large-scale matrix-based network comp...Gaps between the theory and practice of large-scale matrix-based network comp...
Gaps between the theory and practice of large-scale matrix-based network comp...David Gleich
 
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」Ken'ichi Matsui
 
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHS
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHSODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHS
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHSijscmcj
 
0-1 KNAPSACK PROBLEM
0-1 KNAPSACK PROBLEM0-1 KNAPSACK PROBLEM
0-1 KNAPSACK PROBLEMi i
 
Combined Functions
Combined FunctionsCombined Functions
Combined FunctionsJoviBerbz
 

La actualidad más candente (20)

Yahoo Query Language
Yahoo Query LanguageYahoo Query Language
Yahoo Query Language
 
Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu Hino
 
Generarlized operations on fuzzy graphs
Generarlized operations on fuzzy graphsGenerarlized operations on fuzzy graphs
Generarlized operations on fuzzy graphs
 
A non-stiff boundary integral method for internal waves
A non-stiff boundary integral method for internal wavesA non-stiff boundary integral method for internal waves
A non-stiff boundary integral method for internal waves
 
Tetsunao Matsuta
Tetsunao MatsutaTetsunao Matsuta
Tetsunao Matsuta
 
Hiroyuki Sato
Hiroyuki SatoHiroyuki Sato
Hiroyuki Sato
 
Appendix of downlink coverage probability in heterogeneous cellular networks ...
Appendix of downlink coverage probability in heterogeneous cellular networks ...Appendix of downlink coverage probability in heterogeneous cellular networks ...
Appendix of downlink coverage probability in heterogeneous cellular networks ...
 
2013추계학술대회 인쇄용
2013추계학술대회 인쇄용2013추계학술대회 인쇄용
2013추계학술대회 인쇄용
 
Introduction to CNN with Application to Object Recognition
Introduction to CNN with Application to Object RecognitionIntroduction to CNN with Application to Object Recognition
Introduction to CNN with Application to Object Recognition
 
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-
ゲーム理論NEXT 線形計画問題第10回 -シンプレックス法5 人工変数続き-
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)
 
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN : Interpretable Representation Learning by Information Maximizing Gen...
 
Gaps between the theory and practice of large-scale matrix-based network comp...
Gaps between the theory and practice of large-scale matrix-based network comp...Gaps between the theory and practice of large-scale matrix-based network comp...
Gaps between the theory and practice of large-scale matrix-based network comp...
 
Ma2002 1.10 rm
Ma2002 1.10 rmMa2002 1.10 rm
Ma2002 1.10 rm
 
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
 
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHS
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHSODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHS
ODD GRACEFULL LABELING FOR THE SUBDIVISON OF DOUBLE TRIANGLES GRAPHS
 
0-1 KNAPSACK PROBLEM
0-1 KNAPSACK PROBLEM0-1 KNAPSACK PROBLEM
0-1 KNAPSACK PROBLEM
 
Combined Functions
Combined FunctionsCombined Functions
Combined Functions
 
Bessel 1 div_3
Bessel 1 div_3Bessel 1 div_3
Bessel 1 div_3
 
Sor
SorSor
Sor
 

Similar a Expectation propagation for latent Dirichlet allocation

A Note on Correlated Topic Models
A Note on Correlated Topic ModelsA Note on Correlated Topic Models
A Note on Correlated Topic ModelsTomonari Masada
 
Topic modeling with Poisson factorization (2)
Topic modeling with Poisson factorization (2)Topic modeling with Poisson factorization (2)
Topic modeling with Poisson factorization (2)Tomonari Masada
 
A Note on Latent LSTM Allocation
A Note on Latent LSTM AllocationA Note on Latent LSTM Allocation
A Note on Latent LSTM AllocationTomonari Masada
 
A common random fixed point theorem for rational ineqality in hilbert space ...
 A common random fixed point theorem for rational ineqality in hilbert space ... A common random fixed point theorem for rational ineqality in hilbert space ...
A common random fixed point theorem for rational ineqality in hilbert space ...Alexander Decker
 
A common unique random fixed point theorem in hilbert space using integral ty...
A common unique random fixed point theorem in hilbert space using integral ty...A common unique random fixed point theorem in hilbert space using integral ty...
A common unique random fixed point theorem in hilbert space using integral ty...Alexander Decker
 
Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology Amro Elfeki
 
Integration techniques
Integration techniquesIntegration techniques
Integration techniquesKrishna Gali
 
Quantum mechanics and the square root of the Brownian motion
Quantum mechanics and the square root of the Brownian motionQuantum mechanics and the square root of the Brownian motion
Quantum mechanics and the square root of the Brownian motionMarco Frasca
 
Tensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationTensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationAlexander Litvinenko
 
SUPER MAGIC CORONATIONS OF GRAPHS
SUPER MAGIC CORONATIONS OF GRAPHS SUPER MAGIC CORONATIONS OF GRAPHS
SUPER MAGIC CORONATIONS OF GRAPHS IAEME Publication
 
Common fixed point and weak commuting mappings
Common fixed point and weak commuting mappingsCommon fixed point and weak commuting mappings
Common fixed point and weak commuting mappingsAlexander Decker
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13Carlos Fuentes
 
Conformable Chebyshev differential equation of first kind
Conformable Chebyshev differential equation of first kindConformable Chebyshev differential equation of first kind
Conformable Chebyshev differential equation of first kindIJECEIAES
 
Number theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newNumber theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newChristos Loizos
 
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...Tomonari Masada
 
Fast parallelizable scenario-based stochastic optimization
Fast parallelizable scenario-based stochastic optimizationFast parallelizable scenario-based stochastic optimization
Fast parallelizable scenario-based stochastic optimizationPantelis Sopasakis
 
A Note on PCVB0 for HDP-LDA
A Note on PCVB0 for HDP-LDAA Note on PCVB0 for HDP-LDA
A Note on PCVB0 for HDP-LDATomonari Masada
 

Similar a Expectation propagation for latent Dirichlet allocation (20)

A Note on Correlated Topic Models
A Note on Correlated Topic ModelsA Note on Correlated Topic Models
A Note on Correlated Topic Models
 
Poisson factorization
Poisson factorizationPoisson factorization
Poisson factorization
 
Topic modeling with Poisson factorization (2)
Topic modeling with Poisson factorization (2)Topic modeling with Poisson factorization (2)
Topic modeling with Poisson factorization (2)
 
A Note on Latent LSTM Allocation
A Note on Latent LSTM AllocationA Note on Latent LSTM Allocation
A Note on Latent LSTM Allocation
 
A common random fixed point theorem for rational ineqality in hilbert space ...
 A common random fixed point theorem for rational ineqality in hilbert space ... A common random fixed point theorem for rational ineqality in hilbert space ...
A common random fixed point theorem for rational ineqality in hilbert space ...
 
A common unique random fixed point theorem in hilbert space using integral ty...
A common unique random fixed point theorem in hilbert space using integral ty...A common unique random fixed point theorem in hilbert space using integral ty...
A common unique random fixed point theorem in hilbert space using integral ty...
 
Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology
 
Integration techniques
Integration techniquesIntegration techniques
Integration techniques
 
Quantum mechanics and the square root of the Brownian motion
Quantum mechanics and the square root of the Brownian motionQuantum mechanics and the square root of the Brownian motion
Quantum mechanics and the square root of the Brownian motion
 
Tensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationTensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantification
 
SUPER MAGIC CORONATIONS OF GRAPHS
SUPER MAGIC CORONATIONS OF GRAPHS SUPER MAGIC CORONATIONS OF GRAPHS
SUPER MAGIC CORONATIONS OF GRAPHS
 
Common fixed point and weak commuting mappings
Common fixed point and weak commuting mappingsCommon fixed point and weak commuting mappings
Common fixed point and weak commuting mappings
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13
 
Conformable Chebyshev differential equation of first kind
Conformable Chebyshev differential equation of first kindConformable Chebyshev differential equation of first kind
Conformable Chebyshev differential equation of first kind
 
Number theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newNumber theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-new
 
Talk5
Talk5Talk5
Talk5
 
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...
A derivation of the sampling formulas for An Entity-Topic Model for Entity Li...
 
Fast parallelizable scenario-based stochastic optimization
Fast parallelizable scenario-based stochastic optimizationFast parallelizable scenario-based stochastic optimization
Fast parallelizable scenario-based stochastic optimization
 
A Note on PCVB0 for HDP-LDA
A Note on PCVB0 for HDP-LDAA Note on PCVB0 for HDP-LDA
A Note on PCVB0 for HDP-LDA
 
final19
final19final19
final19
 

Más de Tomonari Masada

Learning Latent Space Energy Based Prior Modelの解説
Learning Latent Space Energy Based Prior Modelの解説Learning Latent Space Energy Based Prior Modelの解説
Learning Latent Space Energy Based Prior Modelの解説Tomonari Masada
 
Denoising Diffusion Probabilistic Modelsの重要な式の解説
Denoising Diffusion Probabilistic Modelsの重要な式の解説Denoising Diffusion Probabilistic Modelsの重要な式の解説
Denoising Diffusion Probabilistic Modelsの重要な式の解説Tomonari Masada
 
Context-dependent Token-wise Variational Autoencoder for Topic Modeling
Context-dependent Token-wise Variational Autoencoder for Topic ModelingContext-dependent Token-wise Variational Autoencoder for Topic Modeling
Context-dependent Token-wise Variational Autoencoder for Topic ModelingTomonari Masada
 
A note on the density of Gumbel-softmax
A note on the density of Gumbel-softmaxA note on the density of Gumbel-softmax
A note on the density of Gumbel-softmaxTomonari Masada
 
トピックモデルの基礎と応用
トピックモデルの基礎と応用トピックモデルの基礎と応用
トピックモデルの基礎と応用Tomonari Masada
 
Mini-batch Variational Inference for Time-Aware Topic Modeling
Mini-batch Variational Inference for Time-Aware Topic ModelingMini-batch Variational Inference for Time-Aware Topic Modeling
Mini-batch Variational Inference for Time-Aware Topic ModelingTomonari Masada
 
A note on variational inference for the univariate Gaussian
A note on variational inference for the univariate GaussianA note on variational inference for the univariate Gaussian
A note on variational inference for the univariate GaussianTomonari Masada
 
Document Modeling with Implicit Approximate Posterior Distributions
Document Modeling with Implicit Approximate Posterior DistributionsDocument Modeling with Implicit Approximate Posterior Distributions
Document Modeling with Implicit Approximate Posterior DistributionsTomonari Masada
 
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka CompositionLDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka CompositionTomonari Masada
 
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic ModelA Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic ModelTomonari Masada
 
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationA Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationTomonari Masada
 
Word count in Husserliana Volumes 1 to 28
Word count in Husserliana Volumes 1 to 28Word count in Husserliana Volumes 1 to 28
Word count in Husserliana Volumes 1 to 28Tomonari Masada
 
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationA Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationTomonari Masada
 
A Note on BPTT for LSTM LM
A Note on BPTT for LSTM LMA Note on BPTT for LSTM LM
A Note on BPTT for LSTM LMTomonari Masada
 
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...Tomonari Masada
 
ChronoSAGE: Diversifying Topic Modeling Chronologically
ChronoSAGE: Diversifying Topic Modeling ChronologicallyChronoSAGE: Diversifying Topic Modeling Chronologically
ChronoSAGE: Diversifying Topic Modeling ChronologicallyTomonari Masada
 
A Topic Model for Traffic Speed Data Analysis
A Topic Model for Traffic Speed Data AnalysisA Topic Model for Traffic Speed Data Analysis
A Topic Model for Traffic Speed Data AnalysisTomonari Masada
 

Más de Tomonari Masada (20)

Learning Latent Space Energy Based Prior Modelの解説
Learning Latent Space Energy Based Prior Modelの解説Learning Latent Space Energy Based Prior Modelの解説
Learning Latent Space Energy Based Prior Modelの解説
 
Denoising Diffusion Probabilistic Modelsの重要な式の解説
Denoising Diffusion Probabilistic Modelsの重要な式の解説Denoising Diffusion Probabilistic Modelsの重要な式の解説
Denoising Diffusion Probabilistic Modelsの重要な式の解説
 
Context-dependent Token-wise Variational Autoencoder for Topic Modeling
Context-dependent Token-wise Variational Autoencoder for Topic ModelingContext-dependent Token-wise Variational Autoencoder for Topic Modeling
Context-dependent Token-wise Variational Autoencoder for Topic Modeling
 
A note on the density of Gumbel-softmax
A note on the density of Gumbel-softmaxA note on the density of Gumbel-softmax
A note on the density of Gumbel-softmax
 
トピックモデルの基礎と応用
トピックモデルの基礎と応用トピックモデルの基礎と応用
トピックモデルの基礎と応用
 
Mini-batch Variational Inference for Time-Aware Topic Modeling
Mini-batch Variational Inference for Time-Aware Topic ModelingMini-batch Variational Inference for Time-Aware Topic Modeling
Mini-batch Variational Inference for Time-Aware Topic Modeling
 
A note on variational inference for the univariate Gaussian
A note on variational inference for the univariate GaussianA note on variational inference for the univariate Gaussian
A note on variational inference for the univariate Gaussian
 
Document Modeling with Implicit Approximate Posterior Distributions
Document Modeling with Implicit Approximate Posterior DistributionsDocument Modeling with Implicit Approximate Posterior Distributions
Document Modeling with Implicit Approximate Posterior Distributions
 
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka CompositionLDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition
LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition
 
A Note on ZINB-VAE
A Note on ZINB-VAEA Note on ZINB-VAE
A Note on ZINB-VAE
 
A Note on TopicRNN
A Note on TopicRNNA Note on TopicRNN
A Note on TopicRNN
 
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic ModelA Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
 
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationA Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
 
Word count in Husserliana Volumes 1 to 28
Word count in Husserliana Volumes 1 to 28Word count in Husserliana Volumes 1 to 28
Word count in Husserliana Volumes 1 to 28
 
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet AllocationA Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
A Simple Stochastic Gradient Variational Bayes for Latent Dirichlet Allocation
 
FDSE2015
FDSE2015FDSE2015
FDSE2015
 
A Note on BPTT for LSTM LM
A Note on BPTT for LSTM LMA Note on BPTT for LSTM LM
A Note on BPTT for LSTM LM
 
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...
The detailed derivation of the derivatives in Table 2 of Marginalized Denoisi...
 
ChronoSAGE: Diversifying Topic Modeling Chronologically
ChronoSAGE: Diversifying Topic Modeling ChronologicallyChronoSAGE: Diversifying Topic Modeling Chronologically
ChronoSAGE: Diversifying Topic Modeling Chronologically
 
A Topic Model for Traffic Speed Data Analysis
A Topic Model for Traffic Speed Data AnalysisA Topic Model for Traffic Speed Data Analysis
A Topic Model for Traffic Speed Data Analysis
 

Último

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Último (20)

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

Expectation propagation for latent Dirichlet allocation

  • 1. Expectation-Propagation for Latent Dirichlet Allocation Tomonari MASADA @ Nagasaki University November 22, 2018 1 This manuscript contains a derivation of the update formulae of expectation-propagation (EP) presented in the following paper: Thomas Minka and John Lafferty. Expectation-Propagation for the Generative Aspect Model. in Proc. of the 18th Conference on Uncertainty in Artificial Intelligence, pp. 352-359, 2002.1 2 A joint distribution for document d in LDA can be written as below. p(x, θd|α) = p(θd|α)p(x|θd) = p(θd|α) nd i=1 p(xdi|θd) = p(θd|α) nd i=1 k θdkφkxdi = p(θd|α) w k θdkφkw ndw , (1) where p(θd|α) is a Dirichlet prior distribution. We approximate p(w|θd) = k θdkφkw by tw(θd) = sw k θβwk dk . Then we obtain an approximated joint distribution as follows: p(x, θd|α) ≈ p(θd|α) w tw(θd)ndw = p(θd|α) w sw k θβwk dk ndw = Γ( k αk) k Γ(αk) w sndw w k θ αk−1+ w βwkndw dk (2) Let γdk = αk + w βwkndw. That is, p(xd, θd|α) ≈ Γ( k αk) k Γ(αk) w sndw w k θγdk−1 dk . (3) An approximated posterior q(θd) can be obtained as follows: q(θd) = Γ( k αk) k Γ(αk) w sndw w k θγdk−1 dk Γ( k αk) k Γ(αk) w sndw w k θγdk−1 dk dθd = k θγdk−1 dk k θγdk−1 dk dθd = Γ( k γdk) k Γ(γdk) k θγdk−1 dk . (4) 1http://research.microsoft.com/en-us/um/people/minka/papers/aspect/ 1
  • 2. 3 For each word token in document d, we remove its contribution to w tw(θd)ndw = w sw k θβwk dk ndw . When the token is a token of word w, this corresponds to a division by sw k θβwk dk . Note that we here consider a removal of word token, not a removal of word type. Then we obtain an ‘old’ posterior after this division as follows: qw (θd) = Γ k(γdk − βwk) k Γ(γdk − βwk) k θγdk−βwk−1 dk = Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk , (5) where γ w dk ≡ γdk − βwk. By combining p(w|θd) = k θdkΦkw with qw (θd), we obtain a something similar to joint distribution as follows: ( k θdkφkw) · Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk . Therefore, by integrating θd out, we obtain a something similar to evidence as follows: k θdkφkw Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk dθd = k φkwγ w dk k γ w dk . Consequently, we obtain a new posterior as follows: ˜q(θd) = k γ w dk k φkwγ w dk k θdkφkw Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk = ˆγd w k φkwγ w dk k θdkφkw Γ( ˆγd w ) k Γ(γ w dk ) k θ γ w dk −1 dk , (6) where ˆγd w ≡ k γ w dk . 4 Based on Eq. (6), we calculate E˜q[θdk] and E˜q[θ2 dk]. E˜q[θdk] = θdk ˜q(θd)dθd = θdk ˆγd w k φkwγ w dk k θdkφkw Γ( ˆγd w ) k Γ(γ w dk ) k θ γ w dk −1 dk dθd = ˆγd w k φkwγ w dk · Γ( ˆγd w ) k Γ(γ w dk ) θdk k θdkφkw k θ γ w dk −1 dk dθd = ˆγd w k φkwγ w dk · Γ( ˆγd w ) k Γ(γ w dk ) θ2 dkφkw k θ γ w dk −1 dk dθd + k =k θdkθdk φk w k θ γ w dk −1 dk dθd = ˆγd w k φkwγ w dk · Γ( ˆγd w ) k Γ(γ w dk ) φkw θ2 dk k θ γ w dk −1 dk dθd + k =k φk w θdkθdk k θ γ w dk −1 dk dθd = ˆγd w k φkwγ w dk · Γ( ˆγd w ) k Γ(γ w dk ) φkw θ γ w dk +1 dk l=k θ γ w dl −1 dl dθd + k =k φk w θ γ w dk dk θ γ w dk dk l=k∧l=k θ γ w dl −1 dl dθd It should be noted that k Γ(γdk) Γ( k γdk) = k θγdk−1 dk dθd holds for any γd = (γd1, . . . , γdK). This is how we obtain the normalizing constant of Dirichlet distribution. Therefore, θ γ w dk +1 dk l=k θ γ w dl −1 dl dθd = 2
  • 3. Γ(γ w dk +2) l=k Γ(γ w dl ) Γ( ˆγd w+2) . Further, θ γ w dk dk θ γ w dk dk l=k∧l=k θ γ w dl −1 dl dθd = Γ(γ w dk +1)Γ(γ w dk +1) l=k∧l=k Γ(γ w dl ) Γ( ˆγd w+2) . With these results, we can continue the formula rearrangement as follows: ∴ E˜q[θdk] = ˆγd w k φkwγ w dk · Γ( ˆγd w ) k Γ(γ w dk ) φkw Γ(γ w dk + 2) l=k Γ(γ w dl ) Γ( ˆγd w + 2) + k =k φk w Γ(γ w dk + 1)Γ(γ w dk + 1) l=k∧l=k Γ(γ w dl ) Γ( ˆγd w + 2) = ˆγd w k φkwγ w dk φkw Γ( ˆγd w ) k Γ(γ w dk ) · Γ(γ w dk + 2) l=k Γ(γ w dl ) Γ( ˆγd w + 2) + k =k φk w Γ( ˆγd w ) k Γ(γ w dk ) · Γ(γ w dk + 1)Γ(γ w dk + 1) l=k∧l=k Γ(γ w dl ) Γ( ˆγd w + 2) = ˆγd w k φkwγ w dk φkw γ w dk (γ w dk + 1) ˆγd w ( ˆγd w + 1) + k =k φk w γ w dk γ w dk ˆγd w ( ˆγd w + 1) = ˆγd w k φkwγ w dk · γ w dk ˆγd w · φkw + k φkwγ w dk ˆγd w + 1 (7) E˜q[θ2 dk] = θ2 dk ˜q(θd)dθd = θ2 dk ˆγd w k φkwγ w dk k θdkφkw Γ( ˆγd w ) k Γ(γ w dk ) k θ γ w dk −1 dk dθd = ˆγd w k φkwγ w dk φkw γ w dk (γ w dk + 1)(γ w dk + 2) ˆγd w ( ˆγd w + 1)( ˆγd w + 2) + k =k φk w γ w dk (γ w dk + 1)γ w dk ˆγd w ( ˆγd w + 1)( ˆγd w + 2) = ˆγd w k φkwγ w dk · γ w dk ˆγd w · γ w dk + 1 ˆγd w + 1 · 2φkw + k φkwγ w dk ˆγd w + 2 . (8) We determine a Dirichlet distribution Dir(γd) so that the mean γdk ˆγd and the average second moment2 1 K k γdk(γdk+1) ˆγd(ˆγd+1) are equal to those of ˜q(θd), where ˆγd = k γdk. This procedure is called moment matching. To be precise, we obtain γd by solving the following equations: γdk ˆγd = E˜q[θdk] for each k, and (9) 1 K k γdk(γdk + 1) ˆγd(ˆγd + 1) = 1 K k E˜q[θ2 dk] . (10) 2http://www.ucl.ac.uk/statistics/research/pdfs/135.zip 3
  • 4. We can obtain γdk as below. γdk = E˜q[θdk]ˆγd k E˜q[θdk]ˆγd(E˜q[θdk]ˆγd + 1) ˆγd(ˆγd + 1) = k E˜q[θ2 dk] k E˜q[θdk](E˜q[θdk]ˆγd + 1) = k E˜q[θ2 dk](ˆγd + 1) k E˜q[θdk]2 ˆγd + k E˜q[θdk] = k E˜q[θ2 dk]ˆγd + k E˜q[θ2 dk] k E˜q[θdk]2 − E˜q[θ2 dk] ˆγd = k E˜q[θ2 dk] − E˜q[θdk] ∴ ˆγd = k(E˜q[θ2 dk] − E˜q[θdk]) k(E˜q[θdk]2 − E˜q[θ2 dk]) (11) ∴ γdk = k(E˜q[θ2 dk] − E˜q[θdk]) k(E˜q[θdk]2 − E˜q[θ2 dk]) · E˜q[θdk] (12) 5 We now have a Dirichlet distribution that approxiates the posterior ˜q(θd) in Eq. (6), i.e., ˜q(θd) = k γ w dk k φkwγ w dk k θdkφkw Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk . To obtain a density function of Dirichlet distribution from Eq. (6), we replace the term ( k θdkφkw) by tw(θd) = sw k θ βwk dk . We set the resulting function equal to the density function of Dir(γd) and obtain the following equation: k γ w dk k φkwγ w dk sw k θ βwk dk Γ( k γ w dk ) k Γ(γ w dk ) k θ γ w dk −1 dk = Γ( k γdk) k Γ(γdk) k θ γdk−1 dk . (13) Consequently, sw and βwk are determined as follows: βwk = γdk − γ w dk , (14) sw = k φkwγ w dk k γ w dk Γ( k γdk) k Γ(γdk) k Γ(γ w dk ) Γ( k γ w dk ) . (15) 4