Exercise 21.9 - Variational EM for binary FA with sigmoid link

Answers

We begin with the likelihood:

p (𝐙, 𝐗 | 𝐖) = \prod_{n = 1}^{N} \prod_{j = 1}^{D} sigm {(𝐰_{j}^{T} 𝐳_{n})}^{x_{𝑛𝑗}} {(1 - sigm (𝐰_{j}^{T} 𝐳_{n}))}^{(1 - x_{𝑛𝑗})} .

The prior for the hidden variables is assumed to be:

\forall n, p (𝐳_{n}) = 𝒩 (0, 𝐈) .

Assume the factorized variational distribution:

p (𝐖, 𝐙 | 𝐗) \approx q (𝐖) \prod_{n = 1}^{N} q (𝐳_{n}) .

For the variational E-step, our goal is to match the logarithm of the variational distribution on the hidden variables:

\log q (𝐳),

with:

\begin{aligned} 𝔼_{q (𝐖)} [\log p (𝐙, 𝐗, 𝐖)] & = 𝔼_{q (𝐖)} [\sum_{n} \sum_{j} x_{𝑛𝑗} \log sigm (𝐰_{j}^{T} 𝐳_{n}) + (1 - x_{𝑛𝑗}) \log (1 - sigm (𝐰_{j}^{T} 𝐳_{n}))] \\ = \sum_{n, j} 𝔼_{q (𝐖)} [x_{𝑛𝑗} \log \frac{sigm (𝐰_{j}^{T} 𝐳_{n})}{1 - sigm (𝐰_{j}^{T} 𝐳_{n})} + \log (1 - sigm (𝐰_{j}^{T} 𝐳_{n}))] \\ = \sum_{n, j} 𝔼_{q (𝐖)} [x_{𝑛𝑗} \cdot 𝐰_{j}^{T} 𝐳_{n} + \log (1 - sigm (𝐰_{j}^{T} 𝐳_{n}))] . \end{aligned}

We can see that this form cannot painlessly reduce to an exponential family, hence approximation needs to be conducted to transfer $\log (1 - - sigm (𝐰_{j}^{T} 𝐳_{n}))$ to a linear function of $𝐳_{n}$ and optinally $𝐳_{n}^{T} 𝐳_{n}$ (e.g., the Laplace approximation). Then we can see that $𝔼_{q (𝐖)} [\log p (𝐙, 𝐗, 𝐖)]$ is a quadratic function in $𝐳_{n}$ , hence the E-step reduces $q (𝐳)$ to a Gaussian.

For the variational M-step:

𝔼_{q (𝐙)} [\log p (𝐙, 𝐗, 𝐖)]

can again be approximated as a quadratic function w.r.t. $𝐖$ , where the expectation of $𝐳$ shall be replaced by their counterpart in the E-step.

solour_lfq

2021-03-24 13:42

Exercise 21.9 - Variational EM for binary FA with sigmoid link

Answers

Comments

Add answer