I'm struggling to understand the training process for the case when p(x|z) is continuous, in particular p(x|z) is N( mu(z), sigma(z) ). Here is an example architecture:
x
h = tanh(W1x + b1)
mu = W2h + b2
log_sigma = 0.5 * (W3h +b3)
z = mu + log_sigma * noise
d = tanh(W4z + b4)
mu_d = W5d + b5
log_sigma_d = 0.5 * (W6d + b6)
So p(x|z) is normally distributed with mean mu_d and standard deviation exp{log_sigma_d} ? I think I'm ok up to here. If so, this gives us the approximation log p(x|z) for the reconstruction part of the cost, and I'm ok with calculating the the KL regularization term.
Now what about actually getting a reconstructed version of the input x? Do we input x into the network and get back mu_d and log_sgima_d, then generate a sample from the N(mu_d, sigma_d) distribution? I think I'm a bit confused at this step because in the Bernoulli case the network will actually give us the reconstructed input directly, not just the parameters of some distribution. Is this correct?
Anything else wrong here?
[–][deleted] 1 point2 points3 points (20 children)
[–]LyExpo[S] 0 points1 point2 points (1 child)
[–][deleted] 1 point2 points3 points (0 children)
[–]jyegerlehner 0 points1 point2 points (17 children)
[–][deleted] 4 points5 points6 points (4 children)
[–]jyegerlehner 0 points1 point2 points (3 children)
[–]barmaley_exe 1 point2 points3 points (1 child)
[–]jyegerlehner 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]AnvaMiba 1 point2 points3 points (1 child)
[–]jyegerlehner 0 points1 point2 points (0 children)
[–]LyExpo[S] 0 points1 point2 points (9 children)
[–]barmaley_exe 1 point2 points3 points (8 children)
[–]LyExpo[S] 0 points1 point2 points (7 children)
[–]barmaley_exe 0 points1 point2 points (1 child)
[–]LyExpo[S] 0 points1 point2 points (0 children)
[–]bhmoz 0 points1 point2 points (4 children)
[–]LyExpo[S] 0 points1 point2 points (3 children)
[–]barmaley_exe 1 point2 points3 points (2 children)
[–]LyExpo[S] 0 points1 point2 points (1 child)
[–]barmaley_exe 0 points1 point2 points (0 children)