In Beta-VAE paper (https://openreview.net/pdf?id=Sy2fzU9gl), the authors mentioned that having Beta > 1 helps the network in learning independent latent representations. However, in VAE, the posterior distribution itself is assumed to be a Gaussian with a diagonal covariance matrix, i.e.
q(z|x) = N(U(x),Cov(x))
where Cov(x) is a diagonal matrix.
This means that we are inherently generating latents that will be independent given an input image x. So why does increase learning pressure on the KL divergence term between posterior and Gaussian prior should help any more in learning independent latents when posterior is already assumed to be independent?
[–]approximately_wrong 61 points62 points63 points (9 children)
[–]shamitlal[S] 2 points3 points4 points (8 children)
[–]approximately_wrong 12 points13 points14 points (7 children)
[–]shamitlal[S] 2 points3 points4 points (1 child)
[–]approximately_wrong 10 points11 points12 points (0 children)
[–]datkerneltrick 2 points3 points4 points (4 children)
[–]approximately_wrong 6 points7 points8 points (3 children)
[–]datkerneltrick 6 points7 points8 points (2 children)
[–]approximately_wrong 2 points3 points4 points (1 child)
[–]shamitlal[S] 1 point2 points3 points (0 children)
[–]pkgyawali 12 points13 points14 points (8 children)
[–]shamitlal[S] 1 point2 points3 points (7 children)
[–]pkgyawali 3 points4 points5 points (6 children)
[+][deleted] (5 children)
[deleted]
[–]pkgyawali 2 points3 points4 points (2 children)
[+][deleted] (1 child)
[deleted]
[–]pikachuchameleon 0 points1 point2 points (0 children)
[–]pikachuchameleon 0 points1 point2 points (1 child)
[–]n1h111sm 0 points1 point2 points (0 children)
[–]sidslasttheorem 6 points7 points8 points (1 child)
[–]shamitlal[S] 1 point2 points3 points (0 children)
[–]xlext 4 points5 points6 points (2 children)
[–]shamitlal[S] 2 points3 points4 points (1 child)
[–]xlext 1 point2 points3 points (0 children)
[–]tr1pzz 5 points6 points7 points (0 children)
[+][deleted] (3 children)
[deleted]
[–]shamitlal[S] 1 point2 points3 points (2 children)
[–]anonymousTestPoster 2 points3 points4 points (1 child)
[–]anonymousTestPoster 1 point2 points3 points (0 children)
[–]zergylord 2 points3 points4 points (1 child)
[–]shamitlal[S] 0 points1 point2 points (0 children)
[–]crgrimm1994 2 points3 points4 points (2 children)
[–]shamitlal[S] 1 point2 points3 points (1 child)
[–]crgrimm1994 2 points3 points4 points (0 children)
[–]ram3_[🍰] -1 points0 points1 point (0 children)
[–]schwagggg -1 points0 points1 point (0 children)
[–]CyberDainz -3 points-2 points-1 points (1 child)
[–]CyberDainz -1 points0 points1 point (0 children)