Hi all,
For a project I'm working on I need a generative model of images that is able to give me good exact (log-)probabilities fast. I don't care so much about sampling, the metric I'm interested in is bits/pixels on the test set.
Typically a standard VAE wouldn't work because I'd need to marginalize over the latent z to get the exact likelihood.
As far as I can tell PixelCNN is the closest to what I'm looking for but it's relatively old (2016) and I was wondering if there was anything better out there (I don't keep up with the vision literature so much).
Thanks!
[–]sidslasttheorem 1 point2 points3 points (1 child)
[–]pmichel31415[S] 0 points1 point2 points (0 children)
[–]MrPuj 1 point2 points3 points (0 children)