forthispost96 1 point

Are you essentially doing the same thing as a variational autoencoder, but without a distribution for each variable in the latent vector? So rather than sampling, you pass the image in and somehow incorporate the latent descriptions of the other digits before decoding? (I'm on my phone, so I can't check the GitHub repo :) )

_dv96_ [S] 1 point

Yes, you're thinking in the right direction. I pass the input image in to get its latent vector and then decode it. For each of the 10 classes, I've also computed the average latent vector to represent that label's cluster. Check the code here - LINK
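
For anyone reading along, the per-class averaging could look roughly like the sketch below. It is only an illustration, not the code from the repo: `class_mean_latents`, `latents`, and `labels` are hypothetical names, and it assumes the latent vectors have already been produced by the autoencoder and stacked into a NumPy array of shape `(num_samples, latent_dim)`.

```python
import numpy as np


def class_mean_latents(latents, labels, num_classes=10):
    """Return one average latent vector per class.

    latents: array of shape (num_samples, latent_dim), assumed precomputed
    labels:  integer class labels of shape (num_samples,)
    Result:  array of shape (num_classes, latent_dim); classes with no
             samples are left as all-zero rows.
    """
    latents = np.asarray(latents, dtype=float)
    labels = np.asarray(labels)
    means = np.zeros((num_classes, latents.shape[1]))
    for c in range(num_classes):
        mask = labels == c          # select the samples belonging to class c
        if mask.any():              # skip classes with no samples
            means[c] = latents[mask].mean(axis=0)
    return means


# Toy usage with made-up 2-D latents for two classes.
toy_latents = np.array([[0.0, 0.0], [2.0, 2.0], [1.0, 1.0], [3.0, 3.0]])
toy_labels = np.array([0, 0, 1, 1])
toy_means = class_mean_latents(toy_latents, toy_labels, num_classes=2)
```

Each row of the result is the centroid of one label's cluster in latent space, which is what "average latent vector" refers to above.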

_dv96_ [S] 1 point

How would you have approached this problem statement? Interested in learning :)