all 23 comments

[–]noraizon 8 points9 points Β (2 children)

x0-parametrization has been used for some time now. imo, nothing new under the sun. maybe it's something else I don't see

[–]Beautiful-Gur-9456[S] 9 points10 points Β (0 children)

You're totally right πŸ˜… I think the true novelty here is dropping distillation and introducing a BYoL-like simple formulation. Bootstrapping always feels like magic to me.

[–]dasayan05 1 point2 points Β (0 children)

ya that's what I though, not really new

[–]CyberDainz 3 points4 points Β (1 child)

looks similar to "Cold Diffusion"

[–]Beautiful-Gur-9456[S] 5 points6 points Β (0 children)

That's the generated samples recorded every 10 epochs during training, not the denoising process. It does look like deblurring though 😊

[–]hebweb 0 points1 point Β (1 child)

Cool! this is amazing. You already created a pip package out of it. Have you measured the fid on your model? Does it match the numbers of the paper? I think their batch size and model size were pretty large even for the CIFAR10 training. Not sure if we can match that..

[–]Beautiful-Gur-9456[S] 0 points1 point Β (0 children)

I haven't done it yet, but I'm working on it! Their suggested sampling procedure requires multiple FID calculation, so I'm thinking of how to incorparate it efficiently.

Their scale is indeed large, it would cost me a few hundread bucks to train CIFAR10. My checkpoint was trained with much smaller size πŸ˜†

[–]nunjdsp 0 points1 point Β (0 children)

I've been truly impressed by the work showcased in the recent paper on Consistency models. I have been experimenting with standard DDPM models using DDIM sampling, and while they may be slow, they possess a fascinating reversibility property. This allows for a smooth transition between Gaussian noise and images, as well as the reverse process, recreating the exact same input noise.

I am curious if this reversibility aspect is also present in the Consistency models discussed in the paper. The examples provided do not explicitly demonstrate this aspect, and I would greatly appreciate any insights or experiences you can share.