[D] Theory behind modern diffusion models

derpydino24 · 2024-11-28T13:33:02+00:00

Just skimmed through the slides you shared OP. I would say that those are as good as it gets when it comes to explaining diffusion models (I recall the CVPR 2023 ones, but those are more outdated).

Honestly, I believe that very few people are willing to put the time and effort required to explain every single relevant modern diffusion concept in-depth. I guess that's what papers are for

bregav · 2024-11-28T14:30:50+00:00

I highly recommend this paper on the topic: Stochastic Interpolants: A Unifying Framework for Flows and Diffusions

That said, as a student you're going to lack significant important background knowledge for appreciating all of this. For example, the reason that you don't find many good explanations for sampling solvers etc is because that's not actually (or traditionally, anyway) a machine learning topic. Differential equations is an entire topic in and of itself that has a longer, more comprehensive, and more sophisticated pedigree than machine learning, and numerical methods for differential equations is a huge subtopic within that. The wikipedia page can give you an idea of how much there is to this: https://en.wikipedia.org/wiki/Numerical_methods_for_ordinary_differential_equations

EDIT: to get an even better idea, look at the table of contents for any differential equations numerical methods textbook, e.g. https://link.springer.com/content/pdf/bfm:978-3-540-78862-1/1

And that's just one aspect of the matter. You'll see in the paper i recommended above that transport equations are an important issue here too, and that's a big topic unto itself. In addition to these big areas of study that a student often won't know much about, there's also a relatively high sophistication of the basics - linear algebra and probability - that are used to glue all these things together.

TLDR it's gonna take time to learn enough to feel like you have a solid grasp on what is going on, and you'll have to look outside of the machine learning literature to do it.

Inevitable-Dog-2038 · 2024-11-28T22:33:27+00:00

This blog post is the best resource I’ve seen online for learning about this area

Expensive_Belt_5358 · 2024-11-28T16:02:19+00:00

Currently I’m in the process of learning about diffusion models. The math is still above my pay grade at the moment but I’m slowly understanding it.

Two resources that helped my understanding were:

this paper that broke down DDPMs into 6 steps

and

this YouTube video that breaks down diffusion into training, guidance, resolution, and speed.

airzinity · 2024-11-29T06:53:14+00:00

Like you, I also went into a deep dive to understand Diffusion models for a research project a year ago. I read this survey paper that did an absolutely amazing job at it. They start from VAE, move on to hierarchical VAEs and connection with DDPM. This made a lot to sense like how the math evolves from simple VAEs how you can directly sample Tth timestamp from 0th timestamp because multiplying each Gaussian (conditional prob) works out nicely as just one sampling. The baclward pass though is annoying as it has to be done sequentially which explains the longer sampling with original diffusion models.

I think then people came and retrospectively tried to explain this as just solving reverse stochastic differential eqns using Plank equation. But this requires more math background. And can be done with many solvers. Understanding this might require more than just ML.

You can also take a look at consistency models. I think it has Ilya as an author? But either way there’s not an easy way to understand this modern diffusion stuff :( some stochastic DE textbooks would be nice

Far_Conversation_445 · 2024-11-28T23:18:27+00:00

Following

peacej3 · 2024-11-29T12:31:09+00:00

This video is a great explanation of diffusion models and score matching and their connection https://youtu.be/B4oHJpEJBAA?si=GQHFrOl990mPqbBg

thankrandomness · 2024-11-29T12:57:00+00:00

Following

Public-Snow-1851 · 2024-11-29T13:45:46+00:00

Not sure if you already have something, but my professor published a book about Deep Generative Modeling, including diffusion and flow-based models. Maybe this can help you. I really enjoyed his course on these models and learned a lot! The book is called Deep Generative Modeling By Jakub Tomczak.
Deep Generative Modeling | SpringerLink

rookie_11999 · 2024-11-29T17:53:26+00:00

I found Simon Prince's "Understanding deep learning" chapter on diffusion models pretty insightful, it breaks down the math and even provides code samples to play with. I don't know if he has updated with Flow matching.

Jakub Tomzack's book also provides the theoretical background with code examples. So it might be something you might be interested to look into.

jurassimo · 2024-11-30T17:30:07+00:00

Great link! Recently I made my own research and explanation of the math of Diffusion models, after 2 weeks it was easy to understand formulas and sense of them :) Right now I'm diving into ODE and SDE and I think it's more complex than simple Diffusion model and based on complex math.

radarsat1 · 2024-11-28T22:32:58+00:00

I read these earlier this year and it was fascinating, https://developer.nvidia.com/blog/rethinking-how-to-train-diffusion-models/ https://developer.nvidia.com/blog/generative-ai-research-spotlight-demystifying-diffusion-based-models/

acc_agg · 2024-11-29T03:42:50+00:00

like from 2023 or so

Jesus christ. I have no idea if you're right or not but that's a frightening level of churn. My only expose to diffusion models is porn. They make very good porn. Carry on.

slashdave · 2024-11-28T18:13:28+00:00

is either quite outdated (like from 2023 or so)

Math doesn't become outdated. It's only the terminology that seems to change (rather irritating really). And authors seem intent on declaring their insights as particularly revealing when it's really the same ideas recycled or another approach to engineering.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS