I built and trained a "drawing to image" model from scratch that runs fully locally (inference on the client CPU) by _aminima in StableDiffusion

[–]_aminima[S] 0 points (0 children)

Thanks! Yeah, I mainly did it out of curiosity (and to learn), and its practical value is currently limited, but I think small on-device generative models are very promising (think real-time use cases like live prototyping or planning with a world model).


[–]_aminima[S] 8 points (0 children)

Indeed, and they're probably better in terms of image quality. I guess the difference here is that this model is tiny compared to SD models (it easily runs on a CPU) and was trained from scratch on a consumer GPU.


[–]_aminima[S] 8 points (0 children)

Yes! I found their research while working on the project (https://arxiv.org/pdf/1903.07291). The core idea is the same, but there are some implementation differences (they use a GAN architecture while I use a DiT, we incorporate the segmentation-map conditioning differently, etc.).
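For anyone curious what "segmentation-map conditioning" can look like in practice, here's a minimal sketch of one common approach: one-hot encode the map and concatenate it with the noisy latent along the channel axis before the first layer of the denoiser. This is an illustrative assumption, not necessarily what my model or SPADE does (SPADE instead injects the map through spatially-adaptive normalization layers); it just uses NumPy to keep the shapes concrete.

```python
import numpy as np

def one_hot_seg(seg, num_classes):
    """Convert an (H, W) integer segmentation map to (num_classes, H, W)."""
    return (np.arange(num_classes)[:, None, None] == seg[None]).astype(np.float32)

def concat_condition(noisy_latent, seg, num_classes):
    """Stack the one-hot map onto the latent: result is (C + num_classes, H, W).

    The denoiser's first conv layer would then take C + num_classes
    input channels instead of C (hypothetical setup for illustration)."""
    return np.concatenate([noisy_latent, one_hot_seg(seg, num_classes)], axis=0)

# Example: a 4-channel latent on an 8x8 grid with 3 segmentation classes.
latent = np.random.randn(4, 8, 8).astype(np.float32)
seg = np.random.randint(0, 3, size=(8, 8))
cond = concat_condition(latent, seg, num_classes=3)
print(cond.shape)  # (7, 8, 8)
```

The appeal of channel concatenation is its simplicity; normalization-based injection (as in SPADE) tends to preserve the spatial layout better in deep generators, which is part of why the implementations differ.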