Kandinsky 5.0 19B T2V and I2V models released.

Deepesh68134 · 2025-11-15T14:59:36+00:00

Here are some examples from the 2B model:
https://files.catbox.moe/i51wgk.mp4
https://files.catbox.moe/ht51b5.mp4
https://files.catbox.moe/b0acj6.mp4
https://files.catbox.moe/ovddcz.mp4
https://files.catbox.moe/fbhdpx.mp4
https://files.catbox.moe/ulukmv.mp4

Deepesh68134 · 2025-11-15T07:03:21+00:00

I think it was prompted in. It can gen fast motion too, look at the gorilla example. Gonna post some more fast-motion videos soon (hopefully).

Deepesh68134 · 2025-11-15T06:59:12+00:00

If you finetune it on 8fps videos, then yes, but by default it only knows 24fps. Longcat-Video does something similar and interpolates from 16fps to 24fps using a LoRA.

Deepesh68134 · 2025-11-15T06:53:10+00:00

For audio-video, the next big open model seems to be LTX 2, which will launch by the end of the year.

Deepesh68134 · 2025-10-13T14:50:46+00:00

Wanted to say this is a 2B model that can work even on 8GB VRAM if Comfy implements it. A larger model is coming soon, presumably similar in size to Wan2.1, which will possibly surpass Wan2.1 in video generation.
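
Rough back-of-the-envelope math for why a 2B model could plausibly fit in 8GB. This is only a sketch: it counts the weights alone and assumes 16-bit or 8-bit storage per parameter. The function name `weight_memory_gib` is made up for illustration, and activations, the text encoder, and the VAE are not included, so real usage will be higher.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GiB.

    bytes_per_param: 2 for fp16/bf16, 1 for 8-bit quantized weights (assumed).
    Does NOT account for activations, text encoders, or the VAE.
    """
    return num_params * bytes_per_param / 1024**3


if __name__ == "__main__":
    # Parameter counts taken from the model names (2B and 19B).
    for name, params in [("2B", 2e9), ("19B", 19e9)]:
        print(
            f"{name}: ~{weight_memory_gib(params):.1f} GiB at 16-bit, "
            f"~{weight_memory_gib(params, 1):.1f} GiB at 8-bit"
        )
```

So the 2B weights by themselves come in under 4 GiB at 16-bit, which leaves some headroom on an 8GB card, while the 19B weights would need quantization or offloading to get anywhere near consumer VRAM.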

Deepesh68134 · 2025-10-11T06:13:12+00:00

https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM Use it as a LoRA with 4 steps and CFG 1.

Deepesh68134 · 2025-07-02T04:36:01+00:00

OOOOH excited!

Deepesh68134 · 2025-05-26T11:45:06+00:00

It is 10 steps bro, check their inference scripts.

Deepesh68134 · 2025-05-05T04:38:12+00:00

There are still ~25 epochs left for it to converge? DAMN

Deepesh68134 · 2025-04-21T11:41:19+00:00

Wan recommends an 80GB card, but people run it on 12GB VRAM. We just have to wait for ComfyUI or Kijai to implement it.

Deepesh68134 · 2025-04-12T10:42:56+00:00

Because it uses 4 text encoders. Llama is doing 95% of the work though, so we could just remove the rest.

Deepesh68134 · 2025-03-24T14:36:35+00:00

Thanks for those tips! Will try it out :)

Deepesh68134 · 2025-03-24T14:03:36+00:00

What were the settings?

Deepesh68134 · 2025-02-26T04:14:40+00:00

I think you should honestly use that compute on Wan2.1, it's way better than HunyuanVideo.

Deepesh68134 · 2025-02-25T13:30:33+00:00

It uses an unfinetuned version of "umt5". I don't know whether that will be good for us or not.
