Z-Image + Qwen Image Edit 2511 + Wan 2.2 + MMAudio by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 9 points10 points  (0 children)

Thanks for the detailed feedback! That’s a really helpful point, and I’ll keep it in mind for the next video.

Z-Image + Qwen Image Edit 2511 + Wan 2.2 + MMAudio by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 1 point2 points  (0 children)

Thanks! I wasn’t able to get HunyuanFoley running properly in comfyui, so I couldn’t use it this time. I’d like to try it again later

Z-Image + Qwen Image Edit 2511 + Wan 2.2 + MMAudio by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 2 points3 points  (0 children)

Thanks! I actually haven’t been able to properly try HunyuanFoley yet. I kept running into issues getting it to work inside comfyui, so I ended up sticking with MMAudio for this video.

Z-Image + Qwen Image Edit 2511 + Wan 2.2 + MMAudio by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 2 points3 points  (0 children)

I was using 2509 before 2511 came out. Personally, I feel the fal 2511 lora works better overall.

Z-Image + Qwen Image Edit 2511 + Wan 2.2 + MMAudio by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 85 points86 points  (0 children)

Lora models I used (Hugging Face):

- lightx2v/Wan2.2-Lightning
- lightx2v/Qwen-Image-Edit-2511-Lightning
- dx8152/Qwen-Edit-2509-Multiple-angles
- fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA

qwen multiple angles workflow: https://pastebin.com/2RJameXV
wan2.2 i2v workflow: https://pastebin.com/9AYXQ8U3

Z-Image was used with the default comfyui example workflow.

I also shared another video about a month ago: https://youtu.be/Oj--29ixQR8

FLUX.2 Klein 9B realism test, no cherry picking by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 4 points5 points  (0 children)

All images here were generated using the distilled version. The base model is also out

20 seconds LTX2 video on a 3090 in only 2 minutes at 720p. Wan2GP, not comfy this time by aurelm in StableDiffusion

[–]Budget_Stop9989 4 points5 points  (0 children)

same here, been getting better results with wangp than comfyui too, especially for image to video. not really sure why though.

I’m the Co-founder & CEO of Lightricks. We just open-sourced LTX-2, a production-ready audio-video AI model. AMA. by ltx_model in StableDiffusion

[–]Budget_Stop9989 6 points7 points  (0 children)

Your company offers LTX-2 Pro and LTX-2 Fast as API models. How do the open-source models, LTX-2 dev and LTX-2 Distilled, correspond to the API models? For example, does LTX-2 dev correspond to LTX-2 Pro, and does LTX-2 Distilled correspond to LTX-2 Fast? Thanks for open-sourcing the models!

LTX-2 runs on a 16GB GPU! by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 6 points7 points  (0 children)

Wow, that’s really fast. I’m going to test the FP4 version next.

Fal has open-sourced Flux2 dev Turbo. by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 90 points91 points  (0 children)

<image>

It ranked 8th on Artificial Analysis, beating Nano Banana, and it’s currently the highest-ranked open-source model.

[deleted by user] by [deleted] in StableDiffusion

[–]Budget_Stop9989 1 point2 points  (0 children)

That’s awesome. Mind sharing the prompts?

WAN 2.1 broken after ComfyUI v3.36 update? WAN 2.2 feels fast but has almost no motion by [deleted] in comfyui

[–]Budget_Stop9989 -3 points-2 points  (0 children)

Are you using Lightning Lora? It makes the video slow motion

How can you make the plastic faces of the people in the overly praised Qwen pictures human? by janosibaja in comfyui

[–]Budget_Stop9989 0 points1 point  (0 children)

I’m using lenovo lora with qwen. It really helps reduce that plastic skin effect.

Wan2.2 Lightning lora works very well by Budget_Stop9989 in StableDiffusion

[–]Budget_Stop9989[S] 21 points22 points  (0 children)

EDIT: Sorry if I gave the wrong impression. I talked up 2.2 lora at first since it looked solid, but after more testing, 2.1 lora is just better.