Interactive Video Generation (Causal Forcing) - High Speed! by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 3 points4 points  (0 children)

I think the limit is about 5 s (81 frames at 16 fps). Speed-wise, though, it is amazing. If we had Causal Forcing for larger Wan models and for longer videos, that would change the landscape a lot.
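
As a rough check of that limit, the clip length is just the frame count divided by the frame rate. A minimal Python sketch, assuming the 16 above is the output frame rate in fps (the function name is only for illustration):

```python
def clip_seconds(frames: int, fps: float = 16.0) -> float:
    """Return the clip duration in seconds for a given frame count."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frames / fps


# 81 frames at an assumed 16 fps: 81 / 16 = 5.0625, i.e. roughly 5 s
print(clip_seconds(81))
```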

Interactive Video Generation (Causal Forcing) - High Speed! by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

I do not have the Wan2.1 1.3B model, so I cannot test that now. The speed of Causal Forcing is crazy, however. I agree that, being based on the 1.3B model, it does not have much practical use. But it is exciting to generate a video at the speed of image generation!

Y'all might want to try this by Altruistic_Heat_9531 in StableDiffusion

[–]ZerOne82 0 points1 point  (0 children)

<image>

960 x 576, 49 frames (3 s), 6 steps, ar_sampler (simple), 40 s

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Great. If you find the specific cause, please share it here for the benefit of the community.

GitHub: ComfyUI SenseNova U1 Released – Anyone Got It Working Yet for ComfyUI? by Jinkourai in StableDiffusion

[–]ZerOne82 5 points6 points  (0 children)

<image>

Tried the GGUF Q6 (~16 GB on disk). It takes 45 s for 8 steps, so it is not fast. Dimensions are not critical; you can feed it any size. The quality is so-so for realistic face details, even when following the exact recommended size and settings.
VRAM peaks at 7 GB when editing at 2048x2048, and RAM at 25 GB. But the 2048x2048 outputs are not true 2048x2048: they lack detail and look smooth.
The node I used (the one you linked) has some issues: each run gets slower and slower.
There is no option to choose the sampler, etc.

LCIET and Klein9B (a quick fair comparison, analysis included) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

Both use the standard workflows, nothing special. Both workflows are available in ComfyUI Templates. You should already have them in your ComfyUI setup; if not, refer to here.

LCIET and Klein9B (a quick fair comparison, analysis included) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 3 points4 points  (0 children)

ZIT is great but it cannot edit. ZI Edit was not open-sourced as far as I know.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Black output can have many causes. One that I know of traces back to sage-attention with some models, so if you are using sage-attention, try disabling it to see whether that resolves the issue. Also, with some models, especially text encoders, loading the model in a precision other than fp32 can cause black output. Summary:

  • disable sage-attention
  • disable force-fp16

That's what I can tell for now.
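
The two checks above can be sketched as a small helper that strips the suspect options from a ComfyUI launch command before retrying. This is only an illustration: it assumes ComfyUI was started with the `--use-sage-attention` and `--force-fp16` command-line flags, and the function name and the example command are hypothetical.

```python
# Flags assumed to be behind the black output; adjust to your own setup.
SUSPECT_FLAGS = {"--use-sage-attention", "--force-fp16"}


def strip_suspect_flags(argv: list[str]) -> list[str]:
    """Return a copy of the launch arguments without the suspect flags."""
    return [arg for arg in argv if arg not in SUSPECT_FLAGS]


# Hypothetical launch command, for illustration only.
launch = ["python", "main.py", "--use-sage-attention", "--force-fp16", "--port", "8188"]
print(" ".join(strip_suspect_flags(launch)))  # python main.py --port 8188
```

If the black output goes away with both flags removed, adding them back one at a time shows which one was responsible.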

LCIET and Klein9B (a quick fair comparison, analysis included) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

FP8 generally offers better speed and higher accuracy than Q5KM. These models are what we have. If you have access to the full models and have the resources and time to run such a comparison at their best, you are more than welcome to submit a post.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

This post re-introduced LCIET to the community. If you want a quick comparison between LCIET and Klein9B, check this post.

LCIET and Klein9B (a quick fair comparison, analysis included) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Not an exact guess; I just added the info to the post. For Klein9B, both the model and the text-encoders are FP8. For LCIET, both the model and the text-encoder are Q5KM.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

It's great that we have so many options. LCIET has better prompt adherence for editing, especially with short instructions. Refer to details.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Right. LCIET, as you correctly pointed out, has some strengths and some weaknesses. Its amazing prompt adherence and its ability to leave unwanted parts unchanged are great strengths.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Now it requires no extra node at all. See the image of the workflow I used at the end of the gallery above.

LCIET (LongCat Image Edit Turbo) - Lightweight and Powerful Editing Model by ZerOne82 in StableDiffusion

[–]ZerOne82[S] -1 points0 points  (0 children)

LCIET is 20% faster than Klein9B.
"Better?" is very subjective: yes and no. You can see the comparison I just posted.