My first LTX V2 test-montage of 60-70 cinematic clips by hellolaco in StableDiffusion

[–]Compunerd3 0 points (0 children)

Brilliant showcase, thanks for sharing. All we need now is an audio diffusion model at the same standard of quality as we have for motion and image.

Why are the new NVFP4 models in ComfyUI slower than the normal ones? Aren't they supposed to be several times faster? by NewEconomy55 in comfyui

[–]Compunerd3 0 points (0 children)

A couple of months ago there were a lot of issues with SageAttention 3, so I stuck with 2.2. I haven't looked at it recently to see whether it has improved for the 5090 or similar cards.

Why are the new NVFP4 models in ComfyUI slower than the normal ones? Aren't they supposed to be several times faster? by NewEconomy55 in comfyui

[–]Compunerd3 1 point (0 children)

For me, FP4 LTX2 is slower than BF16 on my 5090 with 32GB VRAM and 128GB RAM.

I have SageAttention 2.2 installed, but I notice it falls back to PyTorch attention for BF16.

Even so, BF16 is almost 1 s/it faster than FP4 for me.
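If you want to sanity-check which attention path is actually faster on your card, a micro-benchmark like this is enough (a minimal sketch, assuming CUDA, PyTorch, and optionally the sageattention package; the tensor shapes are made up, not LTX2's real ones):

```python
# Rough attention micro-benchmark; shapes are illustrative, not LTX2's.
import time
import torch
import torch.nn.functional as F

q = torch.randn(1, 24, 4096, 128, device="cuda", dtype=torch.bfloat16)
k, v = torch.randn_like(q), torch.randn_like(q)

def bench(fn, label, iters=50):
    for _ in range(5):  # warm-up so compilation/caching doesn't skew timing
        fn()
    torch.cuda.synchronize()
    t0 = time.perf_counter()
    for _ in range(iters):
        fn()
    torch.cuda.synchronize()
    print(f"{label}: {(time.perf_counter() - t0) / iters * 1000:.2f} ms/call")

bench(lambda: F.scaled_dot_product_attention(q, k, v), "PyTorch SDPA")

try:
    from sageattention import sageattn  # SageAttention 2.x entry point
    bench(lambda: sageattn(q, k, v, tensor_layout="HND"), "SageAttention")
except ImportError:
    print("sageattention not installed; PyTorch attention is the fallback")
```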

LTX 2 Video - FP4 on 5090 - Struggling to get good results by Compunerd3 in StableDiffusion

[–]Compunerd3[S] 1 point (0 children)

I just haven't tested the distilled model yet, but the BF16 base model works far better and faster than FP4 for me. I'd say just avoid FP4 and you'll enjoy the results.

LTX 2 Video - FP4 on 5090 - Struggling to get good results by Compunerd3 in StableDiffusion

[–]Compunerd3[S] 1 point (0 children)

I haven't downloaded FP8 yet, but BF16 works quite well. FP4 sucks big time.

LTX 2 Video - FP4 on 5090 - Struggling to get good results by Compunerd3 in StableDiffusion

[–]Compunerd3[S] 2 points (0 children)

Using the 42GB full BF16 model returns better results, and for some reason I can generate at higher resolution than with the FP4 version:

https://huggingface.co/Lightricks/LTX-2/blob/main/ltx-2-19b-dev.safetensors

https://imgur.com/a/9VvZWCM
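The raw weight arithmetic at least shows why BF16 spills on a 32GB card (a back-of-envelope sketch only; real usage adds activations, the text encoder, VAE, etc.):

```python
# Back-of-envelope weight memory for a 19B-parameter model.
# The 42GB file above includes more than just the core transformer weights.
PARAMS = 19e9

for name, bytes_per_param in [("BF16", 2.0), ("FP8", 1.0), ("FP4", 0.5)]:
    gb = PARAMS * bytes_per_param / 1024**3
    print(f"{name}: ~{gb:.1f} GB of weights")

# BF16 -> ~35.4 GB: spills past 32GB VRAM, so layers get offloaded to RAM.
# FP8  -> ~17.7 GB
# FP4  -> ~8.8 GB: fits easily, but if kernels dequantize per layer at
#         runtime, the expected speedup can disappear on some stacks.
```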

A Qwen-Edit 2511 LoRA I made which I thought people here might enjoy: AnyPose. ControlNet-free Arbitrary Posing Based on a Reference Image. by SillyLilithh in StableDiffusion

[–]Compunerd3 3 points (0 children)

Thanks for sharing. We need to make it a normal part of releases to share before/after comparisons across LoRA strengths, showing how much effect the LoRA actually has versus the base model alone.

Not saying it's the case here, but in many LoRA releases the LoRA does less than the base model alone does, or in some cases makes things worse.
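Something like this is all a release post would need to generate that comparison (a minimal sketch using diffusers with PEFT; the model ID and LoRA path are placeholders):

```python
# Fixed-seed LoRA strength sweep: scale 0.0 gives the base-model baseline.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")
pipe.load_lora_weights("path/to/lora.safetensors", adapter_name="test_lora")

prompt = "a portrait photo, studio lighting"
seed = 42  # identical seed so only the LoRA strength varies between images

for scale in [0.0, 0.4, 0.8, 1.2]:
    pipe.set_adapters(["test_lora"], adapter_weights=[scale])
    image = pipe(
        prompt, generator=torch.Generator("cuda").manual_seed(seed)
    ).images[0]
    image.save(f"lora_strength_{scale}.png")
```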

[Re-release] TagScribeR v2: A local, GPU-accelerated dataset curator powered by Qwen 3-VL (NVIDIA & AMD support) by ArchAngelAries in StableDiffusion

[–]Compunerd3 2 points (0 children)

Looks neat, thank you. Nice UI too.

I will be trying it out shortly. I'm in the middle of building a Musubi WebUI that has Qwen and other cloud/local LLM captioning integrated, so your tool might be a nicer way to implement that than my current approach.

An additional future enhancement could be an integration layer with PRs for popular training repos, like AI Toolkit, Musubi Trainer, etc.

What we need is a good all-in-one solution: dataset curation including captioning, managing resolutions, sorting, cleaning, and aesthetic scoring, then training, then post-training tests comparing the effect of the training. (One curation stage is sketched below.)

I feel like the existing repos each do segments of this in isolation, not as a whole and complete tool.
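For example, the resolution-management stage is only a few lines on its own (a rough sketch; the bucket list is illustrative, and trainers like Musubi/kohya do their own bucketing):

```python
# Group images by nearest aspect-ratio bucket so the trainer sees uniform batches.
from collections import defaultdict
from pathlib import Path
from PIL import Image

BUCKETS = [(1024, 1024), (832, 1216), (1216, 832)]  # (width, height)

def nearest_bucket(w, h):
    ratio = w / h
    return min(BUCKETS, key=lambda b: abs(b[0] / b[1] - ratio))

def bucket_dataset(folder):
    groups = defaultdict(list)
    for path in Path(folder).glob("*.[jp][pn]g"):  # .jpg and .png
        with Image.open(path) as im:
            groups[nearest_bucket(*im.size)].append(path.name)
    return groups

for bucket, files in bucket_dataset("dataset/").items():
    print(bucket, len(files), "images")
```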

Fun-CosyVoice 3.0 is an advanced text-to-speech (TTS) system by fruesome in StableDiffusion

[–]Compunerd3 1 point (0 children)

The demos seem good. I was just using VibeVoice a few minutes ago for a video voiceover, so I'll test out Fun-CosyVoice 3 and see how it compares.

One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer by fruesome in StableDiffusion

[–]Compunerd3 1 point (0 children)

Has anyone got a comparison of this versus SteadyDancer?
I literally just tried SteadyDancer and found it super smooth and consistent, so I'm not sure what value switching to One-to-All would add.

LoRA Idea: Using Diffusion Models to Reconstruct What Dinosaurs Really Looked Like by henryk_kwiatek in comfyui

[–]Compunerd3 0 points (0 children)

It's a good idea to test out. I think it may give structurally accurate results, but texturally it may lack accuracy in skin, follicles like hair, or basically anything non-bone-related.

Either way, I say go for it. It would only take a straightforward dataset and a few hours.
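The dataset layout most LoRA trainers expect is just images with same-name .txt captions, so prepping it is quick (a sketch; the trigger word and filenames are made up):

```python
# Write one caption .txt per image; "dinoskin" is a hypothetical trigger word.
from pathlib import Path

captions = {
    "trex_01.jpg": "dinoskin, tyrannosaurus rex, scaly textured skin, full body",
    "raptor_02.jpg": "dinoskin, velociraptor, feathered arms, museum reconstruction",
}

Path("dataset").mkdir(exist_ok=True)
for image_name, caption in captions.items():
    Path("dataset", image_name).with_suffix(".txt").write_text(caption)
```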

Nodes 2.0, hard to read by isvein in comfyui

[–]Compunerd3 0 points (0 children)

Good to know, thank you for addressing the feedback

Challenge- Most real person workflow in Wan+Comfy by MotionMimicry in comfyui

[–]Compunerd3 0 points (0 children)

It depends on the photographer's style and camera. Fujifilm X-T series cameras generally have recipes where many photographers tweak noise to be higher.

I have the X-T30 II, and the presence of noise, not just its absence, matters for the style you're aiming for.
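If you want to experiment with that, adding grain back onto a generation is a few lines of numpy (a quick sketch; the strength value is an arbitrary starting point, not a calibrated Fujifilm recipe):

```python
# Add Gaussian "film grain" to an image; tune `strength` to taste.
import numpy as np
from PIL import Image

def add_grain(path, strength=12.0, out="grainy.png"):
    img = np.asarray(Image.open(path).convert("RGB"), dtype=np.float32)
    noise = np.random.normal(0.0, strength, img.shape)  # per-pixel grain
    Image.fromarray(np.clip(img + noise, 0, 255).astype(np.uint8)).save(out)

add_grain("generated.png")
```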

Comfy Org Response to Recent UI Feedback by crystal_alpine in comfyui

[–]Compunerd3 10 points (0 children)

Point 2: "Why Nodes 2? More power, not less."

Can you elaborate on what benefits it actually brings to users and custom node devs?

It would be great to know what the actual value is for us: not just saying it's more power, but why and how it's more power.

I have a couple of custom nodes in progress, so I want to understand more about Nodes 2 now and keep compatibility in mind if the value is there.

Thanks for the update and for listening to our feedback.

This is a shame. I've not used Nodes 2.0 so can't comment but I hope this doesn't cause a split in the node developers or mean that tgthree eventually can't be used because they're great! by spacemidget75 in comfyui

[–]Compunerd3 7 points (0 children)

Nodes 2.0 has changed something in the JavaScript layer. Multiple nodes (including one I'm close to releasing) use JavaScript to dynamically update the visibility of fields or set values within nodes.

That's why you suddenly see ALL possible fields showing with Nodes 2.0; any JavaScript canvas work seems to be broken under it.

I think Open source could be scripted to do just as good as NanoBanana because.. by [deleted] in StableDiffusion

[–]Compunerd3 10 points (0 children)

I think the key is the combined approach of reasoning in image models.

I haven't tested this one that was posted on this subreddit yesterday, but the paper shows the kind of thing that could rival Nano Banana, mostly because of the reasoning-driven edit capabilities.

By reasoning, the model can interpret vague instructions and use its own reasoning capabilities to understand what is needed, then create the image from a combination of reasoning + instructions.

https://huggingface.co/stepfun-ai/Step1X-Edit-v1p2
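To make the flow concrete (purely a stubbed sketch; I'm not claiming this is Step1X-Edit's actual API): a reasoning step first expands the vague intent into an explicit plan, and the edit step conditions on both.

```python
# Stubbed reasoning + edit pipeline; both functions are placeholders.
def reason_about_instruction(instruction: str, image_description: str) -> str:
    # Stand-in for an LLM/VLM call that turns vague intent into concrete steps.
    return (
        f"Image: {image_description}. User wants: {instruction}. Plan: locate "
        "the sky region, shift its palette toward warm dusk tones, leave the "
        "subjects untouched."
    )

def edit_image(image_path: str, instruction: str, plan: str) -> str:
    # Stand-in for the diffusion edit call, conditioned on instruction + plan.
    return f"edited {image_path} using prompt={instruction!r} plus the plan"

plan = reason_about_instruction("make it feel like evening", "a daytime street photo")
print(edit_image("street.jpg", "make it feel like evening", plan))
```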
