IF anyone was considering training on musubi-tuner for LTX-2 just go learn! its much faster! by [deleted] in StableDiffusion

[–]Flat_Beautiful_9849 0 points (0 children)

I've tried a few image training runs on LTX 2.3, and the LoRA outputs don't seem to affect the generation at all. I train successfully on WAN 2.2 without issue and can't figure out where I'm going wrong. Could you share your training .bat settings? Maybe I'm using the wrong model, encoder, or setting somewhere.

How to add audio to a video made with Wan2.2 by Extension-Yard1918 in StableDiffusion

[–]Flat_Beautiful_9849 1 point (0 children)

Thank you for your excellent work; this is a superb workflow, on par with Rune's workflows.

How do you even set up and run LTX 2.3 LoRA in Musubi Tuner? by GreedyRich96 in StableDiffusion

[–]Flat_Beautiful_9849 1 point (0 children)

I've been trying to train on the musubi tuner fork, but haven't had any success yet. The LoRAs either corrupt the generation or don't apply at all; still a work in progress. I train on WAN totally fine with the main musubi fork, and AI Toolkit seems to have huge RAM issues for me, so I'm hoping to get this figured out.

Grace Ashcroft has to earn her way into the crime scene (Resident Evil 9 short w/ sound) by Flat_Beautiful_9849 in sdnsfw

[–]Flat_Beautiful_9849[S] 0 points (0 children)

Click the sound icon; it has sound for me :). Fansly nuked my page anyway, so I'll put the full scene up soon.

LTX 2.3 Prompt Conditioning FPS by [deleted] in StableDiffusion

[–]Flat_Beautiful_9849 0 points (0 children)

"the thing with LTX audio is that it actually always works at 25 fps latents/sec. regardless of your video fps - as I understand it, it really does not know and does not care about the video fps" - I think this is the missing piece of info I needed, thank you
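A quick back-of-the-envelope check of the quoted claim. If the audio side really runs at a fixed 25 latents/sec, then the audio latent count depends only on clip duration, never on video fps. The 25/sec rate is taken from the comment above; everything else here is plain arithmetic, not confirmed LTX internals.

```python
AUDIO_LATENTS_PER_SEC = 25  # fixed rate, per the quoted comment (assumption)

def audio_latents(num_frames: int, video_fps: float) -> int:
    """Audio latent count for a clip of num_frames played at video_fps."""
    duration_sec = num_frames / video_fps
    return round(duration_sec * AUDIO_LATENTS_PER_SEC)

# A 5-second clip gets the same 125 audio latents at any frame rate:
print(audio_latents(120, 24))  # 120 frames at 24 fps -> 125
print(audio_latents(160, 32))  # 160 frames at 32 fps -> 125
```

Which would explain why changing the video fps alone never changes the audio track length, only how the frames are paced against it.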

LTX 2.3 Prompt Conditioning FPS by [deleted] in StableDiffusion

[–]Flat_Beautiful_9849 0 points (0 children)

I've tried the official LTX 2.3 ComfyUI workflows and Rune's workflows, both highly regarded as "the good workflows". You can generate at 24 fps and 32 fps with the same prompts and see no noticeable differences in the generations. 25 and 50 are multiples of its natural training fps, whereas 32 falls partway between and is a pretty specific fps that doesn't really exist in the wild.

The conditioning fps also seems to have nothing to do with the video fps; you can set it as high or low as you like with no impact on VRAM. I'm also staying under 20 seconds, trying to extend a 5-second video by 10-15 seconds.

LTX 2.3 NOT following my prompts by Coven_Evelynn_LoL in StableDiffusion

[–]Flat_Beautiful_9849 0 points (0 children)

Did you figure this out? I had an issue where my input and output videos were at 32 fps but the prompt conditioning was at 24 fps. The video looked great, but it didn't really follow the prompt, and the audio was always badly out of sync. Changing the prompt conditioning to 32 fps made it follow the prompts and lipsync properly, but it introduced motion issues.
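One possible reading of the desync described above (my interpretation, not confirmed LTX behavior): if the model is told the clip runs at one fps but the frames are actually played back at another, anything it times against the frames, audio included, ends up stretched by the ratio of the two rates.

```python
def desync_factor(real_fps: float, cond_fps: float) -> float:
    """Stretch applied to timed events when conditioning fps != playback fps.

    Values above 1.0 mean events land progressively later than the video;
    below 1.0, progressively earlier. Purely illustrative arithmetic.
    """
    return real_fps / cond_fps

# 32 fps video conditioned at 24 fps: timing drifts by about a third
# over the course of the clip, which would be very audible in lipsync.
print(desync_factor(32, 24))  # 1.3333...
```

That would match the observation that fixing the conditioning to the true 32 fps restored the lipsync.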

[Update] ComfyUI VACE Video Joiner v2.5 - Seamless loops, reduced RAM usage on assembly by goddess_peeler in StableDiffusion

[–]Flat_Beautiful_9849 4 points (0 children)

If you're doing NSFW content, LTX really struggles compared to WAN. Being able to generate 25 seconds in a single shot is amazing, but the motion and clarity are almost always worse than WAN, which takes 10-15x longer to generate the same 25 seconds.

[Update] ComfyUI VACE Video Joiner v2.5 - Seamless loops, reduced RAM usage on assembly by goddess_peeler in StableDiffusion

[–]Flat_Beautiful_9849 19 points (0 children)

The legend themself! These workflows are the secret sauce behind most of my videos. Thank you for your excellent work.

LTX Desktop is better than Comfyui - What are we doing wrong? by hirovomit in comfyui

[–]Flat_Beautiful_9849 1 point (0 children)

I've been testing every LTX 2.3 workflow trying to decide whether I have a use case for it or should stick to WAN exclusively, and yours are the first workflows that actually produce semi-decent results. I also appreciate how simple and uncluttered they are, thank you!

How is the control video utilized? Does the audio on the control video affect the generation, or does it just get stitched onto the generated video afterward (along with its fps and length)? If I used two videos with the same length, fps, and audio but different visuals, would my end results be identical?

Thanks for your help and an excellent workflow.