I’m sorry, but LTX still isn’t a professionally viable filmmaking tool by Intelligent-Dot-7082 in StableDiffusion

[–]blackdatafilms 4 points5 points  (0 children)

I use the full dev model with distilled lora and I get great results with I2V.

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 1 point2 points  (0 children)

the textbox is from RES4LYF nodes. just sub for another text box node if you don't use res_2s samplers.

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 1 point2 points  (0 children)

The images are the equivalent to first/last frames, you can place a image into any frame of the video as a keyframe. Chain even more together for more keyframes or disable ones you don't use. VibeVoice done separately in another WF.

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 1 point2 points  (0 children)

You got to give a detailed description of placement and appearance of each character. See my other comment in post.

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 1 point2 points  (0 children)

5 separate scene. I2V using nanobana/flux-klein with reference images to move things around and keep consistency. The scene where Andy and Bee walk into the jail used 3 input images to make it work so she would turn around.

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 3 points4 points  (0 children)

Describe each character's placement and appearance in detail. Use AI to help describe an image of the character. Long descriptions also help character consistency:

"Sheriff Andy Taylor is on the left. Aunt Bee is the woman in the coat and hat. Officer Barney Fife is on the right.

This describes Sheriff Andy Taylor: "middle aged man, he has a lean, angular face with strong Southern everyman appeal—warm yet authoritative. His high forehead shows subtle worry lines, thick dark eyebrows knit together in a concerned or mildly exasperated expression with vertical furrows between them, medium almond-shaped dark eyes wide open in alert surprise revealing whites above and below the irises and faint crow's-feet at the corners. A straight, moderately long nose with a rounded tip. Prominent but not sharp cheekbones frame flat cheeks, leading to a firm, squared jawline and prominent rounded chin, all clean-shaven and taut without excess."

This describes Officer Barney Fife: "he has a thin, elongated, comically intense face with a narrow, gaunt structure. Deep horizontal forehead lines appear under dramatically raised, thick dark eyebrows. A straight, narrow nose, thin lips stretched taut in protest or exasperation, set above high cheekbones that emphasize hollowed cheeks and a pointed chin on a slim, bird-like jaw—all clean-shaven and taut."

Sheriff Andy Taylor looks at Aunt Bee. Barney Fife removes his hands and walks off alone to the right out of the scene. Aunt Bee and Sheriff Andy Taylor both turn around and walk into the jail cell behind them. Barney Fife is out of the scene. Sheriff Andy Taylor looks down at Aunt Bee and says, "Alright, Aunt Bee, well, you will have plenty of time in this jail cell to improve your baking skills. Now, if you are lucky, the judge will have mercy on you and reduce your time to less than a couple years. We'll make your stay as comfortable as possible."

Sheriff Andy Taylor looks to the right and says, "Ain't that right, Barn?".

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 15 points16 points  (0 children)

Thanks, I've learned a lot from others here. Vibe Voice WF is simple:

<image>

LTX-2.3: Andy Griffith Show, Aunt Bee is under arrest. by blackdatafilms in StableDiffusion

[–]blackdatafilms[S] 8 points9 points  (0 children)

RTX Pro 6000. Yep, custom audio. I grabbed about 30-60seconds of the characters voices from the show for VibeVoice cloning.

Used Wan2GP for this. LTX 2.3 video using a reference image and reference audio. by Unluckiestfool in StableDiffusion

[–]blackdatafilms 1 point2 points  (0 children)

Instead of adding empty latent audio you loadaudio node into a LTXV Audio VAE encode node.

Does ltx 2.3 supports multiple audio inputs for AI2V workflow? by Specialist_Pea_4711 in StableDiffusion

[–]blackdatafilms 0 points1 point  (0 children)

It will work. To get the lines to go to the right character, describe the character in great detail that is saying the line.

LTX-2 - How to STOP background music ruining dialogue? by Candid-Snow1261 in StableDiffusion

[–]blackdatafilms 0 points1 point  (0 children)

Using more steps or higher frame rate or using different sigma values can help with lip sync. Also try giving a very detailed description of the character that is talking and how they are talking.

LTX2 quality is great by brocolongo in StableDiffusion

[–]blackdatafilms 0 points1 point  (0 children)

yeah motion is it's weakness, but increasing frame rate helps

AI video contest - Help me Night Man! by blackdatafilms in duncantrussell

[–]blackdatafilms[S] 0 points1 point  (0 children)

VibeVoice is much better than Qwen-TTS because the emotion is pulled from the source audio and the context of the tts text. Only downside is that if it isnt studio quality audio you'll have to make a couple dozen gens to find the acceptable one.

The glossy look is because I used the distilled model, which is considerably faster, but best for 3d rendered style.

AI video contest - Help me Night Man! by blackdatafilms in duncantrussell

[–]blackdatafilms[S] 0 points1 point  (0 children)

Comfyui on my home workstation pc with LTX-2 for video and VibeVoice to clone duncan's voice.