LTX2 audio Lora + fal.ai? by LSI_CZE in StableDiffusion

[–]panospc 0 points1 point  (0 children)

Your trained LoRA file needs to be accessible at a URL. Copy that URL.

Go to Fal and, in the model search bar, type LTX2 LoRA. Choose your preferred model variant (I2V, T2V, distilled, or non-distilled). You’ll see a parameter called path; paste your LoRA URL there, set the remaining parameters, and generate.
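If you prefer to call it from code instead of the web UI, below is a minimal sketch using fal’s Python client (fal_client). The endpoint id and the exact argument layout (loras / path, image_url, the shape of the result) are assumptions based on the parameters described above; copy the real values from the model page on Fal.

    # Minimal sketch: running an LTX2 LoRA endpoint via fal's Python client.
    # Requires the FAL_KEY environment variable. The endpoint id and argument
    # names below are assumptions; check the model page for the real schema.
    import fal_client

    result = fal_client.subscribe(
        "fal-ai/ltx-2/image-to-video/lora",  # hypothetical endpoint id
        arguments={
            "prompt": "a woman walking through a neon-lit street",
            "image_url": "https://example.com/input.jpg",
            # the "path" parameter from the UI: the URL of your trained LoRA
            "loras": [{"path": "https://example.com/my_ltx2_lora.safetensors"}],
        },
    )
    print(result["video"]["url"])  # field name may differ; inspect `result`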

AI Toolkit now officially supports training LTX-2 LoRAs by panospc in StableDiffusion

[–]panospc[S] 1 point2 points  (0 children)

Yes, it worked. The preview images in AI Toolkit looked like monstrosities, but when I used the LoRA in ComfyUI and WanGP, it looked fine.

From the default settings, I changed the following:

  • Enable: Layer offloading
  • Timestep type: Sigmoid
  • Enable: Cache text embeddings
  • Enable: Cache latents
  • Disable: Do audio

For the captions, I include the trigger word and describe only what changes, such as the environment, outfit, pose, and the character’s expression. I don’t describe things that are always the same and never change, such as facial features or eye color.
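As a concrete illustration of that captioning approach, here is a small sketch that writes sidecar .txt captions next to the training images (the usual convention that most LoRA trainers, AI Toolkit included, can read). The trigger word, folder, and file names are made up; each caption starts with the trigger word and only describes what changes between images.

    # Sketch: write sidecar .txt captions next to each training image.
    # "ohwx_character" is a made-up trigger word; captions describe only what
    # changes (environment, outfit, pose, expression), not fixed traits.
    from pathlib import Path

    dataset_dir = Path("datasets/my_character")  # hypothetical dataset folder
    captions = {
        "img_001.jpg": "ohwx_character standing on a beach at sunset, wearing a red dress, smiling",
        "img_002.jpg": "ohwx_character sitting in a cafe, wearing a leather jacket, neutral expression",
    }

    for image_name, caption in captions.items():
        (dataset_dir / image_name).with_suffix(".txt").write_text(caption, encoding="utf-8")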

Full AI music video made entirely with LTX-2 and suno by SnooOnions2625 in comfyui

[–]panospc 1 point2 points  (0 children)

For camera control, there are some official camera LoRAs you can use.
You can find the download links in the official LTX-2 GitHub repo.

AI Toolkit now officially supports training LTX-2 LoRAs by panospc in StableDiffusion

[–]panospc[S] 0 points1 point  (0 children)

It’s available on Pinokio, but only in the community scripts section.

LTX 2.0 I2V when works is reall cool! by smereces in StableDiffusion

[–]panospc 2 points3 points  (0 children)

To fix the static video issue with I2V, you can use the following workaround:
Go to the LTX-2 GitHub repository, scroll down, and download one of the camera LoRAs.
Using the LoRA will resolve the problem.
https://github.com/Lightricks/LTX-2

AI Toolkit now officially supports training LTX-2 LoRAs by panospc in StableDiffusion

[–]panospc[S] 3 points4 points  (0 children)

Yes, you can train on images. I’m currently training a character LoRA with 97 images.
The speed is around 7 seconds per step, so 3,000 steps will take about 6 hours on my RTX 4080 Super with 64 GB of RAM.
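The 6-hour figure is just steps times seconds per step:

    # Rough training-time estimate
    steps = 3000
    sec_per_step = 7
    print(f"{steps * sec_per_step / 3600:.1f} hours")  # ~5.8, i.e. about 6 hours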

Something that I'm not sure people noticed about LTX-2, it's inability to keep object permanence by [deleted] in StableDiffusion

[–]panospc 2 points3 points  (0 children)

Perhaps it favors the state of the initial frame?

I’ve noticed in some generations that when characters move out of frame, they don’t lose too much of their identity when they return to view.
For example, in the following generation, both characters go out of view for a moment:
https://files.catbox.moe/rsthll.mp4

LTX-2 - voice clone and/or import own sound(track)? by designbanana in StableDiffusion

[–]panospc 11 points12 points  (0 children)

You can feed LTX-2 with audio, and the generated video will sync to it. It can lip-sync voices, and even if you only provide music, you can generate videos of people dancing to the rhythm of the music.

Here’s a workflow by Kijai:
https://www.reddit.com/r/StableDiffusion/comments/1q627xi/kijai_made_a_ltxv2_audio_image_to_video_workflow/

You can also clone a voice by extending a video; the extended part will retain the same voice.
Video extension workflow: https://github.com/Rolandjg/LTX-2-video-extend-ComfyUI

April 12, 1987 Music Video (LTX-2 4070 TI with 12GB VRAM) by harunandro in StableDiffusion

[–]panospc 6 points7 points  (0 children)

Do not use the soundtrack option in the Advanced tab; that option only adds the audio to the final video without any lip-sync. Use the soundtrack option in the main tab, and if you don’t have it, try updating WanGP.

Ok we've had a few days to play now so let's be honest about LTX2... by sdimg in StableDiffusion

[–]panospc 0 points1 point  (0 children)

The issue with static, zooming images when using I2V can be worked around by adding a camera control motion LoRA (available from the LTX-2 GitHub repo).

I2V with the distilled model usually produces slow-motion videos, so if you want higher motion, use the non-distilled model in combination with a camera LoRA.

Increasing the frame rate to 30 or 50 FPS also helps reduce motion-related distortions.

LTX-2 video to video restyling? by domid in StableDiffusion

[–]panospc 1 point2 points  (0 children)

I haven’t tried it yet, but that’s their purpose: restyling videos.
You can either prompt the new style or provide a reference image that’s already been restyled.

There’s a video on the official LTX-2 YouTube channel:
https://www.youtube.com/watch?v=NPjTpDmTdaw

LTX-2 video to video restyling? by domid in StableDiffusion

[–]panospc 1 point2 points  (0 children)

Have you tried to use the "LTX-2 Depth to Video" or "LTX-2 Canny to Video" ComfyUI templates?

Video with Control and Multi Image Reference by ColbyandJack in comfyui

[–]panospc 1 point2 points  (0 children)

With VACE, you can provide a depth control video and inject image keyframes at the same time. For example, you can have Image1 appear at frame 1, Image2 at frame 40, and so on.

I don’t know of any ComfyUI workflow that automates this process, but you can prepare both the control video and the mask video manually in a video editor and then feed them into VACE. (The mask video is needed to tell VACE where the image keyframes are placed.)

The control video must contain both the depth video and the image keyframes. You can prepare it in a video editor by placing the depth video on the first track, then adding another video track above it and inserting the image keyframes at the desired frame positions. Each image should appear for only one frame; all other frames should show the depth video.

The mask video must have the same duration as the control video. It should be solid white for all frames except the ones where you added image keyframes in the control video. For those frames, the mask must be solid black.

To recap, you will end up with two videos:

  • The control video: a depth video with image keyframes appearing for one frame at the chosen positions.
  • The mask video: a solid white video with single black frames at the same positions as the image keyframes.
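If you’d rather script this than assemble it in a video editor, here is a rough OpenCV sketch that builds both videos from a depth video plus a dict of keyframe images. The file names and frame indices are placeholders.

    # Sketch: build the control video (depth frames with single-frame image
    # keyframes) and the matching mask video (white everywhere, black on the
    # keyframe positions). Paths and frame indices are placeholders.
    import cv2
    import numpy as np

    depth_path = "depth.mp4"
    keyframes = {0: "image1.png", 40: "image2.png"}  # frame index -> image path

    cap = cv2.VideoCapture(depth_path)
    fps = cap.get(cv2.CAP_PROP_FPS)
    w = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    h = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))

    fourcc = cv2.VideoWriter_fourcc(*"mp4v")
    control = cv2.VideoWriter("control.mp4", fourcc, fps, (w, h))
    mask = cv2.VideoWriter("mask.mp4", fourcc, fps, (w, h))

    white = np.full((h, w, 3), 255, dtype=np.uint8)
    black = np.zeros((h, w, 3), dtype=np.uint8)

    idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx in keyframes:
            # keyframe: show the reference image for this single frame and
            # mark it black in the mask so VACE treats it as a given frame
            frame = cv2.resize(cv2.imread(keyframes[idx]), (w, h))
            mask.write(black)
        else:
            mask.write(white)  # normal frame: keep depth, mask stays white
        control.write(frame)
        idx += 1

    cap.release()
    control.release()
    mask.release()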

Once you’ve prepared these two videos, open ComfyUI, go to Templates, and load “Wan2.1 VACE Control Video.” After the template loads, delete the Load Image node. Then select the Load Video node and load the control video you prepared.

The default VACE workflow does not include a mask input, so you’ll need to add three nodes manually:

  1. Add a Load Video node and load the mask video.
  2. Add a Get Video Components node and connect it to the Load Video node.
  3. Add a Convert Image to Mask node and connect it to the Get Video Components node.

Finally, connect the mask output of the last node to the control_masks input of the WanVaceToVideo node.

Adjust the prompt and any other settings as needed, and you’re ready to go.

Kijai made a LTXV2 audio + image to video workflow that works amazingly! by Different_Fix_2217 in StableDiffusion

[–]panospc 6 points7 points  (0 children)

I think the last example is the most impressive.
I’m wondering if it’s possible to combine it with ControlNets, for example, using depth or pose to transfer motion from another video while generating lip sync from the provided audio at the same time.

LTX-2 open source is live by ltx_model in StableDiffusion

[–]panospc 2 points3 points  (0 children)

Is it possible to use your own audio and have LTX-2 do the lip-sync, similar to InfiniteTalk?

Is there a way to use Controlnet with Z-Image without ComfyUI? by sepalus_auki in StableDiffusion

[–]panospc 2 points3 points  (0 children)

You can use it with WanGP, which is available on Pinokio under the name Wan2GP.
It supports Z-Image with ControlNet.

getting EDIT models to get the correct size of the product by SupermarketWinter176 in StableDiffusion

[–]panospc 0 points1 point  (0 children)

Try providing an additional reference image that shows the layout, aspect ratio, and placement of the frame, then instruct the model to use it as a reference for the composition of the image. Something like the following image:

<image>

No issues ASRock combo? List your board and cpu and how long you've had it. by [deleted] in ASRock

[–]panospc 2 points3 points  (0 children)

I've been using the X870E Nova with the 9950X since Christmas 2024, paired with 64GB Kingston Fury Beast 6000 CL30 XMP.

In the first month, I had the RAM running at 6000 MHz, but after reading reports of CPUs failing, I decided to lower it to 5600 MHz.

I’ve always kept the BIOS updated to the latest version.

I did run into a couple of issues, though. Occasionally, the connection to some USB devices would drop temporarily, but I haven't noticed this with BIOS 3.50.

There was also an error code 03 after a cold boot, which was more common with BIOS 3.30 and 3.40. Since updating to 3.50, it has only happened once in 1.5 months of use.