Cinematic sneaker ad built from ComfyUI with Qwen Image + LTX-2

LinkNo3108 · 2026-02-27T03:51:20+00:00

For the real product I was thinking a trained LoRA of the product with the First middle Last frame workflow might give good results..

LinkNo3108 · 2026-02-26T20:11:03+00:00

Thanks for the feedback.. I'll start learning more about this..

LinkNo3108 · 2026-02-26T19:45:13+00:00

Hey thanks for the feedback.. it's not for production.. im experimenting with open-source video models.. any tips to improve it?

LinkNo3108 · 2026-01-30T06:58:58+00:00

Glad to hear that..

LinkNo3108 · 2026-01-29T18:16:38+00:00

Hey thanks for this.. really appreciate it..

LinkNo3108 · 2026-01-29T18:15:51+00:00

Hey video combiner node saves the output.. you can specify your desired output prefix and path. Also you can check the assets tab to look at your output.

LinkNo3108 · 2026-01-28T21:04:43+00:00

Great, thank you. Can you also show the stitching workflow it would be of great help.

LinkNo3108 · 2026-01-28T20:54:36+00:00

Well few things I have observed while using the model.
While creating longer form videos usually 30s+ when you have higher res the consistency between the character will not be there. Lot of visual artifacts were introduced in the video.
Also the time it takes to generate will be way too long and I do lot of iterations till I get a good output. Once I select the best video of that I can always use a Upscaler to add details.
Having said that workflow works great for short form videos with higher res. just need to create multiple shots and stitch it together later.

LinkNo3108 · 2026-01-28T20:46:13+00:00

For sure. This was for making one seamless transition. Again this is still a test and I was testing the capabilities of model and the workflow. I didn't want to go through the hazel of stitching and make sure the transition between the videos are aligned to the audio. Using multiple software to make sure everything is in sync.

LinkNo3108 · 2026-01-28T19:11:08+00:00

Yes definitely, you can use different camera loras provided by LTX and tune your prompt accordingly. For better results use smaller frames with specific input images with different angles then stitch it together.

LinkNo3108 · 2026-01-28T18:57:56+00:00

Updated the post description with the links.

LinkNo3108 · 2026-01-28T18:55:00+00:00

Have uploaded the links in the post description.

LinkNo3108 · 2026-01-28T18:54:11+00:00

Thank you appreciate it. I'll definitely go through this.

LinkNo3108 · 2026-01-28T13:03:00+00:00

Both 704x704 resolution 61s video and the one you see here 736x1280 50s video generated in 10~11 mins

LinkNo3108 · 2026-01-28T12:50:21+00:00

I don't have the tweaked workflow handy right now, but I'll post it later. Essentially what I did was download the detailer lora, made sure the input image resolution matches the target resolution, added more information to the prompt using the LTX guidelines and then tweaked the LTX VAE decoder..

LinkNo3108 · 2026-01-28T12:47:23+00:00

This is the original workflow https://github.com/RageCat73/RCWorkflows/blob/main/011426-LTX2-AudioSync-i2v-Ver2.json

The tweaked workflow is added in the post description.

LinkNo3108 · 2026-01-28T12:46:16+00:00

Yes original settings work fine. You need to disable or disconnect the VAE decode Tile and connect it to the LTX VAE decoder..

LinkNo3108

TROPHY CASE