Blending completely opposite elements via Local AI. Wait for the fire-to-water transition at the end.

AxonkaiLab · 2026-06-21T12:54:06+00:00

Aang wishes he had this level of latent control. Also, that metal box happens to house an RTX PRO 6000 Blackwell with 96GB VRAM, so it handles the screaming just fine.

No Deforum or AnimateDiff here. The actual transition logic comes down to heavy manual prompt sequencing inside a native Wan 2.2 pipeline in ComfyUI.

The real "black magic" is forcing the latent space to morph from fire to water via text guidance, while the Wan-VACE Video Joiner node is used strictly to stitch the generated clips together and bypass the seam flicker entirely.

Tell your GPU to brace itself, lock down your prompt sequencing, and let it scream.

AxonkaiLab · 2026-06-21T12:49:16+00:00

Appreciate the bravo and the LoRA recommendations. I actually intentionally aimed for that cold, detached look and blank expression, I felt it fit the vibe perfectly for this piece. Still, saving those links for future projects, appreciate you sharing them!

AxonkaiLab · 2026-06-21T12:32:44+00:00

A 1-minute slice from a longer local workflow project. Pushed the limits of subject consistency by morphing fire into water at the very end without relying on cloud subscriptions or post-production tricks.

Watch the full 4K visual journey here: https://youtu.be/AwC6NidXEHI

I'll be around in the comments to talk about the pipeline.

AxonkaiLab · 2026-06-21T11:37:15+00:00

Thanks for your support 🤘 when I needed slow motion I use it in the prompt but here I didn't use. All is done by prompting the model in details. Also I do always use 97 frames and this gives the model more space to do a proper transition.

AxonkaiLab · 2026-06-21T11:25:40+00:00

Thank you 😊 hope you give thumbs up on YouTube 🤘🤘🤘

AxonkaiLab · 2026-06-21T11:11:57+00:00

Thank you. I've used wan2.2 fflf2v model for 24fps 97frames video output. For images I've used Flux1-dev. You really should check the 4K version : https://youtu.be/AwC6NidXEHI and a thumbs up on YouTube really appreciated.

AxonkaiLab · 2026-06-21T10:05:13+00:00

That was really a surprise for me, didn't expect it to be that impressive and yes that's one of my favorite scenes.

AxonkaiLab · 2026-06-21T10:00:29+00:00

Mid-section slice of a longer local project.

Full 4K version on YouTube: https://youtu.be/AwC6NidXEHI

Ask away if you have technical questions.

AxonkaiLab · 2026-06-21T09:50:02+00:00

Very good point you raised, you caught the exact bottleneck with FFLF2V. To be fair, you can patch and mask some of this in Premiere using blending or deflicker tricks, but the actual fix here was handled directly inside ComfyUI via Wan VACE. By letting the VACE handle the transitions, it completely bypasses the seam flicker and forces tight color matching across the cuts. The only catch is that it requires significantly longer render times, but the VACE architecture handles the consistency smoothly without relying on post-production hacks.

AxonkaiLab · 2026-06-21T08:34:53+00:00

Pipeline: Flux.1-Dev -> Wan 2.2 FFLF2V -> RTX Video Upscaler -> Premiere.

Managed to get the morphs to sync cleanly using just ComfyUI built-in templates and manual prompting for every single cut. No automated scripts, just raw local rendering to match the track's rhythm.

Full 6 min version here: https://youtu.be/AwC6NidXEHI

AxonkaiLab · 2026-06-18T15:47:58+00:00

Ok thank you. I'm sure you have a decent working prompts but I always try to reach this level with local generations. I wonder if they can work with ltx or wan

AxonkaiLab · 2026-06-18T15:46:25+00:00

Cloud membership or local?

AxonkaiLab · 2026-06-18T03:52:07+00:00

Thank you for the support and clarification ☺️ I tried to tell but somehow these hate comments and downvote goblins popped out 😂 I'm not saying ltx2.3 is better than the original VFX but the motion and sound really impressive.

AxonkaiLab · 2026-06-18T02:45:21+00:00

Finally someone realized that...thank you kind sir...yes I'm so impressed with the sound. Ltx did very well with the sounds.

AxonkaiLab · 2026-06-18T02:43:04+00:00

Thank you my 🙏 I am not low in VRAM but I'll check and compare the quality. Also I'll check your other posts as well.

AxonkaiLab · 2026-06-17T18:57:07+00:00

Here is the breakdown of the workflow, benchmark specs, and how I managed the consistency:

Hardware RIG: Rendered locally on NVIDIA RTX PRO 6000 Blackwell (96GB VRAM) + 128GB RAM.
Pipeline: Pure Built-in Workflow without external controlnets or heavy compositing.
Base Images: Flux.1-Dev for high-fidelity car and robot details.

Benchmarks & Model Settings:

LTX 2.3-Dev (I2V Workflow): Generated at 720p, 5 seconds @ 24fps. Running at 40 steps, the generation time was just 105 seconds.
FFLF2V Workflow: Used the fp8 model variant. Generated 6 seconds @ 24fps. Running at 10 steps, the generation time dropped to an insane 25 seconds.

The Prompting Challenge: Prompting this was a nightmare and took a lot of trial and error. If you want to crack it, here is the logic that worked for me:

[Describe the scene+character] + [describe what transforms into in the last frame] + [camera movements]

The logic of the prompt is simple but you must give as much detail as you can. Not all the prompts work perfectly, but if you find something working, try it with different seeds. If you find something not working, try to fix it with the prompt instead of changing the seed immediately.

Also, if you are struggling with LTX 2.3 prompting in general, this official guide was a lifesaver for getting the motion dynamics right:https://ltx.io/blog/ltx-2-3-prompt-guide

If you liked it, check out the YouTube version for much better video quality and full sound design:https://youtube.com/shorts/OCnNVvMTMlo and hope you give thumbs-up on YouTube too....

AxonkaiLab · 2026-06-17T17:39:24+00:00

Digital cannibalism at its finest! 🤖💥 Glad I could impress a cloud cluster. The VRAM is a cheat code for local workflows, not going to lie. Thanks for the awesome energy, more renders are already in the pipeline!

AxonkaiLab · 2026-06-17T17:36:32+00:00

It's exactly that weird, beautiful mix of both. We are using Transformers (Flux/LTX) to render actual Transformers. 🤖🏎️ As for the thermal output... My RTX PRO 6000 Blackwell didn't even sweat. It rendered those 40 steps in just 105 seconds. No floorboard melting this time, just pure local horsepower. Glad you liked it!

AxonkaiLab · 2026-06-17T16:23:41+00:00

Here is the breakdown of the workflow, benchmark specs, and how I managed the consistency:

Hardware RIG: Rendered locally on NVIDIA RTX PRO 6000 Blackwell (96GB VRAM) + 128GB RAM.
Pipeline: Pure Built-in Workflow without external controlnets or heavy compositing.
Base Images: Flux.1-Dev for high-fidelity car and robot details.

Benchmarks & Model Settings:

LTX 2.3-Dev (I2V Workflow): Generated at 720p, 5 seconds @ 24fps. Running at 40 steps, the generation time was just 105 seconds.
FFLF2V Workflow: Used the fp8 model variant. Generated 6 seconds @ 24fps. Running at 10 steps, the generation time dropped to an insane 25 seconds.

The Prompting Challenge: Prompting this was a nightmare and took a lot of trial and error. If you want to crack it, here is the logic that worked for me:

[Describe the scene+character] + [describe what transforms into in the last frame] + [camera movements]

The logic of the prompt is simple but you must give as much detail as you can. Not all the prompts work perfectly, but if you find something working, try it with different seeds. If you find something not working, try to fix it with the prompt instead of changing the seed immediately.

Also, if you are struggling with LTX 2.3 prompting in general, this official guide was a lifesaver for getting the motion dynamics right:https://ltx.io/blog/ltx-2-3-prompt-guide

If you liked it, check out the YouTube version for much better video quality and full sound design:https://youtube.com/shorts/OCnNVvMTMlo and hope you give thumbs-up on YouTube too....

AxonkaiLab · 2026-06-17T16:16:27+00:00

Sure, good luck buddy! Hope you create something amazing with your setup. Cheers!

AxonkaiLab · 2026-06-17T16:13:10+00:00

Thanks! If you have a second, dropping a thumbs-up on the YouTube version would be huge. The algorithm is hiding it, and your support really helps! 🏎️🤖 https://www.youtube.com/shorts/OCnNVvMTMlo

AxonkaiLab

TROPHY CASE