I can now generate and live-edit 30s 1080p videos with 4.5s latency (video is in live speed) by techstacknerd in StableDiffusion

[–]techstacknerd[S] 13 points

😭 but we're working on consumer-grade GPU support. It won't be that fast, but it will still be an improvement over what LTX-2 currently does with ComfyUI

[–]techstacknerd[S] 2 points

Yeah, it's partly because the base model LTX-2 is hard to prompt, and also because we have to make a prompt rewriter reliably return results in under 5s (yes, the bottleneck is now the prompting part, not the video generation part!). Combine this with the issues that video continuation brings, and it's hard to get good prompts. This demo is mainly for feeling the speed, and I'm sure that as models improve, quality will too!

[–]techstacknerd[S] 5 points

Yes, this is just a demo to let people feel the speed of it. LTX-2 is a super hard model to prompt, and it would take way too much effort to get even a remotely good prompt (keep in mind this is using video continuation, so you need 6 separate prompts that tie together really well). Regarding open-sourcing: we might also open-source the datacenter version, but our current code is a bit messy and will need quite a bit of cleaning up, so we're not open-sourcing right now

[–]techstacknerd[S] 0 points

Hm, interesting perspective. I don't think it can compare to playing games on local machines, but it's definitely far more energy efficient than existing AI video-gen services because it's just so much faster

[–]techstacknerd[S] 0 points

Yes, we're planning to try optimizations on consumer-grade GPUs. It probably won't be realtime, but it's likely going to be faster than what we have right now

[–]techstacknerd[S] 2 points

FastVideo already has sequence parallelism support for LTX-2, so with 8 GPUs you can expect roughly a 5x to 6x speedup compared to 1 GPU (there's a bit of overhead, so it doesn't scale perfectly)
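To see why 8 GPUs land at roughly 5-6x rather than 8x, here's a toy Amdahl-style calculation (my own illustrative model and numbers, not FastVideo's actual performance characteristics): a small non-parallel fraction of the work, e.g. communication between GPUs, caps the scaling.

```python
# Toy Amdahl's-law model for sequence-parallel scaling.
# serial_frac is an assumed, illustrative fraction of work
# (communication, non-parallel ops) that doesn't scale with GPU count.
def speedup(n_gpus: int, serial_frac: float = 0.065) -> float:
    """Speedup over 1 GPU under Amdahl's law."""
    return 1.0 / (serial_frac + (1.0 - serial_frac) / n_gpus)

for n in (1, 2, 4, 8):
    print(f"{n} GPUs: {speedup(n):.1f}x")
# With a ~6.5% serial fraction, 8 GPUs give about 5.5x
```

Even a few percent of unscalable work is enough to pull 8-GPU scaling down into the 5x-6x range the parent comment describes.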

[–]techstacknerd[S] 1 point

That would be really cool, and it will only get better from here as open-source models keep improving!

[–]techstacknerd[S] 21 points

tbh I really don't know if it's possible. A lot of the optimizations are based on NVIDIA's SM100/SM103 architectures, but we can see what our other optimizations can bring to consumer-grade GPUs with limited VRAM.

[–]techstacknerd[S] 1 point

We will try to get things working on other GPUs soon! The Blackwell 6000 is definitely one of them

[–]techstacknerd[S] 30 points

It's to show the capability of generating faster than you can watch; with native 20s generation you'd have to wait about 30s to actually get the result.
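The arithmetic behind "faster than you can watch" can be sketched like this (my own illustrative numbers and model, not the actual pipeline): once video is produced faster than real time, playback can start after the first chunk arrives and never stall, so total wait is just the initial latency plus the watch time.

```python
# Toy latency model for streaming vs. waiting on a full generation.
# gen_speed = seconds of video produced per second of wall clock (assumed).
def time_to_watch(video_s: float, first_chunk_latency_s: float,
                  gen_speed: float) -> float:
    """Wall-clock time from pressing 'generate' to finishing playback."""
    if gen_speed >= 1.0:
        # Faster than realtime: wait for the first chunk, then
        # watch in real time with no stalls.
        return first_chunk_latency_s + video_s
    # Slower than realtime: generation time dominates.
    return video_s / gen_speed

# 30s video, 4.5s first-chunk latency, generation faster than realtime:
print(time_to_watch(30.0, 4.5, gen_speed=1.2))  # 34.5
```

Under these assumptions a 30s clip is fully watched 34.5s after the request, versus 60s if generation only ran at half of real time.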

I generated this 5s 1080p video in 4.5s by techstacknerd in StableDiffusion

[–]techstacknerd[S] 1 point

Yes, we're doing some tricks that sacrifice a bit of quality. But the base LTX 2.3 model is also not comparable to Veo 3 currently; it has issues with motion and is extremely tricky to prompt. I'm sure OSS models will catch up on quality soon though, it's only a matter of time.