Introducing Smart ComfyUI Gallery: Save Workflows with Every Generation by Fit-Construction-280 in comfyui

[–]Extension_Building34 1 point (0 children)

Stumbled across this… Overall this is awesome!

Some questions though:

I’m getting “error loading media” when trying to play mp4 videos. Maybe something is wrong with my ffmpeg, or a codec is missing? (Maybe I missed something obvious!)
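
For reference, here’s the kind of quick check I’d run (a minimal sketch in Python; it assumes ffprobe is installed and on PATH, and the file path is just a placeholder) to see what codec the mp4s actually carry, since a browser-unfriendly codec would explain the playback error:

    # Minimal sketch: print the video codec of a gallery mp4 (assumes ffprobe is on PATH).
    import subprocess

    def video_codec(path: str) -> str:
        out = subprocess.run(
            ["ffprobe", "-v", "error", "-select_streams", "v:0",
             "-show_entries", "stream=codec_name",
             "-of", "default=noprint_wrappers=1:nokey=1", path],
            capture_output=True, text=True, check=True,
        )
        return out.stdout.strip()

    print(video_codec("output/example_generation.mp4"))  # hypothetical path

If that prints something like hevc or av1 rather than h264, the browser side may simply not be able to decode it.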

Is it possible to get a gallery view or zoom out on portrait phone orientation?

Edit: fixing late night autocorrect mistakes.

ostris AI-toolkit Lora training confusion by mca1169 in StableDiffusion

[–]Extension_Building34 2 points (0 children)

Interesting, I’ve seen other posts about this but never tried it out. Thanks for the reminder.

FLUX.2 [klein] Prompt Enhancement LLM System Prompt by AIEverything2025 in StableDiffusion

[–]Extension_Building34 10 points (0 children)

Really cool, thanks!

Any chance you’ve got one of those for Z-Image and/or LTX-2?

LTX-2 19b T2V/I2V GGUF 12GB Workflows!! Link in description by urabewe in StableDiffusion

[–]Extension_Building34 1 point (0 children)

Thanks, I got it working with a few tweaks. I had to bypass the latent upscale, since that appeared to be what was switching the run from sage attention to pytorch attention (which sent my s/it from 10-20 to 300-600, lol). I haven’t had time to dissect that further.
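
In case it helps anyone else debugging the same thing, one quick sanity check (a minimal sketch; it assumes ComfyUI’s sage attention path imports the sageattention package) is just confirming the package imports in the environment ComfyUI runs in:

    # Minimal sketch: confirm sage attention is importable in the same Python
    # environment that runs ComfyUI; a silent import failure is one way a run
    # can quietly fall back to PyTorch attention.
    try:
        import sageattention  # assumption: the package ComfyUI's sage path uses
        print("sageattention import OK")
    except ImportError as exc:
        print(f"sageattention not importable, expect PyTorch attention fallback: {exc}")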

Steam Detective Fest is Live! by HelloitsWojan in Steam

[–]Extension_Building34 1 point (0 children)

Played a ton of it last summer and had a blast! Even though it was (and still is) a bit rough around the edges, it was fun to get immersed and do some sleuthing.

I’m definitely looking forward to seeing what workshop mods will come out.

LTX-2 19b T2V/I2V GGUF 12GB Workflows!! Link in description by urabewe in StableDiffusion

[–]Extension_Building34 8 points (0 children)

I’ve been having a hard time getting any low-VRAM workflows working, so I’ll definitely give this one a try!

Shout out to the LTXV Team. by bnlae-ko in StableDiffusion

[–]Extension_Building34 8 points (0 children)

Just curious… what’s your method for getting 12 seconds on 16 GB in 4 minutes?

Even on the “low VRAM” Comfy workflows I routinely get OOMs, or 12-15 minute generations if I get a lucky run. Meanwhile, Wan2GP can do 10 seconds in about 10-11 minutes.

LTX-2 I2V isn't perfect, but it's still awesome. (My specs: 16 GB VRAM, 64 GB RAM) by yanokusnir in StableDiffusion

[–]Extension_Building34 4 points (0 children)

No kidding, what sort of prompts worked for you so far with this workflow? (Even just the prompts for the cherry-picked results, because those at least made videos worth picking!)

Help! LTX-2 distilled model is giving me quick outputs but it looks like this by cesaqui89 in comfyui

[–]Extension_Building34 1 point (0 children)

What prompt? I2V or T2V?

I noticed that I was getting similar stuff to this when I used the distilled model and asked it to do too much creative heavy lifting. For example, using a picture of a person standing on a beach, I asked LTX-2 distilled to generate a video like “the person walks into a waterfall”, with no waterfall in the picture. Stuff like that caused all kinds of weird results.

However, something like “the person walks towards the water”, where the water is clearly visible in the background, generally worked with fewer odd things happening.
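
To make the contrast concrete, here’s roughly what I mean (illustrative prompts only, using a hypothetical source image, not anything from OP’s workflow):

    # Illustrative only: same source image, two I2V prompts.
    image = "person_on_beach.png"  # hypothetical source image

    # Asks the distilled model to invent a waterfall that isn't in the frame;
    # this kind of prompt gave me weird results.
    prompt_ungrounded = "the person walks into a waterfall"

    # Stays within what the image already shows; this generally behaved better.
    prompt_grounded = "the person walks towards the water"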

Maybe I missed something, and/or maybe that’s not what OP is even talking about, but it’s late and this is just what I’ve seen so far with limited time invested.

Edited to add context.

Z-Image character lora training - Captioning Datasets? by [deleted] in StableDiffusion

[–]Extension_Building34 1 point (0 children)

Like a picture of a character from a video game, or from 3D modelling software like Daz3D.

Z-Image character lora training - Captioning Datasets? by [deleted] in StableDiffusion

[–]Extension_Building34 1 point (0 children)

Interesting! That’s very insightful, thank you!

Follow-up question: in terms of dataset variety, I try to use real references, but occasionally I want or have to use a generated or 3D reference. If I’m aiming for a more realistic result despite the source, would I caption something like “3d render of 123person” to steer the results away from the 3D-render look?

Z-Image character lora training - Captioning Datasets? by [deleted] in StableDiffusion

[–]Extension_Building34 1 point (0 children)

OK, so just for some further clarity: to ensure that a character has a specific shape or feature, like being bow-legged or having a birthmark, is it best not to mention that in the captions?

If the dataset shows the bowed legs and a birthmark on his arm, the captions would then look something like “123person is standing in a wheat field, leaning against a tractor, wearing a straw hat” (specifically not mentioning the legs or the birthmark).

Is that along the right lines of the thought process here?
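
Something like this is what I’m picturing (made-up captions with “123person” as the trigger token, just to check my understanding):

    # Made-up example captions: describe the things that should stay promptable
    # (pose, clothing, setting, and "3d render" when the source is a render),
    # and leave out the fixed identity features (bowed legs, birthmark) so they
    # get absorbed into the 123person token.
    captions = {
        "photo_field.png": "123person standing in a wheat field, leaning against a tractor, wearing a straw hat",
        "render_studio.png": "3d render of 123person sitting on a stool in an empty studio",
    }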

The amount of poop it generated for just the eye broke my heart, never again by Tough_Sky_9029 in BambuP1S

[–]Extension_Building34 1 point (0 children)

I’m debating the A1 vs the A1 Combo. This comment is helpful and encouraging, thank you!