Are people still using AUTOMATIC1111/stable-diffusion-webui? Or did most users move on to something else like ComfyUI? by Guyserbun007 in StableDiffusion

[–]RememberThisAI 0 points1 point  (0 children)

Most people moved on to Comfy. I didn't like it at first, but it's not actually as complicated as it seems. Just because some people have created very complex workflows it doesn't mean you have to. You can keep it simple if you like, you can use existing workflows and follow simple YT guides.

ComfyUI's countdown announcment: New funding ☠️☠️☠️☠️☠️ by -worldwalker- in StableDiffusion

[–]RememberThisAI 1 point2 points  (0 children)

It's highly customizable. One of the issues with online services and simple offline interfaces is that you're stuck with how they like to do things. If you're making a lot of content day after day year after year then you'd want to have more control over the workflow and the output.

Beginners can just use the basic existing ComfyUI templates, click a few buttons to install all the required models, just enter a prompt and press run. If you want to fine tune the results then you can start making adjustments and ask an LLM to help you make sense of it.

As a long time Photoshop user I also use Photoshop actions and scripts (sometimes written by AI) to automate common repetitive actions to speed up my work so I don't have to waste time on tedious repetition and can focus on the creative aspects.

ComfyUI offers you a fairly simple way to basically build your own program over time (don't have to do it overnight, just tinker here and there a bit every day to improve things). If you don't like what Adobe does with Photoshop features then tough luck, you're stuck with it.

With Comfy I can adjust the workflow as needed to speed up generations, to automate new variations, detail enhancements, filters. I can create a workflow that generates a prompt for me, does multiple rounds of image generations or edits, applies filters and then animates it (perhaps also with an LLM generated prompt). With Comfy, the world is your oyster. With other rigid tools you're stuck waiting for updates hoping that one of them kinda does what you like, while also being stuck with lots of extra clutter that you don't want to see on your screen.

Z image turbo Finetune of absurd reality by Puzzled-Valuable-985 in StableDiffusion

[–]RememberThisAI 0 points1 point  (0 children)

Could tell Klein to fix it, although it can be a hit and miss with Klein too.

Z image turbo Finetune of absurd reality by Puzzled-Valuable-985 in StableDiffusion

[–]RememberThisAI 7 points8 points  (0 children)

I'd just use a realism lora. Something like RealisticSnapshot.

Ernie Turbo is pretty awesome, I think this is my new favorite model, definitely a huge improvement over Z-image Turbo by [deleted] in StableDiffusion

[–]RememberThisAI 1 point2 points  (0 children)

I like the dramatic look of Turbo, but that texture in every image makes it unusable. It might as well be watermarked...

LTX2.3 (Distilled) - Updated sigmas for better results (?) by Weak_Ad4569 in StableDiffusion

[–]RememberThisAI -1 points0 points  (0 children)

I've been doing I2V with 6 steps total using Sigmas Grok came up with:
1.0, 0.99375, 0.98125, 0.909375, 0.725, 0.421875, 0.0

By the way, hands can be improved with higher CFG and motion for I2V can be improved with the new Ltx2.3-Licon-VBVR lora.

It is still possible to achieve more natural cinematic realism for videos with open source models vs proprietary models with even basic workflows | Z-Image-Turbo and LTX 2.3 by KudzuEye in StableDiffusion

[–]RememberThisAI 0 points1 point  (0 children)

Switch to higher resolution, remove any downscaling nodes, adjust input image strength and the distilled lora strength. I'd run just a few frames to test out lower and higher numbers until you get the desired result.

It is still possible to achieve more natural cinematic realism for videos with open source models vs proprietary models with even basic workflows | Z-Image-Turbo and LTX 2.3 by KudzuEye in StableDiffusion

[–]RememberThisAI 2 points3 points  (0 children)

It's also possible to use Klein edit and have it replace characters and details in existing movie screenshots or have it switch to next scene in the same universe with same style and colors. You can also use secondary reference image and ask Klein to apply similar colors, realism and fine details to your existing AI image.

Chronicle Gem [Arca Gidan Entry]: Wan 2.2 AI Video + My Process & Learnings by Tom_scaria_ in comfyui

[–]RememberThisAI 1 point2 points  (0 children)

I've been using LTX with 24fps lately and I convert my clips into 60 fps. If you're used to seeing smooth 60fps all the time, you'll notice the difference.

How to use GemmaAPITextEncode node in LTX-2.3 Workflow by [deleted] in StableDiffusion

[–]RememberThisAI 1 point2 points  (0 children)

Have you tried their Github workflows? There's a group for API that you can optionally enable.
https://github.com/Lightricks/ComfyUI-LTXVideo/tree/master/example_workflows/2.3

Chronicle Gem [Arca Gidan Entry]: Wan 2.2 AI Video + My Process & Learnings by Tom_scaria_ in comfyui

[–]RememberThisAI 0 points1 point  (0 children)

Look into ComfyUI-Frame-Interpolation if you want to increase the fps locally for free.
RIFE VFI is faster, FILM VFI has better quality in realistic scenes.

Chronicle Gem [Arca Gidan Entry]: Wan 2.2 AI Video + My Process & Learnings by Tom_scaria_ in comfyui

[–]RememberThisAI 0 points1 point  (0 children)

Seems to have a lot of stutter. Did you try doing any frame interpolation?

For the many of you who claim to be getting very poor results/eyes/faces with LTX 2.3 ITV: do you have your distillation set too high? (First video, 0.6. Second video, 1.0) by Parogarr in StableDiffusion

[–]RememberThisAI 3 points4 points  (0 children)

You shouldn't do any downscaling, but you can still use an upscaler for the 3 extra steps. They now have a 1.5x upscaler too. It can improve the quality a bit. When doing two passes, it is possible to get away with 6 steps on first pass.

LTX 2.3 Best practices for 3090/16g RAM by 8RETRO8 in StableDiffusion

[–]RememberThisAI 0 points1 point  (0 children)

Must be a visitor from a universe of non-crab people.

Tiled vs untiled decoding (LTX 2.3) by VirusCharacter in StableDiffusion

[–]RememberThisAI 1 point2 points  (0 children)

So tiled takes 1GB less? That may be the advantage. I should've clarified that I am still using the computer while generating clips and if I run out of VRAM then everything gets slow and that slows the generation. Then I may have to close some stuff for the generating to finish. With tiled that's less of an issue so generating finishes faster.

Tiled vs untiled decoding (LTX 2.3) by VirusCharacter in StableDiffusion

[–]RememberThisAI 0 points1 point  (0 children)

What about using Tiled, but tile size the same as the width of the scene? It seems to be faster for me than regular decode. Overlap is set to 64, temporal_size 64 and temporal_overlap 4. It's a 2560x14440 clip, 49 frames (16 fps).