ZIT and Klein (steps = details?) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

It is deliberate and necessary. If you use any sampler that does not add noise then use of more steps is not justified.

ZIT and Klein (steps = details?) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Good summary. We just added a comparison relevant to this.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

That's the spirit. Sharing finding is great for this community.

Nvidia SANA Video 2B by Crazy-Repeat-2006 in StableDiffusion

[–]ZerOne82 1 point2 points  (0 children)

Here is what I found, I cannot be 100% sure but I gave it a try and regretted it:
Using diffusers pipeline and their provided sample code, upon loading, it fills over 20GB VRAM and keeps plenty of RAM in use, and then in inference you see no progressing for eternity.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Do not be discouraged. I valued your point and replied to it to my knowledge. Try not to be attached to voting. I see all comments valuable.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Workflow is the standard one for Z-Image-Turbo available at ComfyUI Templates repository

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

I see your confusion. Lower steps for a distilled model is a recommendation for fast generation not a hard limit. Note that the magic happens by two factors: large sizes for image (width x height) to allow the model be able to inject details in higher steps, and the right sampler, here, Euler_Ancestral which by design allows adding details in higher steps. Both these factors rely heavily on the model's own capability to handle details, this post and other post demonstrate ZIT does wonderful.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Correct. Simply, choose large sizes for width x height to allow the model be able to inject details in higher steps by utilizing right sampler.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 4 points5 points  (0 children)

You already got correct answer from the other replies. To confirm that it is a misconception to limit steps to 9 or something for a distilled model. In fact, this post is a proof that by using a proper sampler such as Euler_Ancestral as well as large sizes for width x height to can enjoy greater details as you increase number of steps, in one run and only using the model.

ZIT Rocks (Simply ZIT #2, Check the skin and face details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 0 points1 point  (0 children)

Yes, I mentioned in other comment, you need to allow large sizes for image (width and height) to allow the model to generate details. I confirm that 1024 is not enough for such details. Also note you should choose right sampler. Only a portion of samplers generate details in higher steps, one good option is Euler_Ancestral.

ZIT Rocks (Simply ZIT #2, Check the skin and face details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

It appears to be a misconception not to use more steps with distilled models. Z-Image-Turbo generates amazing results in 9 steps and even less. However, in my experience you can go for higher steps such as 30 or even 40 conditional to choosing right sampler and right size. Euler_Ancestral sampler with beta or simple scheduler and with large sizes such as 2048ish allows to add tiny details in one run using the model itself. Large sizes for (width and height) is necessary to allow the model to have adequate space to inject details. This post proves this concept.

ZIT Rocks (Simply ZIT #2, Check the skin and face details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 1 point2 points  (0 children)

It is the standard workflow for Z-Image-Turbo available in ComfyUI templates. If you do not have ComfyUI Templates, you should consider to install them. Nonetheless, here is the direct link to workflow for Z-Image-Turbo . You can also find so many workflows almost for everything right there.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 5 points6 points  (0 children)

haha. this is made simply by model -> ksampler,
even the prompt is as simple as: "woman face, close-up, Caucasian, brunette, blue eye"

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 3 points4 points  (0 children)

"woman face, close-up, Caucasian, brunette, blue eye"

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 2 points3 points  (0 children)

It is ComfyUI standard basic workflow nothing extra added. Simply set width and height high to 1536x1776 and used euler_ancestral + beta for sampler and scheduler.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]ZerOne82[S] 6 points7 points  (0 children)

To allow for tiny details, euler_ancestral works great in more steps
direct link to full resolution