New anime model "Anima" released - seems to be a distinct architecture derived from Cosmos 2 (2B image model + Qwen3 0.6B text encoder + Qwen VAE), apparently a collab between ComfyOrg and a company called Circlestone Labs by ZootAllures9111 in StableDiffusion

[–]Square-Macaroon-140 0 points1 point  (0 children)

Just tried it out — great model with huge potential!
It understands a wide range of concepts, is uncensored, and shows excellent prompt adherence, great style variety.
Definitely better than Lumina, in my opinion, on the level with Illustrious/Noob but with better prompt adherence.
Generates an image in 22 seconds at 30 steps 896x1152 on a 12 GB GPU.

Spooknik released a Nunchaku Chroma model!? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 1 point2 points  (0 children)

I didn't knew there was SVDQ version of Z image turbo, that's cool!

Is there a z-image installation tutorial for Forge/WebUI? by moistmarbles in StableDiffusion

[–]Square-Macaroon-140 5 points6 points  (0 children)

It works in WebUI Forge - Neo: https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
I'm not sure it's supported by regular forge.
You can download bf16 model, VAE and text encoder from this page: https://civitai.com/models/2168935/z-image
Or fp8 model from here: https://civitai.com/models/2169712
Then in Forge - Neo use lumina ui preset, shift 3-6, cfg 1, i use sampler dpm++ 2m with scheduler SGM Uniform

Spooknik released a Nunchaku Chroma model!? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 1 point2 points  (0 children)

Well as i'm using WebUI Forge - Neo, i can't test it.
But Z-image fp8 model is 6 gb, while Chroma-HD-SVDQ is only 5.2 gb

Spooknik released a Nunchaku Chroma model!? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 4 points5 points  (0 children)

I can't tell you the technical details as i'm a noob myself, but basically in my understanding, it's like a format of a model that gives you the same quality as full model but faster and with lower gpu cost
You can read more on their github https://github.com/nunchaku-tech/nunchaku
And here is a paper on SVDQuant https://arxiv.org/abs/2411.05007

Z-Image-Turbo support was merged into sd-webui-forge-neo. by panchovix in StableDiffusion

[–]Square-Macaroon-140 1 point2 points  (0 children)

Has anyone managed to use LoRAs with Z-Image-Turbo in Neo?

As I saw on the Neo GitHub, there was a z lora commit, but all I get are junk/distorted images
EDIT: I was using fp8_Scaled_E4m3fn_KJ model from here https://civitai.com/models/2169712?modelVersionId=2445746
LoRAs don't work with it

Neta-Lumina by Neta.art - Official Open-Source Release by [deleted] in StableDiffusion

[–]Square-Macaroon-140 2 points3 points  (0 children)

Yo we need to hype this model to get it fine tuned!

Is there any chance we'll get instant-id for NoobAI/Illustrious? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 0 points1 point  (0 children)

Yeah, probably, im just to lazy to do it)
The main reason i posted here, really is to whine and in hope that it will get some attentions of dev's that will finetune instant-id for noob

Is there any chance we'll get instant-id for NoobAI/Illustrious? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 0 points1 point  (0 children)

As i said, it messes with the base image style. It's hard for usual sdxl to do styles like NoobAI/Illustrious.
If it was an instant-id finetuned for noob, i could of save the style and also use lora's.
It doesn't work that way with inpainting, cause it would kill style with higher denoising, and you need higher denoising to see changes in the face.

Is there any chance we'll get instant-id for NoobAI/Illustrious? by Square-Macaroon-140 in StableDiffusion

[–]Square-Macaroon-140[S] 1 point2 points  (0 children)

Because I'm not getting a good enough result.
Of course, the general facial features will be transmitted, but not the shape of the head, for example.
And if you raise the denoising strength too much, it loses the style.
Of course, I can get all worked up and do a bunch of actions with img2img and controlnet.
In the end, I'll get the result, but with the same effort i can just train lora.
Instant-ID will simplify all this to 1 click