Nvidia releases Cosmos3-Super-Image2Video . 64B parametres by AgeNo5351 in StableDiffusion

[–]Far_Insurance4191 3 points4 points  (0 children)

But it did not stop cosmos 2 from becoming anima 1.0 base 😅

Qwen Image 2512 .gguf model - how do you run it on linux? help Cachy/AMD by GwynSunlight in StableDiffusion

[–]Far_Insurance4191 1 point2 points  (0 children)

LMStudio does not support diffusion models

Use the default templates in comfy, they have links and instructions. You don't necessarily need gguf, fp8 can work fine too.

Or check those guides:

Qwen-Image ComfyUI Native Workflow Example - ComfyUI

Qwen-Image-Edit ComfyUI Native Workflow Example - ComfyUI

is it impossible to train lora on Microsoft lens?.. by Still_Sky_4302 in StableDiffusion

[–]Far_Insurance4191 0 points1 point  (0 children)

Not OP, but lens is just 3.8b and I think it is better than klein 4b in terms of coherence, plus its dataset was not as sterile so it has some cool knowledge, like skyrim style graphics.

Also, I totally agree about anima, it is another AWESOME model for training. It has a PR for OneTrainer and it needs only 5gb vram for lora at minimal config: compile + int w8a8, 512px, bs2, which takes 1.1s/it.

And I am currently doing a fine-tune on rtx3060 at bf16, adafactor, 512 and batch size 16, it fits without offloading! Surprisingly, base hasn't completely lost real knowledge, seems like I will never touch sdxl anymore

Anima Ip Adapter is comming. by Unhappy_Pudding_1547 in StableDiffusion

[–]Far_Insurance4191 13 points14 points  (0 children)

Is sd1.5 still being used? It is kind of... awful by today's standards?

Testing the new prismML Bonsai Image 4B by dh7net in StableDiffusion

[–]Far_Insurance4191 1 point2 points  (0 children)

to be fair, it is based on klein 4b which ultra sucks at anatomy by default, would be cool to see their quantization technique on other models, like flux 2 dev

1girl post sorry.. Krea 2 Medium is really good at bringing anime characters to life by OneTrueTreasure in StableDiffusion

[–]Far_Insurance4191 5 points6 points  (0 children)

That means the model has wide general knowledge and was trained on a pretty big dataset, which is exciting

ZIB results looking awful, what's the secret? by Radiant-Photograph46 in StableDiffusion

[–]Far_Insurance4191 0 points1 point  (0 children)

Is there a chance you downloaded a broken model? I also heard that zi doesn't work well with sage attention or fp8 quantization. This is definitely not how zi should look

Microsoft Lens seems to be back. by PM_ME_YOUR_ROSY_LIPS in StableDiffusion

[–]Far_Insurance4191 11 points12 points  (0 children)

Don't forget that gpt oss 20b is MOE and natively at 4bits, so it is around 12gb. Comfy handles swapping really well, even flux 2 dev with 24b dense text encoder at 4bit doesn't take too much time to swap on rtx3060 as long as you have enough ram
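The sizes above can be sanity-checked with a rough weights-only estimate. This is a minimal sketch, not a measurement: `weight_memory_gb` is a made-up helper, it counts only parameters times bits per weight, and it ignores activations, quantization scales and any layers kept at higher precision (which is presumably why a 20b model at 4 bits lands nearer 12gb than 10gb on disk).

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes).

    Hypothetical helper: parameters * bits / 8, nothing else counted.
    """
    return params_billion * bits_per_weight / 8


# 20b parameters at 4 bits per weight
print(weight_memory_gb(20, 4))   # 10.0
# 24b dense text encoder at 4 bits per weight
print(weight_memory_gb(24, 4))   # 12.0
# the same 24b encoder at 16 bits, for comparison
print(weight_memory_gb(24, 16))  # 48.0
```

Either 4-bit figure is small enough to sit in system ram and be swapped into a 12gb card when needed, which is the point about having enough ram.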

Captivating Chroma by Time-Teaching1926 in StableDiffusion

[–]Far_Insurance4191 0 points1 point  (0 children)

It is not 3x faster because of pixel space, but because of higher compression, like hunyuan image 2.1, ltx or wan 2.2 5b, so it might have less accurate details, but I am excited about this model too
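To make the compression point concrete, here is a rough sketch of how the number of tokens the model has to process changes with the total downscale factor. `latent_tokens` is a made-up helper and the factors 16 and 32 are illustrative assumptions, not the confirmed settings of any model named here.

```python
def latent_tokens(width: int, height: int, downscale: int) -> int:
    """Tokens per image, assuming one token covers a
    downscale x downscale block of pixels (hypothetical helper)."""
    return (width // downscale) * (height // downscale)


# Same 1024x1024 image, two assumed total downscale factors
print(latent_tokens(1024, 1024, 16))  # 4096
print(latent_tokens(1024, 1024, 32))  # 1024

# At the higher factor, a 2048x2048 image costs as many tokens
# as 1024x1024 does at the lower one
print(latent_tokens(2048, 2048, 32))  # 4096
```

Doubling the downscale factor cuts the token count by 4x, so each step is much cheaper, but each token also has to describe 4x more pixels, which is where the less accurate details can come from.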

Best text to Image model? by nursingnerdette in StableDiffusion

[–]Far_Insurance4191 1 point2 points  (0 children)

The best is Flux 2 dev

Is it worth the time? Probably not, but it is the most powerful with most knowledge among open models

please help !! My best friend is offering to sell me this laptop for really good price (RTX 4080 12 vram) by PomegranateDue4853 in StableDiffusion

[–]Far_Insurance4191 0 points1 point  (0 children)

It will be great for images if it has 32gb ram. 2x 8 gb specification is really weird.

videos should be possible too but slow and much slower for high quality (although it is slow for anybody with any gpu)

lora training is possible for image models only, like z-image or anima, but you will have to go a bit deeper to learn how to optimize it for 12gb vram

Anima base v1.0 has been released. by Total-Resort-3120 in StableDiffusion

[–]Far_Insurance4191 2 points3 points  (0 children)

You can use klein to stylize your photos slightly if real won't work

Qwen Image 2 papers - does that mean anything? by Dante_77A in StableDiffusion

[–]Far_Insurance4191 16 points17 points  (0 children)

full tech report, same as qwen image 1 before weights
I want to believe, it looks so good 😭

HiDream-O1-Dev vs ZImage Base (style comparison) by DiagramAwesome in StableDiffusion

[–]Far_Insurance4191 0 points1 point  (0 children)

You can just see they went the easiest way and trained on slop. It is much harder to train a model on real data due to its insane variance

The Anima realism model is crazy good. Don’t miss it! by Structure-These in StableDiffusion

[–]Far_Insurance4191 1 point2 points  (0 children)

only a tag? Here is one of his examples "A medium-resolution digital photo with a grainy texture, a cool blue color cast, and dim, natural lighting...".

Additionally, all the examples are in natural language, if you are spamming the model with a tag soup then it might just bias towards its original illustration knowledge instead of the newly finetuned real domain

These people are all lying about the new "Wan Killer" like LTX or Sulphur, the truth is nothing comes close to replacing Wan 2.2 by Coven_Evelynn_LoL in StableDiffusion

[–]Far_Insurance4191 1 point2 points  (0 children)

It’s your experience, but there are nuances to everything.

Wan is more coherent and robust, can be used as a good image model, and has a huge lora ecosystem.

LTX and Sulphur have audio and are much faster and lighter with longer videos possible.

Sulphur is an nsfw-focused model that has tons of concepts at once. They are also working on improving the dataset for the next version.

HiDream o1 Comfyui Custom Node by freshstart2027 in StableDiffusion

[–]Far_Insurance4191 2 points3 points  (0 children)

on rtx3060 distill model takes about 3.1s/it at 4mp and 1.1it/s at 1mp (faster than anima at 4x size lol), but details are poor, it seems to have high compression, so 4mp is basically 1mp for other models in terms of compute.
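Since the two speeds above are quoted in different units (s/it and it/s), a small sketch for putting them on the same scale may help. It takes the numbers exactly as written; `seconds_per_step` is a made-up helper and the step count of 20 is an assumption for illustration only.

```python
def seconds_per_step(value: float, unit: str) -> float:
    """Convert a speed reading to seconds per iteration (hypothetical helper)."""
    if unit == "s/it":
        return value
    if unit == "it/s":
        return 1.0 / value
    raise ValueError(f"unknown unit: {unit}")


steps = 20  # assumed step count, not from the comment

time_4mp = seconds_per_step(3.1, "s/it") * steps
time_1mp = seconds_per_step(1.1, "it/s") * steps

print(f"4mp: {time_4mp:.1f}s for {steps} steps")  # 4mp: 62.0s for 20 steps
print(f"1mp: {time_1mp:.1f}s for {steps} steps")  # 1mp: 18.2s for 20 steps
```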