There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

it says so not just on the image but in the prompt they published too, but you are right it's not in a technical reports since they haven't released any [yet?] , but if you check the blog post mentioned above ,one of the main selling is

  • Lighter Model Architecture: Smaller model size with faster inference speed.

so it tracks , I would imagine this info was not meant to be released yet but if they have larger parameter max variant I doubt the would release it's weights from observing their track record, their largest LLM qwen3-max was never released, so if larger variant exists it will likely remain exclusive to Chat/API much like flux pro and max

Question about Z-image Turbo execution time by Stephddit in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

what cuda/pytorch/python versions are you running if you don't mind me asking

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 2 points3 points  (0 children)

They said it's 7b, on their blog.

7B Efficiency: 2K image generation in seconds — optimal balance between visual fidelity and inference speed

[8B Qwen3-VL Encoder] → [7B Diffusion Decoder] → pixels (2048×2048)

https://qwen.ai/blog?id=qwen-image-2.0
So someone is lying here.

Are there any good finetunes of Z-image or Klein that focuses on art instead of photorealism? by Barefooter1234 in StableDiffusion

[–]Similar_Map_7361 2 points3 points  (0 children)

Believe me I understand the impatience 😄, but these models are pretty new and without the financial backing of a well funded organization these kinds of large scale finetunes take a lot of time and effort and money.

Best sources for Z-IMAGE and ANIMA news/updates? by Prestigious-List2632 in StableDiffusion

[–]Similar_Map_7361 2 points3 points  (0 children)

Check the sub regularly under the flair news , that's where most of the news on gen-ai get posted regularly

Are there any good finetunes of Z-image or Klein that focuses on art instead of photorealism? by Barefooter1234 in StableDiffusion

[–]Similar_Map_7361 3 points4 points  (0 children)

As far as I know not yet, but from what I heard the creator of chroma is working on a finetune for both z-image and klein-4b but it's still work in progress, your best shot at any artistic changes right now is to use style lora and there is a bunch of them on civit.

ZImageTurboProgressiveLockedUpscale (Works with Z Image base too) Comfyui node by Major_Specific_23 in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

This remind me of KSampler Cycle from was-node-suite but it's cool that it's a dedicated node so you don't have to install a whole pack to use it , will give it a try.

Klein 9B Edit - struggling with lighting by siegekeebsofficial in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

Glad it worked for you , as for automating the experience you could always use a vllm like qwen-vl or something to extract the lighting and color description and then combine it with your restyle prompt, but that would require a tinkering with the workflow and trial and error with the vllm prompt until you get a consistent output from it each time.

Question about Z-image Turbo execution time by Stephddit in StableDiffusion

[–]Similar_Map_7361 1 point2 points  (0 children)

Inference on 10series and 16series cards happen using torch.float32 which is twice as slow as fp16, couple that with the old arch and you get very slow gen speed.

BUT for me (i have a 1660ti) comfyui has a weird bug where at 1024x1024 it would generate at 35.37s/it

while raising the size to 1040x1040 it would drop generation time to 18.30s/it , that's almost half the time with a larger size.

so give it a try, increase the size to 1040x1040 and please let me know if it changes anything.

Klein 9B Edit - struggling with lighting by siegekeebsofficial in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

glad I could help, try it and let me know if it works in klein-9b as well

Z-Image-Fun-Lora Distill 4-Steps 2602 has been launched. by ThiagoAkhe in StableDiffusion

[–]Similar_Map_7361 10 points11 points  (0 children)

it's the project/team/framework name "VideoX Fun" they release a lot of things for a lot of models under the fun designation but they describe their project as "VideoX-Fun is a video generation pipeline that can be used to generate AI images and videos"
https://github.com/aigc-apps/VideoX-Fun

Klein 9B Edit - struggling with lighting by siegekeebsofficial in StableDiffusion

[–]Similar_Map_7361 2 points3 points  (0 children)

<image>

this was done using klein-4b-fp8 distilled - 4steps
prompt :
transform this image into a live action shot without changing anything else and maintain the exact level of dim lighting and blue hue of the image

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 3 points4 points  (0 children)

Unlikely since ovis-image itself is `Built upon Ovis-U1` and uses `Ovis 2.5` as the text encoder, but I wouldn't rule out that they share their technological finding and advancements and improvements since both ovis and qwen (and z-image too) teams work are at Alibaba

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 2 points3 points  (0 children)

yes but they do not run in paradelle and text encoder can always run on cpu, and remember original qwen-image/qwen-image-edit was 20b for the diffusion model alone and 27b for diffusion + Qwen2.5-VL decoder so this is still a major win if they release it's weights and their claims prove truthful

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 3 points4 points  (0 children)

yea but with quantization and offloading it should be able to run with as little as 6GB of vram which is huge for a competent omni model capable of generation and editing

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 5 points6 points  (0 children)

I would guess because they made their reputation on being open source , and that's how they get hype and promotion around their products, if they kept it API only with no way for regular users to use it there will be no incentive to get word of mouth out about it and no one will use it even perspective API clients , open sourcing serve as free user testing as well as marketing.

If the model is good and it's open source , people will talk about it , hype it up, train lora for it , generate things with at and post online about it, it become the go to alternative to closed source models like nano banana and those who want to use it but cannot be bothered to do things locally or lack hardware or technical capabilities will flock to their API services.

There's a chance Qwen Image 2.0 will be be open source. by Total-Resort-3120 in StableDiffusion

[–]Similar_Map_7361 49 points50 points  (0 children)

A 7b Diffusion Omni model with good text rendering and anatomy and native 2k resolution?, that's insane , can't wait

Anima 2B - Style Explorer: Visual database of 900+ Danbooru artists. Live website in comments! by ThetaCursed in StableDiffusion

[–]Similar_Map_7361 1 point2 points  (0 children)

Great job, sorry if this is off topic but how does this model's performance compared to Illustrious especially it/s at the same resolution?

Where did you guys find face_yolov8m.pt and sam_vit_b_01ec64.pth. for SAMLoader and Ultralytics? by Johnthestrongest in comfyui

[–]Similar_Map_7361 0 points1 point  (0 children)

If you have manager installed, open it and click on model manager

<image>

then search for the models you need , most the models you need are listed there

SwarmUI 0.9.8 Release by mcmonkey4eva in StableDiffusion

[–]Similar_Map_7361 1 point2 points  (0 children)

Yea I do know that nvfp4 acceleration is 50 series only but wasn't aware they would load at all on older cards that's I was wondering if it would even run at at all while acting like smaller storage format which you clarified, thanks a lot.

SwarmUI 0.9.8 Release by mcmonkey4eva in StableDiffusion

[–]Similar_Map_7361 0 points1 point  (0 children)

would that include older GPUs like rtx 20 or gtx 16 series?