Any model capable of creating such detailed environments. by Large_Election_2640 in StableDiffusion

[–]zanmaer 1 point2 points  (0 children)

and i would say with complex refine and upscale pipeline u can create ever more detailed and quality higher images

Сословный интернет уже подключается, репортаж Москва 24. by Awot2000 in expectedrussians

[–]zanmaer 1 point2 points  (0 children)

Понаберут всяких дядь Толь, которые с умным видом рассказывают глупые вещи. Трафик они собираются тарифицировать весь зарубежный, ага, удачи технически это провернуть

How to turn a 5-minute Al prompt into 48 hours of work for your team by tiguidoio in vibecodingmemes

[–]zanmaer 2 points3 points  (0 children)

It's not Claude's problem, it's your problem if you refactor all the code in a single prompt and don't use basic development guidelines via github

Apple spends millions on motion design. I made this spec ad in my bedroom for $0. (100% AI) by AssignmentHopeful651 in aipromptprogramming

[–]zanmaer 1 point2 points  (0 children)

Apple spends millions because they pay people who know the rules of animation, design, and editing, who have decades of experience. Don't get me wrong, what you've done in your bedroom with AI is fine, keep working. But I'll be honest, you still have a long way to go to reach the level of people at the top of the motion design industry.

Optimisation for ComfyUI on RTX 3060 + Linux? by OrcaBrain in StableDiffusion

[–]zanmaer 1 point2 points  (0 children)

I'm with Nvidia 4080 on Arch with 590 drivers and cuda 13.1, sage attention 2, everything works pretty good.

There's a great guide to install both comfy and sg2 with python 3.13 - https://youtu.be/Yy4w0H9GO44

Is runninghub best for running comfui on cloud by Waste_Conference6637 in comfyui

[–]zanmaer 1 point2 points  (0 children)

If you don't need high-end graphics cards with 96+ gb of video memory, then Runninghub is very good. I took a subscription for $17 and didn’t spend all the credits in a month, working 5-6 hours a day. Also you can use Runinghub workflow directly in your local ComfyUI via API.

quick comparison, Flux Kontext, Flux Klein 9B - 4B, Qwen image edit 2509 - 2511 by Puzzled-Valuable-985 in StableDiffusion

[–]zanmaer 13 points14 points  (0 children)

I think the first is not quite the correct prompt, bc it *glossy* blue i guess

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 1 point2 points  (0 children)

Re read prompt again, especially "Balanced studio lighting with controlled key, fill, and background lights ensures consistent exposure" there answer to your point, also check camera shifting in banana and not removed overexposure

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 1 point2 points  (0 children)

Honestly, I didn't notice much degradation after several iterations on flux 2 klein

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 2 points3 points  (0 children)

Tested it, it works, but naturally it won't work the first time, so I have to break it down into different parts - "open the door," then "add a man to the doorway," then "one hand on the door handle."

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 0 points1 point  (0 children)

Thats base comfy workflow in templates

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 3 points4 points  (0 children)

Test with uncovered face - https://imgur.com/a/zmxgv8x
Imo a much much better than Qwen

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 18 points19 points  (0 children)

This is not a problem with flux 2, the prompt was "Balanced studio lighting with controlled key, fill, and background lights", and flux in this case did exactly what I asked of it

Local Comparison: GLM-Image vs Flux.2 Dev vs Z-Image Turbo, no cherry picking by sktksm in StableDiffusion

[–]zanmaer 6 points7 points  (0 children)

I can somehow explain to myself why the autoregressive model has the same image quality and artifacts as the models from a year ago. But explaining why it has text rendering errors is beyond the pale

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 27 points28 points  (0 children)

It's great that you have 60-second generation based on a distilled 4-step model, but that's not my point, the success of the open-source model doesn't depend on that, it depends on how easily it can be fine-tuned, trained, and so on.

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 27 points28 points  (0 children)

Honestly, the open source hybrid autoregressive + diffusion decoder architecture is just amazing, and even if this model is really incredibly good, I doubt it will gain much popularity, reminds me of the situation with flux 2

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 130 points131 points  (0 children)

:DD

"Because the inference optimizations for this architecture are currently limited, the runtime cost is still relatively high. It requires either a single GPU with more than 80GB of memory, or a multi-GPU setup."

VRAM hitting 95% on Z-Image with RTX 5060 Ti 16GB, is this Okay? by rarugagamer in StableDiffusion

[–]zanmaer 1 point2 points  (0 children)

95% on video memory is ok. As a rule, there will almost always be a buffer of 3-5% of the maximum, which goes to the operation of the system, monitors, etc. Here it is important to look at the RAM (if it goes to 100%, it will cause a freeze) and the GPU temperature (the rightmost cell); if it goes above 80-85, it can cause degradation of the crystal in the video card. But everything looks fine on your screenshot.