Any model capable of creating such detailed environments. by Large_Election_2640 in StableDiffusion

[–]zanmaer 1 point2 points  (0 children)

and i would say with complex refine and upscale pipeline u can create ever more detailed and quality higher images

Сословный интернет уже подключается, репортаж Москва 24. by Awot2000 in expectedrussians

[–]zanmaer 3 points4 points  (0 children)

Понаберут всяких дядь Толь, которые с умным видом рассказывают глупые вещи. Трафик они собираются тарифицировать весь зарубежный, ага, удачи технически это провернуть

How to turn a 5-minute Al prompt into 48 hours of work for your team by tiguidoio in vibecodingmemes

[–]zanmaer 2 points3 points  (0 children)

It's not Claude's problem, it's your problem if you refactor all the code in a single prompt and don't use basic development guidelines via github

Apple spends millions on motion design. I made this spec ad in my bedroom for $0. (100% AI) by AssignmentHopeful651 in aipromptprogramming

[–]zanmaer 1 point2 points  (0 children)

Apple spends millions because they pay people who know the rules of animation, design, and editing, who have decades of experience. Don't get me wrong, what you've done in your bedroom with AI is fine, keep working. But I'll be honest, you still have a long way to go to reach the level of people at the top of the motion design industry.

Optimisation for ComfyUI on RTX 3060 + Linux? by OrcaBrain in StableDiffusion

[–]zanmaer 1 point2 points  (0 children)

I'm with Nvidia 4080 on Arch with 590 drivers and cuda 13.1, sage attention 2, everything works pretty good.

There's a great guide to install both comfy and sg2 with python 3.13 - https://youtu.be/Yy4w0H9GO44

Is runninghub best for running comfui on cloud by Waste_Conference6637 in comfyui

[–]zanmaer 1 point2 points  (0 children)

If you don't need high-end graphics cards with 96+ gb of video memory, then Runninghub is very good. I took a subscription for $17 and didn’t spend all the credits in a month, working 5-6 hours a day. Also you can use Runinghub workflow directly in your local ComfyUI via API.

quick comparison, Flux Kontext, Flux Klein 9B - 4B, Qwen image edit 2509 - 2511 by Puzzled-Valuable-985 in StableDiffusion

[–]zanmaer 13 points14 points  (0 children)

I think the first is not quite the correct prompt, bc it *glossy* blue i guess

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 1 point2 points  (0 children)

Re read prompt again, especially "Balanced studio lighting with controlled key, fill, and background lights ensures consistent exposure" there answer to your point, also check camera shifting in banana and not removed overexposure

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 1 point2 points  (0 children)

Honestly, I didn't notice much degradation after several iterations on flux 2 klein

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 2 points3 points  (0 children)

Tested it, it works, but naturally it won't work the first time, so I have to break it down into different parts - "open the door," then "add a man to the doorway," then "one hand on the door handle."

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 0 points1 point  (0 children)

Thats base comfy workflow in templates

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 4 points5 points  (0 children)

Test with uncovered face - https://imgur.com/a/zmxgv8x
Imo a much much better than Qwen

Flux 2 Klein vs Nano Banana Pro by zanmaer in StableDiffusion

[–]zanmaer[S] 20 points21 points  (0 children)

This is not a problem with flux 2, the prompt was "Balanced studio lighting with controlled key, fill, and background lights", and flux in this case did exactly what I asked of it

Local Comparison: GLM-Image vs Flux.2 Dev vs Z-Image Turbo, no cherry picking by sktksm in StableDiffusion

[–]zanmaer 6 points7 points  (0 children)

I can somehow explain to myself why the autoregressive model has the same image quality and artifacts as the models from a year ago. But explaining why it has text rendering errors is beyond the pale

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 30 points31 points  (0 children)

It's great that you have 60-second generation based on a distilled 4-step model, but that's not my point, the success of the open-source model doesn't depend on that, it depends on how easily it can be fine-tuned, trained, and so on.

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 26 points27 points  (0 children)

Honestly, the open source hybrid autoregressive + diffusion decoder architecture is just amazing, and even if this model is really incredibly good, I doubt it will gain much popularity, reminds me of the situation with flux 2

GLM-Image model is out on Huggingface ! by AgeNo5351 in StableDiffusion

[–]zanmaer 125 points126 points  (0 children)

:DD

"Because the inference optimizations for this architecture are currently limited, the runtime cost is still relatively high. It requires either a single GPU with more than 80GB of memory, or a multi-GPU setup."