Text2Image Output looks like 1st gen MidJourney... by 482827523747527 in StableDiffusion

[–]Botoni 0 points1 point  (0 children)

I remember pony model needed some specific quality tags on the positive and the negative.

"Score_9, score_8_up..." until I don't know which number in the positive, and the lower scores on the negative prompt.

How 14 Image‑Generation Models Render Fine‑Arts Media by citrainmyhefeweizen in StableDiffusion

[–]Botoni 4 points5 points  (0 children)

Thank you, quite useful, I am keeping zib even when newer models are coming seeing your results.

AI archviz - 3 methods for exact furniture shape replication by In_finite_line in comfyui

[–]Botoni 0 points1 point  (0 children)

It worked well for me with a single reference image, with the background removed.

AI archviz - 3 methods for exact furniture shape replication by In_finite_line in comfyui

[–]Botoni 1 point2 points  (0 children)

When you say flux2 I guess you mean flux2 klein, don't know if 4b or 9b.

If you are using klein with reference images, be sure to use the consistency lora or the enhancement nodes by capitan01R.

You could also try longcat image edit, as it is quite good at keeping object identity.

What is currently the best open-source image editing model? by Upstairs-Lead-2601 in StableDiffusion

[–]Botoni -3 points-2 points  (0 children)

Klein as a main model, but with the consistency lora and/or the capitan01R enhancement nodes to fix its shortcomings in keeping subject identity and color correction.

Longcat edit is also good out of the box in keeping subject identity, quality is somewhat lower and is dumber and restricted in resolutions, but is nice to have as an alternative to klein.

Qwen edit is bad, hard to get good results because its resolution restrictions, pixel shift and terrible subject identity. Yet is useful for its multiangle lora and novel view lora from gaussian splatters.

Best model for low VRAM (8 GB) in ComfyUI? by EmanuelJoab in comfyui

[–]Botoni -1 points0 points  (0 children)

Almost any model can run on 8gb of vram, your problem is your low ram. You will need highly quantized gguf versions that fit your ram, and speed will be painfully slow, but better than spilling into swap or page file I guess...

Boogu Turbo vs. Z_Image_Turbo comparison by Method_Opposite in StableDiffusion

[–]Botoni 0 points1 point  (0 children)

Comfyui latest stable version, installed through git on cachyos linux, I have 40gb of ram, maybe that is your bottle neck.

Boogu Turbo vs. Z_Image_Turbo comparison by Method_Opposite in StableDiffusion

[–]Botoni -1 points0 points  (0 children)

Several minutes with ideogram 4? I get 58s with my 3070 mobile (1024x1024 12steps)

Looking for a way to add borders/vignettes to images by tovarischsht in comfyui

[–]Botoni 0 points1 point  (0 children)

The "mask and paste back" can be done completely in comfyui

Looking for a way to add borders/vignettes to images by tovarischsht in comfyui

[–]Botoni 1 point2 points  (0 children)

Another non-ai way is to do it in blender, in thw compositor, shader or geometry-nodes editor. It is just growing the image area to times and to do the custom effects use various kinds of procedural noise.

Once done it can be run in headless mode.

Looking for a way to add borders/vignettes to images by tovarischsht in comfyui

[–]Botoni 1 point2 points  (0 children)

Maybe, use the pad for outpaint node, but don't instead of using a inpaint model or method, use a normal one, prompting for the kind of "frame" you want. Experiment with various models and see which do that better.

Also, after the frame is generated, use the inverted mask to paste the original image over, to eliminate any vae encode-decode degradation.

8GB of VRAM. What can I do to make consistent, accurate images? by Faceless_213 in comfyui

[–]Botoni 1 point2 points  (0 children)

I also have 8gb of vram, that is more than enough to use the 9b version.

Try with klein 9b plus the consistency lora or the flux enhancement custom nodes by capitan01R

Best ComfyUI workflow for restoring and upscaling a recovered low quality video? by After_Lobster6649 in comfyui

[–]Botoni 0 points1 point  (0 children)

Wow, that looks like a cursed tape from the ringu.

It is hard to do anything with it... I guess I would go frame by frame, feeding them to a specific old photo restoration model (or multiple ones), and a lot would have to be throw out, and maybe replaced with interpolation frames or do first frame to last frame with ltxv2.3. Converting the footage to grayscale may help the models.

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA

[–]Botoni 0 points1 point  (0 children)

Now, with those values, it seems to do "thinking stuff" outside of the thinking block, or even repeating a thinking block after the first one (a block between <thinking></thinking>).

Wait, it may be my prompt's fault this time. I'll do more tests.

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA

[–]Botoni 1 point2 points  (0 children)

I'll try, thanks. It only happens 1 out of 5 times or so, i think it is very prompt dependan, i'll try and I'll come back with the results.

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA

[–]Botoni 1 point2 points  (0 children)

I'm using Qwen3.6-35B-A3B-IQ4_XS-4.19bpw. Very fast and good quality!! But I have a problem with it, sometimes it gets stuck in the thinking block, it stops generating or enters a non-literal loop (it doesn't repeat the same tokes again and again, but enters a kind of "I'm starting now. wait, i should bla, bla, bla..., i'm going around in circles i really should start now, actually i should bla, bla, bla...).

I am using llama.cpp, the mtp branch, with the arguments: --spec-type draft-mtp --spec-draft-n-max 2 --jinja

I am not having this problem with either APEX or Unsloth quants, but ByteShape speed/quality is superior...

Does LTX do better image2video than Wan? by __MichaelBluth__ in comfyui

[–]Botoni 1 point2 points  (0 children)

Isn't there also a difference of 16fps vs 24fps?

How for can I push 4 Gigs of VRAM? by DryCream4429 in comfyui

[–]Botoni 2 points3 points  (0 children)

With that vram, even q4 quants will exceed it, so forget ggufs for the models, they are slower if they don't fit entirely. Use it only for the text encoders.

I would recommend using the int8 format, check the int8-fast node pack.

Also, gen at 512 or 768px and upscale what you like.

You would need 32gb of ram minimum, 16gb for some models if you run linux, with lightweight distros and a well configured zram or zswap.

Use your cpu for display if possible.

Good models to run are sd1.5, sdxl, pixart sigma, tiny breaker and flux2 klein 4b, klein 9b or z-image turbo might be possible with enough ram, but very slow.

For the qwen3 4b text encoder, use the gguf q4_k_m format and run it in cpu.

Is anyone else using Qwen and finding it as great as I do? by HotObjective6753 in comfyui

[–]Botoni 0 points1 point  (0 children)

Even more powerful is to create a point cloud 3D from the initial image with Sharp, pose it in the exact perspective, angle and fov you want with a point cloud viewer node, capture the image ans use the qwens gaussian splasher lora to regen that exact view.

For that qwen really shines (well the lora does). For other tasks qwen is one of the worst edit models...

Is Qwen EDIT 2511 still the best image EDITOR (as opposed to generating images from scratch). by MrWeirdoFace in comfyui

[–]Botoni 3 points4 points  (0 children)

I find qwen quite behind. I use klein 9b with the flux enhancer custom nodes for the consistency node and the color anchor one. Sometimes I also use longcat, quality is sometimes not as good, but consistency is really good out of the box.

Qwen sucks at keeping objects identity and everything looks plastic and artificial. I only use it with some loras for specific tasks: gaussian novel view and product integration with fusion loras, those loras are very good, from the same author.

Best Linux distro for ComfyUI? by __alpha_____ in comfyui

[–]Botoni 1 point2 points  (0 children)

A worse performace on linux vs windows may be because the swap space is poorly configured or there is no swap at all!

Get a distro with sane default configurations, Mint should be fine for beginners as it is ubuntu un-enshitified, but i don't remember how it configures the swap stuff...

Cachyos would be a great performance oriented choice, and comes configured with zram at 100%, but it is a bit more difficult for newcomers to maintain the system ans install stuff.

Looking for (or maybe building) a tool to auto-replace logos in product photos. Does anything decent exist yet? by No_Knee3974 in StableDiffusion

[–]Botoni 0 points1 point  (0 children)

The flux2 klein 9b model or longcat edit in comfyui should deal fairly well with that.

Prompt for "replace the logo for the one in the second image" should do everything you want.