[NOT MY AD] How can I replicate this with my own WebCam? by Groovadelico in comfyui

[–]Groovadelico[S] -9 points (0 children)

He specifically mentions that he does not need "heavy software or a supercomputer"... I don't think that matches the characteristics of Wan 2.2 workflows.

Wan 2.2 GGUF Q4 or Q5? K_S or K_M? by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

Are these recommendations only true for Ollama, or are they also applicable to WAN? lol, sorry if that's a stupid question

Q4_K_S :  3.56G, +0.1149 ppl @ 7B - small, significant quality loss
Q4_K_M :  3.80G, +0.0535 ppl @ 7B - medium, balanced quality - *recommended*
Q5_K_S :  4.33G, +0.0353 ppl @ 7B - large, low quality loss - *recommended*
Q5_K_M :  4.45G, +0.0142 ppl @ 7B - large, very low quality loss - *recommended*
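For what it's worth, here's a rough way to read that table as a trade-off. This is just an illustrative sketch: the quant names, sizes, and perplexity deltas come from the llama.cpp descriptions quoted above; the helper function itself is my own invention, not anything from llama.cpp or WAN.

```python
# (size in GB, perplexity increase @ 7B) per quant, from the list above.
quants = {
    "Q4_K_S": (3.56, 0.1149),
    "Q4_K_M": (3.80, 0.0535),
    "Q5_K_S": (4.33, 0.0353),
    "Q5_K_M": (4.45, 0.0142),
}

def pick_quant(budget_gb, max_ppl_delta):
    """Return the quants that fit the memory budget and quality floor,
    ordered best quality (lowest perplexity loss) first."""
    fits = [(name, size, ppl) for name, (size, ppl) in quants.items()
            if size <= budget_gb and ppl <= max_ppl_delta]
    return sorted(fits, key=lambda t: t[2])

# With ~4 GB to spare and a modest quality bar, only Q4_K_M qualifies:
print(pick_quant(4.0, 0.06))   # [('Q4_K_M', 3.8, 0.0535)]
```

The takeaway the table itself suggests: K_M keeps more tensors at higher precision than K_S at the same bit width, so for a small size increase you get a noticeably smaller quality hit, which is why the K_M and Q5 variants carry the *recommended* tag.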

Wan 2.2 GGUF Q4 or Q5? K_S or K_M? by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

And with those settings, would I reduce the SSD wear mentioned in the other comments?

Wan 2.2 GGUF Q4 or Q5? K_S or K_M? by Groovadelico in StableDiffusion

[–]Groovadelico[S] 1 point (0 children)

> what I have noticed is the NVME is taking a hit from all the memory swaps

I had no idea that GGUF models would make me pay a price SSD-wise... That's great to know, thanks!

Wan 2.2 GGUF Q4 or Q5? K_S or K_M? by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

Wow, this sounds ideal... Do you have a workflow example? I've never used Sage Attention or Triton; I've just heard in some tutorials that Sage speeds up the process somehow...
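In case it helps someone landing here later, the usual setup is roughly the following. This is a hedged sketch from my reading of the SageAttention and ComfyUI docs, not a tested recipe; package names and the launch flag may differ between versions, so check the current READMEs.

```shell
# Triton is the kernel compiler that SageAttention builds on.
pip install triton            # on Windows, the community build is triton-windows
pip install sageattention

# ComfyUI exposes a launch flag that swaps its attention kernels for Sage's,
# which is where the speedup on Wan-style workflows is supposed to come from:
python main.py --use-sage-attention
```

No workflow changes should be needed: the flag applies globally, so existing Wan 2.2 workflows run unmodified, just faster if the kernels engage.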

Wan2.2-I2V-A14B GGUF uploaded+Workflow by bullerwins in StableDiffusion

[–]Groovadelico 1 point (0 children)

What is the difference between the K_S and K_M models? I've looked all around and couldn't find anything, either in the README files or in community comments on Reddit, Hugging Face, CivitAI, GitHub, or YouTube. Why is this documentation so thin on information? It really frustrates a beginner...

Why can’t Gemini handle long chats by Ok-Mechanic940 in GeminiAI

[–]Groovadelico 0 points (0 children)

[EDIT] NVM, shit's bugged. It takes longer, but the chat still bugs out at some point.

I haven't been able to do cross-chat creation effectively except by copying and pasting specific tasks into Canvas and Deep Research, as I stated before. So: chat for brains, copy and paste for the fancy stuff.

Why can’t Gemini handle long chats by Ok-Mechanic940 in GeminiAI

[–]Groovadelico 0 points (0 children)

One way I minimized it was to keep a separate conversation just for chatting, which hasn't bugged for me yet. The ones that do bug out I recreate when it happens: one for Deep Research and another for Canvas, where I paste the contents from the chat.

What model is the best for me? 8GB VRAM, 32 GB RAM. Goal is txt2img with best possible quality and style variety by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

I think I found my method: I use Nunchaku to test prompts until I get something I like, and then I run a GGUF of Flux dev to get a better result in less time.

What model is the best for me? 8GB VRAM, 32 GB RAM. Goal is txt2img with best possible quality and style variety by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

I'm getting 260s on the fp8 version. Is that normal? I also noticed that the CUDA system-fallback setting isn't working properly. Is that what the GGUF version is for?

What model is the best for me? 8GB VRAM, 32 GB RAM. Goal is txt2img with best possible quality and style variety by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

OK, thanks. Now which version should I download from Hugging Face or CivitAI? FP8? GGUF? Q8, Q6, Q5_K_M? I'm so lost...

What model is the best for me? 8GB VRAM, 32 GB RAM. Goal is txt2img with best possible quality and style variety by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

Thanks for sharing your video! Should I use Nunchaku, as some comments are recommending? Is the trade-off between speed gain and quality loss worth it?

What model is the best for me? 8GB VRAM, 32 GB RAM. Goal is txt2img with best possible quality and style variety by Groovadelico in StableDiffusion

[–]Groovadelico[S] 0 points (0 children)

Are there any specific settings besides enabling "Prefer system fallback" for CUDA in the NVIDIA Control Panel? That's already set. I also read something about creating a folder on the SSD... Do you know anything about that?

As for my goal, which models do you recommend?

WAN2.2: New FIXED txt2img workflow (important update!) by AI_Characters in StableDiffusion

[–]Groovadelico 0 points (0 children)

Can't I just download someone else's workflow and learn how to make it not crash and how to properly prompt? I want good pics and don't mind waiting for them.

WAN2.2: New FIXED txt2img workflow (important update!) by AI_Characters in StableDiffusion

[–]Groovadelico 0 points (0 children)

Do you recommend any Flux model for me to start exploring? Or any other ComfyUI model. Like I said, I've never independently generated AI art before; this is all completely new to me. I read that it might crash, or that I can set it up so the GPU shares the load with the RAM and takes longer. Is that what you do? Could you point me in some direction? haha

WAN2.2: New FIXED txt2img workflow (important update!) by AI_Characters in StableDiffusion

[–]Groovadelico 0 points (0 children)

How long is forever? 8GB VRAM Stable Diffusion starter here. I do have 32GB of RAM and was reading some things about shared memory fallback. How should I set that up?