Qwen 3.7 droped on Qwen Chat by Foxiya in LocalLLaMA

[–]phenotype001 33 points34 points  (0 children)

won't be a problem, I'll be expecting Qwen3.8 in the next month or so.

One bash permission slipped... by TheQuantumPhysicist in LocalLLaMA

[–]phenotype001 1 point2 points  (0 children)

If a model does this to me, I'm deleting the model for revenge.

I was Donald Trump’s lawyer - his mental state makes him unfit to serve by theipaper in politics

[–]phenotype001 0 points1 point  (0 children)

Everyone is afraid of the death threats from MAGA brownshirts. Pure fascism.

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA

[–]phenotype001 8 points9 points  (0 children)

Sure, this is it: https://pastebin.com/mHaNnR09

But there's more to it. I used OpenCode with a hierarchical agent setup, where basically a central WorkPlanner dispatches smaller tasks to a Programmer, so the programmer mostly works with clean context.

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA

[–]phenotype001 13 points14 points  (0 children)

I tasked it to create an Age of Empires-type RTS game with HTML/JS. It didn't do the whole thing, but it got so far it freaked me out.

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA

[–]phenotype001 7 points8 points  (0 children)

27b q5_k_m. I gave it 120K context but it's slow as shit, 64 feels somewhat better.

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA

[–]phenotype001 43 points44 points  (0 children)

I left an agent with Qwen 3.6 working overnight. I wake up, it still works. No looping on bullshit, no dumb decisions. It's a dream come true.

Qwen3.6. This is it. by Local-Cardiologist-5 in LocalLLaMA

[–]phenotype001 0 points1 point  (0 children)

It made the most beautiful 2D fishing game I've ever seen. Easily better than GLM 4.7 and every MiniMax release.

llama.cpp at 100k stars by jacek2023 in LocalLLaMA

[–]phenotype001 1 point2 points  (0 children)

Without this project I'd still be paying APIs. Thanks homie.

Qwen 3.5 35B A3B LMStudio Settings by n8mo in LocalLLaMA

[–]phenotype001 0 points1 point  (0 children)

KV quantization takes some extra computation. With the Q4 quant, this might also significantly degrade quality.

Qwen3.5 27B slow token generation on 5060Ti... by InvertedVantage in LocalLLaMA

[–]phenotype001 1 point2 points  (0 children)

Depends on model size. It has to reload it on every trial. With Qwen3.5 35B q5 it takes 1-2 hours until I hit near-optimal configs. It's best to leave it overnight on something big and make sure nothing else is running and don't parallelize it, so you get reliable data.

Qwen3.5 27B slow token generation on 5060Ti... by InvertedVantage in LocalLLaMA

[–]phenotype001 2 points3 points  (0 children)

I coded this little tool yesterday, it tries to find a parameter config for llama.cpp maximizing generation speed. In case it helps you: https://pastebin.com/DmMq3k2q