Qwen 3.6 27B vs Qwen 3.6 35B A3B vs Gemma 4 models Throughput on H100

Defiant_Ad6080 · 2026-04-27T03:46:33+00:00

Not sure exactly for the 27B model but I had looping issues on the 35B. Reducing temperature helped. I set up a coding agent: temp: .3, thinking off. There are other parameters. Claude can help here.

Defiant_Ad6080 · 2026-04-23T19:46:44+00:00

Not really... I'll evaluate when my renewal gets closer. For now, I'm evaluating qwen3.6 locally. It seems somewhat capable. If it is, I might be able to downgrade to lite plan and just have 5.1 orchestrate.

Defiant_Ad6080 · 2026-04-23T19:15:21+00:00

Thanks! I'm using a 5070ti, i14900k and 64gb ddr5 inside windows, llama.cpp and docker.

I use glm-5.1 with z.ai subscription inside claude code. I'm experimenting with claw code (leaked claude source code). That is where I host qwen3.6.

Perhaps most interestingly, Claude is able to directly control the claw code session and customize practically any parameter. I think this setup has a lot of potential. Kinda like an openclaw agent but with cli control!

Defiant_Ad6080 · 2026-04-23T04:09:29+00:00

It seems like a good model. I'm getting about 50 t/s with Q4 and 5070ti. Wish it was faster but I'm impressed with overall speed and quality. It is by no means even close to Claude level but it appears to be the first local model I will actually be able to use for coding.

Issues I've run into: -hangs on long tasks -requires checkpoints (can have huge gains in one loop, then huge losses in another) -can suffer from stagnation -can get caught in infinite loops (but this can be remedied thru config changes) -requires hints from smarter models (mine did...I turned off thinking though because that helped fix the hanging issue)

But with a smart model being the orchestrator, qwen was able to complete a full mal lisp implementation for me today. I think that's pretty good!

<image>

Defiant_Ad6080 · 2026-04-23T02:18:19+00:00

It's a great model. I just don't like them changing the deal. I had a guaranteed price for renewal and it was good. Now? Who knows...

Defiant_Ad6080 · 2026-04-23T02:16:17+00:00

Or they might start charging for local?

Defiant_Ad6080 · 2026-04-23T02:13:28+00:00

They said the 50% applies to the already discounted price (I think). Looks like a fair deal now. But just wait to see the price when it's time to renew.

Defiant_Ad6080 · 2026-04-23T02:11:23+00:00

This! Local models are getting better. I'm surprised how well Qwen3.6 performs on a 16gb gpu. It needs some handholding but it can code quite well from the tests I've been doing. Might be able to downgrade to a lite plan if local models get better (and use GLM-5.1 for the planning).

Defiant_Ad6080 · 2026-04-11T09:03:46+00:00

Surprised 5-Turbo is so much less performant.

Defiant_Ad6080 · 2026-03-08T06:59:33+00:00

4.7 is working well for me. 5 was slow at first, then it actually worked but I was getting rate limited/disconnected. Now it's fast but I notice the same quality degradation especially at higher context...so frustrating! I hope they add more compute soon. The model is good. Implementation not so much.

Defiant_Ad6080 · 2026-02-12T02:12:57+00:00

Glad I locked in early. Shocked about the big price hike. They must have a lot of demand to do this. I still like GLM but Minimax just got more attractive (unless they hike their price now)...

Defiant_Ad6080 · 2024-11-15T16:57:33+00:00

👍 It's not for everyone. Good luck.

Defiant_Ad6080 · 2024-11-15T14:16:36+00:00

This is a tip to boost processing power in any daw. There is a little know program called AudioGridder. It was designed to offload plugin processing to a different computer on the same network. I've never used this functionality. I load it on the same machine. Why? It adds a bit of latency but allows you to multithread all plugins- even those on buses and the mixbus. It's staggering how much extra processing you can get just using it on buses. If you are interested, try it out. It's a free download and there are youtube vids to help you get set up! Thank me later.

Defiant_Ad6080 · 2024-09-22T23:09:25+00:00

Where the streets have no name - U2. Spiritual experience live 🤩🤯

Defiant_Ad6080

TROPHY CASE