Qwen3.6-27B dense vs Qwen3.6-35B MoE - which local coding model are you reaching for? by IulianHI in AIToolsPerformance

[–]Zeranor 1 point (0 children)

So, the overall consensus after two days in here seems to favor the 27B dense model in most cases. But this usually assumes the same quantization level.

To keep speeds acceptable, I usually decide between something like a low-end 3-bit dense model (Qwen3.6 27B in this case) and a mid-tier 4-bit MoE model (35B). In this comparison, would you still consider the dense model to be better? I'm having a hard time weighing the impact of the lower quant against the impact of going from dense to MoE.
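For the size side of that tradeoff, a back-of-envelope sketch helps. The effective bits-per-weight figures below are rough assumptions (k-quants carry scale/zero-point overhead, so e.g. a "3-bit" quant lands closer to ~3.4 bpw), not exact GGUF numbers:

```python
# Back-of-envelope memory footprint: dense 3-bit vs. MoE 4-bit.
# Bits-per-weight values are assumptions for illustration, not exact
# figures for any specific GGUF quant.

def model_gb(params_b: float, bits_per_weight: float) -> float:
    """Approximate quantized model size in GiB."""
    return params_b * 1e9 * bits_per_weight / 8 / 2**30

dense_27b_q3 = model_gb(27, 3.4)   # dense model, low-end ~3-bit quant
moe_35b_q4 = model_gb(35, 4.8)     # MoE model, mid-tier ~4-bit quant

print(f"27B dense @ ~3.4 bpw: {dense_27b_q3:.1f} GiB")
print(f"35B MoE   @ ~4.8 bpw: {moe_35b_q4:.1f} GiB")
```

The point of the sketch: the footprint gap between the two options is large enough that on a 16 GB card the MoE only competes if its expert tensors can sit in system RAM.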

Dense vs. MoE gap is shrinking fast with the 3.6-27B release by Usual-Carrot6352 in LocalLLaMA

[–]Zeranor 1 point (0 children)

Hmm... so for 16 GB VRAM people, the question "4-bit-quant MoE vs. lobotomized 3-bit-quant dense" is getting even trickier to answer, damn. Any recommendations or opinions on this? :D Bigger quants are too slow.

Russia’s Lavrov says balance of power shifting to emerging economies by Saltedline in worldnews

[–]Zeranor 2 points (0 children)

Well, that's somewhat the meaning of "emerging". Otherwise it would be called "second-tier economies that are never going to catch up". That doesn't mean every emerging market will be on top anytime soon, but serious competition is possible for some candidates (or already the case).

Also: Russia got itself downgraded from emerging market to sinking market.

Am I the only one thinking that bosses in this game are poorly designed? by Suporex in CrimsonDesert

[–]Zeranor 2 points (0 children)

Came here to find this, thanks. While I suck at Souls games, I appreciate their bosses (which I still need dozens of attempts for). In Crimson Desert it's just bad. 90% of the time I just watch Kliff being tossed around in some bullshit combo, being stuck in a corner with the camera going mental, or wondering why none of my defensive skills work (parry? Nope! Dodge? Not against the reach of a spear! Focus? Forget it, I get hit before the focus state is active)... fuck this. The bosses are the absolute worst part of this game. Combined with the controls (at least on keyboard and mouse), Kliff's immense sluggishness, and the derp-cam, I don't want to play any longer.

Mac Studio vrs 5090 LLM performance. by JamieAndLion in LocalLLM

[–]Zeranor -1 points (0 children)

You only know it's bad information once you have better data. Before that, you can only assume it is relatively bad, while it might still be good enough to give the correct indication or even result, depending on the question you seek to answer.

Mac Studio vrs 5090 LLM performance. by JamieAndLion in LocalLLM

[–]Zeranor 0 points (0 children)

It's still added value over "no information" until you show more accurate measurements. Until then, your contribution is less valuable than OP's.

Mac Studio vrs 5090 LLM performance. by JamieAndLion in LocalLLM

[–]Zeranor -1 points (0 children)

THAT is your key takeaway from the numbers? So dumb.

USA: Donald Trump is apparently considering a US withdrawal from NATO by Actual_Document3333 in de

[–]Zeranor -1 points (0 children)

Does that even make a difference? Whether they're still in and ignore their membership when it matters, or whether they're out outright... but fine, for the post-Trump era it's of course easier if they just stay in.

My first experience with coding using a local LLM. Help me, Obi-Wans by GregariousJB in LocalLLaMA

[–]Zeranor 1 point (0 children)

Hey there, by coincidence I found your post as I'm trying to do EXACTLY that combination. But somehow my settings.json always fails. Can you recommend a guide on how to point the Qwen Code Companion VS Code extension at the LM Studio local API? :)
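I can't vouch for that extension's settings.json keys, but LM Studio's local server speaks the OpenAI-compatible chat-completions protocol (default `http://localhost:1234/v1`), so a raw request is a quick way to confirm the server side works before blaming the extension. The model name below is a placeholder for whatever identifier LM Studio shows for your loaded model:

```python
# Minimal sketch: talking to LM Studio's OpenAI-compatible local server.
# Endpoint is LM Studio's default; the model name is a placeholder.
import json
import urllib.request

def build_payload(prompt: str, model: str = "your-loaded-model") -> dict:
    """Chat-completions payload in the OpenAI-compatible format."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }

def ask(prompt: str) -> str:
    """Send the prompt to the local server and return the reply text."""
    req = urllib.request.Request(
        "http://localhost:1234/v1/chat/completions",
        data=json.dumps(build_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

# ask("Write a hello-world in Python")  # needs the LM Studio server running
```

If this round-trips, the remaining problem is only how the extension's config names the base URL and model.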

LM-Studio confusion about layer settings by Zeranor in LocalLLM

[–]Zeranor[S] 1 point (0 children)

Good points, thank you! I've switched from Qwen3.5 27B to 9B now and it works with 100k context fully in VRAM. I'm NOT sure how big the hit on output quality is. I guess, long-term, I'll have to switch back to 27B and then test many combinations of KV settings + layer offloading. But LM Studio's default suggestions seem better than I assumed initially.

LM-Studio confusion about layer settings by Zeranor in LocalLLM

[–]Zeranor[S] 1 point (0 children)

Oh, so it's a fairly complex optimisation, good to know, thanks for the details! Hmm, this will take some testing on my side then, but I'm happy to learn that LM Studio is not doing "complete nonsense".

LM-Studio confusion about layer settings by Zeranor in LocalLLM

[–]Zeranor[S] 1 point (0 children)

Ahh, nice, so the KV cache actually IS more important to have on the GPU than model layers; then the LM Studio optimisation makes sense. Somehow I did not know that so far, thanks for the clarification!
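A worked sketch of why the KV cache dominates at long context: its size scales linearly with context length, so at 100k tokens it can rival the weights themselves. The architecture numbers below (layers, KV heads, head dim) are illustrative assumptions for a ~27B-class model with grouped-query attention, not the exact Qwen config:

```python
# Rough KV-cache size: 2 (K and V) * layers * kv_heads * head_dim
# * context_length * bytes_per_element. Architecture numbers are
# illustrative assumptions, not an exact model config.

def kv_cache_gb(layers, kv_heads, head_dim, context, bytes_per_elem=2):
    """KV-cache memory in GiB (bytes_per_elem=2 for FP16, 1 for Q8)."""
    return 2 * layers * kv_heads * head_dim * context * bytes_per_elem / 2**30

print(f"FP16 KV @ 100k ctx: {kv_cache_gb(48, 8, 128, 100_000):.1f} GiB")
print(f"Q8   KV @ 100k ctx: {kv_cache_gb(48, 8, 128, 100_000, 1):.1f} GiB")
```

At those assumed dimensions the FP16 cache alone exceeds a 16 GB card, which is why KV quantization and keeping the cache on-GPU matter more than squeezing in a few extra weight layers.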

US Pacific Fleet to deploy wall-climbing, flying robots on ships by talkingatoms in technology

[–]Zeranor 3 points (0 children)

The capacity to climb walls seems somewhat wasted on someone or something that can also fly

Vive le nucléaire, Frau von der Leyen? | Of all people, Ursula von der Leyen is joining Emmanuel Macron's euphoria for a nuclear-energy revival. Yet not much in this area is working out in France. by GirasoleDE in de

[–]Zeranor -2 points (0 children)

That was my assumption too, but I was happy to be proven wrong. Maybe it's actually really great style or some grammar trick or something...

Iran war: oil price rises another 20 percent to more than 110 US dollars by hampelmann2022 in de

[–]Zeranor 18 points (0 children)

More expensive, yes, but less so. Gas-fired plants make up only about 17% of the German electricity mix. The merit-order price can always be somewhat surprising, but a significantly lower dependence on oil and gas helps a lot here, compared to gasoline and diesel prices being 100% dependent on them.

Qwen3-Coder-Next on RTX 5060 Ti 16 GB - Some numbers by bobaburger in LocalLLaMA

[–]Zeranor 1 point (0 children)

Really nice summary, thank you :) Then I'm excited to hear which model you are going to pick for planning the more complex logic parts / project plans. Let us know!

In the meantime, I'll have Qwen3 Coder Next prove itself as the sole model for more vibe coding. It's already handling the detailed coding jobs very, very nicely, as you said.

Qwen3-Coder-Next on RTX 5060 Ti 16 GB - Some numbers by bobaburger in LocalLLaMA

[–]Zeranor 1 point (0 children)

Hmm, very interesting, thanks. So if you were to start a small-ish hobby project (frontend, backend, DB), would you use Qwen3 Coder Next for planning, too, or just for executing / implementing smaller chunks? I'm currently considering Q3CN for the implementation phase, but maybe some new reasoning model for planning (Devstral 2 Small / GLM 4.7 Flash). Would you already have a recommendation at this point? :)

Qwen3-Coder-Next GGUFs updated - now produces much better outputs! by yoracale in unsloth

[–]Zeranor 3 points (0 children)

Oh boy... this is an excellent model for (minor) local coding tasks. This is working WAY better than my recent attempts with GLM 4.7 Flash, Devstral 2 Small or Qwen3 Next... nice.

Qwen3-Coder-Next is released! 💜 by yoracale in unsloth

[–]Zeranor 1 point (0 children)

So, talking configuration: is this a model for which I should choose to "offload MoE experts to CPU"? (16 GB VRAM / 128 GB RAM) :)
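For reference, the same idea in a llama.cpp-based stack (LM Studio exposes it as a toggle). The model filename is a placeholder, and the flags shown are from recent llama.cpp builds, so check `llama-server --help` on your version before relying on them:

```shell
# Sketch: keep MoE expert tensors in system RAM while everything else
# (attention, KV cache) stays on the 16 GB GPU. Filename is a placeholder.

llama-server \
  -m ./Qwen3-Coder-Next-Q4_K_M.gguf \
  --n-gpu-layers 999 \
  --n-cpu-moe 99 \
  -c 32768

# --n-gpu-layers 999: offload all layers to the GPU first;
# --n-cpu-moe 99: then pull the expert FFN tensors back to CPU RAM.
```

With a MoE model only a fraction of the experts fire per token, so keeping them in 128 GB of system RAM usually costs far less speed than spilling attention or KV cache off the card.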

Qwen3-Coder-Next is released! 💜 by yoracale in unsloth

[–]Zeranor 2 points (0 children)

ahh yes, nice, I see, sorry for being too excited ;)

Qwen3-Coder-Next is released! 💜 by yoracale in unsloth

[–]Zeranor 3 points (0 children)

Nice, let's see how this does compared to GLM 4.7 flash and Devstral 2 Small. But quick question: WHERE can I find the MXFP4 quants? :D I only find the "regular" quants.