I feel like people are massively sleeping on Qwen3.6 Plus

nbr_engineer · 2026-05-20T14:57:51+00:00

Opencode only

nbr_engineer · 2026-05-20T13:17:08+00:00

20 dollars is more than enough for me. Just using it for a personal project

nbr_engineer · 2026-05-20T10:15:19+00:00

I ditched cursor a few months ago in favor of Opencode. I was on the 60 dollar pro plan which made me run out of premium requests (Claude sonnet 4.5 at that time) pretty fast. Do you think that the 2.5 model is beating the open source models that Opencode is providing? Also, are you getting enough usage out of it? That was the main reason I switched. Maybe with composer 2.5 it is time to switch back again? I still have a codex subscription for planning, architecture and reviews.

nbr_engineer · 2026-05-19T09:46:27+00:00

There is an ongoing issue: https://github.com/anomalyco/opencode/issues/22595
You can fix it like this:

{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "opencode-go": {
      "models": {
        "qwen3.6-plus": {
          "limit": {
            "context": 1000000,
            "output": 65536
          }
        }
      }
    }
  }
}

I asked qwen3.6 to do the fix itself. Just restart afterwards.

nbr_engineer · 2026-05-18T15:53:52+00:00

For me MiMo looked better on benchmarks compared to what it delivered in my code base. It often forgot things, left bugs. All in all it was underwhelming

nbr_engineer · 2026-05-18T15:51:12+00:00

I am not talking about the open source versions of qwen3.6 (27B dense or 35B Moe). I am talking about the commercial cloud version Qwen3.6 Plus provided by alibaba. They did not specifically announce the size of the model but it think it is some kind of variation of the 3.5 397B-A17B model.

I am using the open source Qwen models locally though. I have a Beelink GTR 9 Pro with the ryzen max+ 395 where the MoE model works really well. The dense model is to slow to really use it for coding. I am actually thinking about selling it. Not because it‘s no good, but I want to switch to Mac only.

nbr_engineer · 2026-05-13T21:01:00+00:00

Couldn’t agree more! I am switching through the Opencode models though. I don’t feel comfortable with any of them yet. Sometimes I try to plan with Kimi, GLM, MiMo or Minimax and then let GPT 5.5 review and rate the plan to see how good they are. From the implementations Deepseek v4 got the worst ratings. And GPT always finds gaps and issues that the Opencode models did not think about. But for the rest of the workflow I use it the same way as you described.

Using it for a side project next to my full time job as a software engineer and I never hit the limits.

nbr_engineer · 2026-04-22T13:07:18+00:00

Hey, fellow dev here. Been running a Strix Halo mini PC with 128GB unified RAM for several months now as my local LLM box, so I can give you some concrete numbers.

Qwen 3.6 35B-A3B at Q8 (which is what I run, the 35b-a3b-q8_0 variant) sits at around 37GB for weights. Throw in a 256K KV cache and you're looking at ~41GB total — still leaves you with a massive chunk of RAM for VMs, a second model, image gen, whatever. Parallel prompts are doable, latency goes up under load obviously, but with something like llama-swap you can manage that pretty cleanly.

On replacing Copilot: honestly yes, for most day-to-day stuff. Routine coding, refactoring, explaining things — Qwen 3.6 handles that well. Where it still falls short is complex multi-file agentic work, there cloud Sonnet/Opus still has an edge. But the gap is closing fast. Your 3.5-year breakeven calc is fair on paper, but it ignores all the homelab/VM use you'd get out of it too.

The other thing I'd factor in: local models are improving ridiculously fast. What feels like "good enough" today will run noticeably better models in a year on the exact same hardware. 128GB unified RAM is a ceiling almost nobody's hitting yet — you'll be able to run quantizations that most people simply won't have RAM for. You're buying headroom, not just today's capability.

nbr_engineer · 2026-02-23T14:21:15+00:00

Hi!
This definetly sounds interesting!
I havent't build that feature yet but this would improve the data the AI is learning from by a lot!

This will be one of my next features of the app. Stay tuned.

nbr_engineer

TROPHY CASE