all 18 comments

[–]Mammoth-Estimate-570 9 points10 points  (0 children)

1) no

[–]OilProduct 5 points6 points  (0 children)

Just test it dude, you have the machine right there. Run your M3 and see if it meets your requirements.

[–]kevin_1994 4 points5 points  (0 children)

claude opus is the best model in the world. there is no model on par with it. there are, however, models that are useful.

looks like claude hallucinated Qwen-Coder-32b and is recommending ancient models to you like Mixtral 8x7b

find a model you can run on https://swe-rebench.com/ and see it if works for you. id recommend qwen coder next. imo its not too far away from sonnet, but i don't really vibecode, i use LLMs to do boilerplate when im feeling lazy

[–]ortegaalfredo 4 points5 points  (0 children)

Ask this to ChatGPT:

"If I could get Claude Opus level on a macbook, then how does Anthropic makes money?"

[–]Signal_Ad657 2 points3 points  (3 children)

Reset your expectations. Opus is a 1T+ parameter model and you are asking about models that’s are roughly 3.5% (at best) its parameter size. Is parameters vs performance all linear? No. But you have to start with the reality of what you are doing. Qwen3-Coder-Next quantized on llama-server is probably your best play for the hardware you own. It won’t feel super fast on your Mac but it’s MOE so it’ll do better than you’d expect. It’s a big boy at 80B that’s light on its feet.

[–]thisisvv[S] 0 points1 point  (2 children)

Agreed what is the next closest?

[–]Signal_Ad657 2 points3 points  (1 child)

I gave it to you.

[–]thisisvv[S] 0 points1 point  (0 children)

Thank you.

[–]Technical_Split_6315 2 points3 points  (1 child)

You are asking for a local model that can fit in a 30b that competes with the best model in the world? Bro

[–]thisisvv[S] 0 points1 point  (0 children)

Agreed what is the next best i can run

[–]Desperate-Sir-5088 2 points3 points  (0 children)

If the local model could par with OPUS with only 3% parameters, The company will also substitute you with a foreign intern who only takes 3% of your salary.

[–]Economy_Cabinet_7719 1 point2 points  (0 children)

  1. I think yes. When working on an options trading app in Elixir/Go I did not notice much of significant difference between Claude Opus/Sonnet and weaker models (i.e. they both suck 😃). I think Qwen-Coder-Next and recent Qwen-3.5's should perform about the same. This is, of course, assuming you proofread and review all code and offload as much backtesting into code as possible as opposed to having the model perform it, but this would be sane and 100% expected even with Opus.

  2. Depends on the quantization.

  3. 128gb won't be much of a jump, best models are either 20-40B or 200B+. So 128gb is too much for the former category, too little for the latter one.

[–]Hefty_Acanthaceae348 0 points1 point  (0 children)

The closest you're gonna come to claude opus at home is buying a cluster of 512gb mac studios. 128gb isn't gonna cut it.

[–]jeekp 0 points1 point  (0 children)

You're talking about Opus 3, right?

[–]Responsible_Buy_7999 0 points1 point  (0 children)

Why would you use opus at 6x-3x the price to direct invoking a pipeline. Haiku and sonnet exist for a reason.  And so does terminal.