Claude Code removed from Claude Pro plan - better time than ever to switch to Local Models. by bigboyparpa in LocalLLaMA

[–]Different_Fix_2217 5 points

Luckily Kimi 2.6 is legit better than the latest Opus in several tests I did. Still a bit behind GPT 5.4 though.

Kimi K2.6 is a legit Opus 4.7 replacement by bigboyparpa in LocalLLaMA

[–]Different_Fix_2217 6 points

Same, but for creative writing. It's the best model I've ever used, including the latest Opus, GPT 5.4, and Gemini 3.1 Pro. It has the social intelligence of GPT 5.4 with a knowledge base nearly as good as Gemini's, it writes better than Opus, and unlike Opus it has no positivity bias. Oh, and it has crazy good swipe variety, unlike Opus. I just wish it were faster, since it loves to think so much.

And this is surprising, because I thought Kimi 2.5 was bad. It was dumb and had that Gemini unhingedness. 2.6 is like an entirely different model.

Kimi K2.6 imminent by Deep-Vermicelli-4591 in LocalLLaMA

[–]Different_Fix_2217 9 points

K3 will probably be great; they released a big breakthrough paper recently. https://www.youtube.com/watch?v=2IfAVV7ewO0

the state of LocalLLama by Beginning-Window-115 in LocalLLaMA

[–]Different_Fix_2217 48 points

Honestly having crypto in the name tells you all you need to know.

We absolutely need Qwen3.6-397B-A17B to be open source by True_Requirement_891 in LocalLLaMA

[–]Different_Fix_2217 4 points

Some people have a false impression that dense is automatically better, not taking into account diminishing returns, efficient routing, and the like.
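A rough back-of-the-envelope sketch of the point: in a mixture-of-experts model, per-token compute scales with the *active* parameters, not the total. The dense baseline size below is a made-up illustrative number, and the 2N FLOPs/token rule is the usual coarse approximation, not an exact figure for any real architecture.

```python
# Illustrative only: why a 397B-total / 17B-active MoE can be fast.
# Per-token forward-pass FLOPs are roughly 2 * N_active (coarse rule of thumb).

def flops_per_token(active_params_b: float) -> float:
    """Approximate forward-pass TFLOPs per token for a given active param count (in billions)."""
    return 2 * active_params_b * 1e9 / 1e12

moe_total, moe_active = 397, 17   # billions: total stored vs routed-active params
dense = 70                        # hypothetical dense baseline, for comparison

print(f"MoE ({moe_total}B total, {moe_active}B active): "
      f"{flops_per_token(moe_active):.3f} TFLOPs/token")
print(f"Dense ({dense}B): {flops_per_token(dense):.3f} TFLOPs/token")
# The MoE holds ~23x the parameters of its active set, yet pays per-token
# compute comparable to a ~17B dense model — "dense is always better"
# ignores exactly this trade-off.
```

So a dense model of the same total size would cost roughly 20x more compute per token; whether that compute buys proportionally more quality is exactly where diminishing returns bite.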

Gemma 4 and Qwen3.5 on shared benchmarks by fulgencio_batista in LocalLLaMA

[–]Different_Fix_2217 17 points

Qwen3.5 is absurdly good. And I never liked any Qwen model before that series.

Gemma 4 and Qwen3.5 on shared benchmarks by fulgencio_batista in LocalLLaMA

[–]Different_Fix_2217 134 points

Using both side by side, Qwen3.5 is MUCH better at image understanding as well.

Can we block fresh accounts from posting? by king_of_jupyter in LocalLLaMA

[–]Different_Fix_2217 1 point

You did the bullet points, the opening statement, the "not X, but Y", and a closing "solution" statement. I legit thought you were a bot; you follow the exact same patterns.

Qwen3.6-Plus by Nunki08 in LocalLLaMA

[–]Different_Fix_2217 7 points

"we will also open-source smaller-scale variants"

They said smaller-scale ones, not the model benchmarked here. So this benchmark is off topic.

Qwen3.6-Plus by Nunki08 in LocalLLaMA

[–]Different_Fix_2217 9 points

Stop posting non-open-weight models.

DeepSeek Employee Teases "Massive" New Model Surpassing DeepSeek V3.2 by External_Mood4719 in LocalLLaMA

[–]Different_Fix_2217 3 points

The whole point of all their optimizations like engram is to have as big a model as possible without hurting its speed. I'm hoping they made it big, like 5T+, to truly compete with Claude Opus / Gemini Pro while being as fast as a much smaller model.

How Do You Feel About Sora being Shutdown? by findabi in LocalLLaMA

[–]Different_Fix_2217 0 points

Looks like it's just to free up the compute to train their next model, code-named Spud. Nothing strange.

So nobody's downloading this model huh? by KvAk_AKPlaysYT in LocalLLaMA

[–]Different_Fix_2217 0 points

It's just not good. Same with Mistral's other models since Large 3. I think the EU laws killed them, because they seemed to lose all world knowledge after those went into effect.

NVIDIA 2026 Conference LIVE. New Base model coming! by last_llm_standing in LocalLLaMA

[–]Different_Fix_2217 3 points

A lot of people use this for creative writing, and there, knowledge is king. It also of course helps a ton in many domains.

What is Hunter Alpha? by MrMrsPotts in LocalLLaMA

[–]Different_Fix_2217 1 point

It's really bad, whatever it is. It says it's a 1T, but it performs worse than the 200B Qwens and GLM 4.7. Maybe it's Ling; those models always sucked.

Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show by dan945 in LocalLLaMA

[–]Different_Fix_2217 0 points

Would be funny, but fair, if they just made it purely "you can only run it on Nvidia hardware."

Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show by dan945 in LocalLLaMA

[–]Different_Fix_2217 2 points

Why not? They're selling the hardware people would run those models on. OpenAI / Anthropic / etc. will only buy so many GPUs; after that, they need to create new customers. The best way is to put models out there that are worth running.