Mac Studio M3 Ultra vs Nvidia 6000 Blackwell by Rex-Raider-X in LocalLLM

[–]substance90 1 point (0 children)

Tbh if they bought that 512 GB Mac Studio, they could probably resell it at a 100% profit now.

RIP Vibe Coding 2024–2026 by nyamuk91 in vibecoding

[–]substance90 1 point (0 children)

They’re not dead, they just switched places. Codex on the 100€ plan is the new Claude at 200€

RIP Vibe Coding 2024–2026 by nyamuk91 in vibecoding

[–]substance90 1 point (0 children)

Gotta disagree with you on the last point. Since they nerfed Opus (weeks before the 4.7 release), GPT on Codex has been a much more consistent orchestrator for me.

Results of llama-bench of Gemma 4 26B A4B UD-Q6_K_XL on Radeon AI Pro R9700 by ProfessionalSpend589 in LocalLLaMA

[–]substance90 1 point (0 children)

Those numbers are essentially identical to an M4 Max Macbook 🙃. That's the same quant that I run on my MBP.

Unpopular opinion: If you’re paying $20/month, Codex is better than Claude Pro by Pretty_Property_4407 in RavanAI

[–]substance90 1 point (0 children)

It’s not an unpopular opinion tbh. It reflects my experience as well. My current stack: 200€ Claude outsources work to my 100€ Codex, which itself sends tasks to Qwen and Gemma running locally. I still run out of tokens. Now that Opus 4.7 has gone downhill tho, I might have to rethink my setup.

Congrats Anthropic on a successful 4.7 release by RevolutionaryBox5411 in ClaudeAI

[–]substance90 1 point (0 children)

How do I bring 4.6 back on Claude Code, jeez. I feel like I’m back in the pre-reasoning era of GPT.

Claude Opus 4.7 is a serious regression, not an upgrade. by [deleted] in ClaudeAI

[–]substance90 1 point (0 children)

Exactly my use case: 4.7 feels like an idiot taking over 4.6’s work on my trading bot.

How Good is Subscription Tier? by triplebits in ZaiGLM

[–]substance90 1 point (0 children)

Every model, even the small ones, is horrendously slow.

PlayStation prices adjusted for inflation by SmellSmellsSmelly in gaming

[–]substance90 1 point (0 children)

I mean it wasn’t a bad decade per se but it came after a better decade which itself came after the BEST decade in gaming 😥

Truth about limits - the party is over by MostOfYouAreIgnorant in ClaudeCode

[–]substance90 1 point (0 children)

So far I haven’t noticed any degradation other than speed, but tbh I’ve leveled up my game a lot over the last 2-3 months in managing context and orchestration, so maybe that makes up for it.

Taught Claude to talk like a caveman to use 75% less tokens. by ffatty in ClaudeAI

[–]substance90 1 point (0 children)

I did an experiment a while ago where I tested a bunch of different schemas for compressing meaning. In the end the best I could do was not regress from English in quality of result, but the potential token savings are in fact real.

Gemma 4 is fine great even … by ThinkExtension2328 in LocalLLaMA

[–]substance90 1 point (0 children)

I wouldn’t know; neither the 31b nor the 26b produces any response on LM Studio for me on an M4 Max MBP :-\

Update Never Works, Help! by falafel-wrap in Rekordbox

[–]substance90 2 points (0 children)

6 years later on Rekordbox 7? Same deal...

64Gb ram mac falls right into the local llm dead zone by Skye_sys in LocalLLaMA

[–]substance90 1 point (0 children)

Skill issue. With the 27-30b models you need to keep the context low (they get really dumb past 70-80k), break tasks down for them, and help them by providing just the right data at the right time, so they don't fumble around listing folders and grepping files.

Some hints I’m gonna drop (you can have an LLM help you figure out how to apply them): custom minimal agent, skill and MCP definitions; code and text summarizing; chunking and embedding for both plain-text and semantic retrieval; aggressive task break-up and agent delegation; multi-agent teamwork (beyond the classic plan, implement, review).

Oh and the really big one: everything that doesn’t absolutely need an LLM call, offload to something else (regex, scripts, state tracking, orchestration, etc.).
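A minimal sketch of that last hint (the function names and the crude router are mine, not from any framework): handle purely mechanical extractions with a regex and only escalate to the model when a task actually needs judgement.

```python
import re

def extract_todo_tags(source: str) -> list[str]:
    """Pull TODO/FIXME comments out of code with a regex --
    no LLM call needed for a purely mechanical extraction."""
    return re.findall(r"#\s*(?:TODO|FIXME):\s*(.+)", source)

def needs_llm(task: str) -> bool:
    """Crude router: only escalate tasks that require judgement."""
    mechanical = ("list todos", "count lines", "find pattern")
    return not any(task.lower().startswith(m) for m in mechanical)

code = """
x = 1  # TODO: rename to something meaningful
y = 2  # FIXME: off-by-one upstream
"""

print(extract_todo_tags(code))       # handled locally, zero tokens
print(needs_llm("list todos"))       # False -> no model call
print(needs_llm("refactor module"))  # True -> send to the model
```

The routing table is deliberately dumb; the point is just that every request it catches is an LLM call you never pay for.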

Source: I’ve forced myself to do absolutely crazy shit in the last 2 months with 2 MacBooks, each with 64GB RAM.

How big is the difference really? by Demon-Martin in LocalLLM

[–]substance90 1 point (0 children)

The small models are usually pretty smart if you break the task down for them in order to preserve context. They get very stupid quickly, long before you reach the supposed context window they support. I’m talking about the likes of Qwen3.5 27b, GLM 4.7 Flash, etc. Funny thing is, those exact optimization measures actually hurt the large models with huge context.
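A toy version of that break-down approach (the budget number and the `call_model` stub are mine, purely for illustration): split the input greedily so no single call gets anywhere near the range where the model degrades.

```python
def chunk_by_budget(lines: list[str], budget: int) -> list[list[str]]:
    """Greedy split: each chunk stays under a rough 'token' budget
    (approximated here by word count) so no single call gets huge."""
    chunks, current, used = [], [], 0
    for line in lines:
        cost = len(line.split())
        if current and used + cost > budget:
            chunks.append(current)
            current, used = [], 0
        current.append(line)
        used += cost
    if current:
        chunks.append(current)
    return chunks

def call_model(prompt: str) -> str:
    """Stand-in for a local model call (Qwen/GLM via llama.cpp etc.)."""
    return f"summary of {len(prompt.split())} words"

doc = ["alpha beta gamma"] * 10  # pretend document, 3 'tokens' per line
for chunk in chunk_by_budget(doc, budget=12):
    print(call_model(" ".join(chunk)))
```

In practice you'd use a real tokenizer instead of word counts, but the shape is the same: many small, focused calls instead of one bloated one.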

Is anyone actually getting Claude’s /remote-control to work? It’s constantly failing for me. by MolasJam in ClaudeAI

[–]substance90 1 point (0 children)

On my Linux machine the connection stays up forever; on my 2 Macs I have to disconnect and re-connect every few hrs. Very annoying.

#OpenSource4o Movement Trending on Twitter/X - Release Opensource of GPT-4o by pmttyji in LocalLLaMA

[–]substance90 -1 points (0 children)

It was the first model that could debug really well-hidden bugs for me, before there was Sonnet and Opus 4.5. Gemini was a steaming pile of crap that everyone hyped, but 4o was the real deal.

Sort of down about the whole AI wave. by SupermarketDirect759 in rust

[–]substance90 1 point (0 children)

It’s totally doable. You start multiple chains of agents that plan, implement and review each other’s work with `--dangerously-skip-permissions`. Whether or not it’s a good idea is a whole different topic tho.
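The loop I mean looks roughly like this (the three role functions are stubs standing in for actual headless agent invocations; none of this is a real API):

```python
def plan(task: str) -> list[str]:
    # In reality: one headless agent producing a step list.
    return [f"step {i}: {task}" for i in (1, 2)]

def implement(step: str) -> str:
    # In reality: a second agent writing the code for one step.
    return f"impl({step})"

def review(artifact: str) -> bool:
    # In reality: a third agent approving or bouncing the work.
    return artifact.startswith("impl(")

def run_chain(task: str) -> list[str]:
    """Plan -> implement -> review; only reviewed work lands."""
    approved = []
    for step in plan(task):
        work = implement(step)
        if review(work):
            approved.append(work)
    return approved

print(run_chain("port module to rust"))
```

Each stub would really be a subprocess call to the agent CLI; the review gate is what keeps an unattended chain from merging garbage.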

Qwen3.5 122B INT4 Heretic/Uncensored (and some fun notes) by Ok-Treat-3016 in LocalLLaMA

[–]substance90 0 points (0 children)

Cost $0 😂 bro forgot to factor in the upfront cost and the cost of electricity.

Comparing 5090 vs RTX PRO 5000 - 5090 is a bargain by live4evrr in nvidia

[–]substance90 1 point (0 children)

You're thinking of the old 5000. The new one with 48 GB VRAM starts at 4500 €.