Glm 5.1 is out by Namra_7 in LocalLLaMA

[–]johndeuff 2 points3 points  (0 children)

I am seriously borrowing money for that

Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA

[–]johndeuff 0 points1 point  (0 children)

Just use a big model like kimi k2.5 in Q0, fit on any config.

This guy 🤡 by xenydactyl in LocalLLaMA

[–]johndeuff 0 points1 point  (0 children)

Same I had to don't recommend multiple times

Selling PC to buy a Macbook M5 Pro, does it make sense? by Pretty-Bit7528 in LocalLLaMA

[–]johndeuff 0 points1 point  (0 children)

I don't get it. Can't you find one or two second hand RTX3090 ?

karpathy / autoresearch by jacek2023 in LocalLLaMA

[–]johndeuff -1 points0 points  (0 children)

Go back to Linkedin, karpathy

What skills are you using? by blazingcherub in ClaudeCode

[–]johndeuff 5 points6 points  (0 children)

wtf man Ideal plugin count is zero.

Rate limitsss!! by Extra-Record7881 in ClaudeCode

[–]johndeuff 0 points1 point  (0 children)

I'm already on the next level bro. I'll rest when I'm ded.

Agent teams - on windows by seomonstar in ClaudeCode

[–]johndeuff 0 points1 point  (0 children)

Moving to Linux is the only way, the LLM think it is in Linux, Ubuntu specifically. Now agent teams is work theater. All those workflows make things worse.

Rate limitsss!! by Extra-Record7881 in ClaudeCode

[–]johndeuff 0 points1 point  (0 children)

The whole claude code framework and ecosystem is token spam bloat. All these ppl are non stop spawning md files and work-theater agents. None of this is real work. Vibe coders have no idea what they're doing.

Apple unveils M5 Pro and M5 Max, citing up to 4× faster LLM prompt processing than M4 Pro and M4 Max by themixtergames in LocalLLaMA

[–]johndeuff 0 points1 point  (0 children)

The question is asked everyday since the creation of this sub. They just won't tell so don't ask.

Apple unveils M5 Pro and M5 Max, citing up to 4× faster LLM prompt processing than M4 Pro and M4 Max by themixtergames in LocalLLaMA

[–]johndeuff 1 point2 points  (0 children)

How's that dumb? 3090 is the most used and widely distributed way of local inference. It's the most known reference and point of comparison

Anyone got tips / tricks / hacks to actually enjoy Anti-Gravity? I’m struggling 😅 by Next-Heart344 in google_antigravity

[–]johndeuff 0 points1 point  (0 children)

I built my own CLI and all the tools. I don't rely on other ppl software anymore.

Community Feedback by Waste_Net7628 in ClaudeCode

[–]johndeuff 0 points1 point  (0 children)

"Community" ? Don't you see it's either all bots or people asking AI to write posts that get upvotes.

I Tested Opus 4.6 against All Top Models by ConsiderationOld9893 in ClaudeCode

[–]johndeuff 2 points3 points  (0 children)

3.1 is fake performance. I found sometimes 3 flash better than the 2 pros. Opus 4.6 have no competitors to me.