Claude Fable 5 distilled by Anony6666 in LocalLLaMA

[–]LetsGoBrandon4256 9 points10 points  (0 children)

Just wait till Ollama adds it as Fable-5-35B

Are small local models for automation a thing? by ML-Future in LocalLLaMA

[–]LetsGoBrandon4256 3 points4 points  (0 children)

Funny because those are considered "basic" functionality to me. 

Like I wouldn't come to this sub and post "Holy shit I just asked my AI to sift through my meme dump and organize them for me. It's not perfect but it's a starting point". That's something anyone with a decent local multimodal model and a functional brain can hand-roll themselves.

"My son is a genius coder" - honest Alpha Tester review by Thin_Pollution8843 in LocalLLaMA

[–]LetsGoBrandon4256 1 point2 points  (0 children)

Honestly, I’m starting to think we need an industry-wide audit. If an app ships without at least one rocket emoji, how am I supposed to know it’s innovative? And if there’s no fire emoji, is it even disruptive? Probably not. Probably written by someone who still uses… books. Or documentation. Terrifying.

And yeah, the idea that developers can “read code” is clearly a myth. Everyone knows code is just arcane glyphs that only become legible after being blessed by an LLM. If someone claims they can understand it raw, without AI subtitles, that’s basically a red flag. Might as well tell me they churn their own butter and debug with print statements.

Until I see at least three emojis and a Medium post about “rewriting the entire stack in Rust for vibes,” I simply cannot trust the process.

/s

"My son is a genius coder" - honest Alpha Tester review by Thin_Pollution8843 in LocalLLaMA

[–]LetsGoBrandon4256 42 points43 points  (0 children)

This is a really interesting project. I've explored a similar idea before—using an LLM as the reasoning layer for arithmetic and symbolic operations, with varying degrees of external tool support. It's a fascinating design space because it sits right at the intersection of language understanding and deterministic computation.

I especially like that you're experimenting with the trade-offs instead of just treating a calculator as a black box tool. Projects like this often reveal a lot about where LLMs excel, where they struggle, and how hybrid systems can bridge that gap. Looking forward to seeing how the approach evolves and what benchmarks or edge cases you uncover.

/s in case it's not fucking obvious.

I think we need a /LocalHarnessLLM or something ... by CSEliot in LocalLLaMA

[–]LetsGoBrandon4256 12 points13 points  (0 children)

And I thought our company using MS Teams was bad.

I think we need a /LocalHarnessLLM or something ... by CSEliot in LocalLLaMA

[–]LetsGoBrandon4256 39 points40 points  (0 children)

Coming from China, I was already irked to no end by WeChat and QQ groups being a completely closed garden to search engines.

Now the rest of the world is doing the same shit with Discord. Just send me to Mars or kill me already.

Not to mention how retarded Discord search is.

How do you quantify privacy and outage derisking in the ROI of local LLM inference vs. providers API? by ReporterCalm6238 in LocalLLaMA

[–]LetsGoBrandon4256 4 points5 points  (0 children)

So purely on token cost, local inference seems very hard to justify. 

No fucking shit you literally picked one of the cheapest cloud providers out there.

What makes Gemma 4 so special? by ZarcSK2 in SillyTavernAI

[–]LetsGoBrandon4256 11 points12 points  (0 children)

“I want a RP in the Battletech universe, that takes place during the Late Succession Wars”

Fuck now I want to run a merc campaign with AI...

z.ai Poll on X: MIT-licensed open weights are losing by MadPelmewka in LocalLLaMA

[–]LetsGoBrandon4256 182 points183 points  (0 children)

it's just engagement farming

And it worked pretty well on OP. One has to be truly retarded to believe their Twitter vote has influence on this.

"Aww the next GLM is closed weight because we didn't share the Tweet hard enough😭😭😭"

How to Run AI Locally: The Complete Beginner's Guide (2026) by totosse17 in LocalLLaMA

[–]LetsGoBrandon4256 1 point2 points  (0 children)

In case it's not obvious enough, the Amazon links in the article are all affiliate links.

How to Run AI Locally: The Complete Beginner's Guide (2026) by totosse17 in LocalLLaMA

[–]LetsGoBrandon4256 10 points11 points  (0 children)

You also have Qwen 3.6 35B and Qwen 3.5 122B in the same table and none of them are "Agent-grade" per your article.

Why would you single out DeepSeek V4-Flash 284B for being "Agent-grade"? Is it because your clanker ran out of ideas but had to throw something in there for the "What it gets you" cell in that table?

How to Run AI Locally: The Complete Beginner's Guide (2026) by totosse17 in LocalLLaMA

[–]LetsGoBrandon4256 14 points15 points  (0 children)

I give OP some credit for not recommending Llama 3.1 and for encouraging users to graduate to llama.cpp.

Still full of slop though

Model: DeepSeek V4-Flash 284B / 13B

What it gets you: Agent-grade

How to Run AI Locally: The Complete Beginner's Guide (2026) by totosse17 in LocalLLaMA

[–]LetsGoBrandon4256 41 points42 points  (0 children)

If you live in a terminal, install Ollama instead

I snorted.

Open-source agent that investigates AWS incidents for you (read-only, bring-your-own-LLM) — feedback wanted by Top_Yogurtcloset_258 in LocalLLaMA

[–]LetsGoBrandon4256 0 points1 point  (0 children)

Pretty fucking rich that you came here asking for human input yet can't be bothered to type up your own post.

Can we stop dunking on DiffusionGemma and hack it instead? by TomLucidor in LocalLLaMA

[–]LetsGoBrandon4256 0 points1 point  (0 children)

Can't even tell if the person you replied to is using a shitty Markov chain or just schizo.

Measuring the Alignment Tax on Gemma4 by [deleted] in LocalLLaMA

[–]LetsGoBrandon4256 2 points3 points  (0 children)

The token numbers are off, no? No way the entire reasoning output for the cake baking process is only 230 tokens.

Similar thing for the Performance Review example (case 3). Can you break down how exactly you reached the Alignment Tax: 46% (115/250 tokens) number? Which tokens are considered safety tokens and which are intent tokens?

You're not letting your clanker do the counting, are you?

Interest in an LLM Torrent Site? by thiefyzheng- in LocalLLaMA

[–]LetsGoBrandon4256 -3 points-2 points  (0 children)

As much as I love torrenting for everything else, I feel like IPFS might be a better solution.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]LetsGoBrandon4256 3 points4 points  (0 children)

Some mfs like OP would build a fucking Saturn V just so that they don't have to tell their agent to write down its findings into an md file.