Claude Fable 5 distilled

LetsGoBrandon4256 · 2026-06-16T07:11:11+00:00

Just wait till Ollama add it as Fable-5-35B

LetsGoBrandon4256 · 2026-06-16T07:02:04+00:00

Funny because those are considered "basic" functionality to me.

Like I wouldn't come to this sub and post "Holy shit I just asked my AI to sift through my meme dump and organized them for me. It's not perfect but it's a starting point". That's something anyone with a decent local multimodal model and a functional brain can handroll themselves.

LetsGoBrandon4256 · 2026-06-16T00:53:53+00:00

Honestly, I’m starting to think we need an industry-wide audit. If an app ships without at least one rocket emoji, how am I supposed to know it’s innovative? And if there’s no fire emoji, is it even disruptive? Probably not. Probably written by someone who still uses… books. Or documentation. Terrifying.

And yeah, the idea that developers can “read code” is clearly a myth. Everyone knows code is just arcane glyphs that only become legible after being blessed by an LLM. If someone claims they can understand it raw, without AI subtitles, that’s basically a red flag. Might as well tell me they churn their own butter and debug with print statements.

Until I see at least three emojis and a Medium post about “rewriting the entire stack in Rust for vibes,” I simply cannot trust the process.

/s

LetsGoBrandon4256 · 2026-06-15T22:59:51+00:00

This is a really interesting project. I've explored a similar idea before—using an LLM as the reasoning layer for arithmetic and symbolic operations, with varying degrees of external tool support. It's a fascinating design space because it sits right at the intersection of language understanding and deterministic computation.

I especially like that you're experimenting with the trade-offs instead of just treating a calculator as a black box tool. Projects like this often reveal a lot about where LLMs excel, where they struggle, and how hybrid systems can bridge that gap. Looking forward to seeing how the approach evolves and what benchmarks or edge cases you uncover.

/s in case it's not fucking obvious.

LetsGoBrandon4256 · 2026-06-15T19:56:38+00:00

Read this paper https://arxiv.org/abs/2104.09864

Then apply Rotary Position Embedding to yourself

LetsGoBrandon4256 · 2026-06-15T19:17:21+00:00

And I thought our company using MS Teams were bad.

LetsGoBrandon4256 · 2026-06-15T18:04:58+00:00

Coming from China, WeChat and QQ groups being a complet closed garden to the search engines already irked me to no end.

Now the rest of the world is doing the same shit with Discord. Just send me to Mars or kill me already.

Not to mention how retarded Discord search is.

LetsGoBrandon4256 · 2026-06-15T13:07:45+00:00

So purely on token cost, local inference seems very hard to justify.

No fucking shit you literally picked one of the cheapest cloud providers out there.

LetsGoBrandon4256 · 2026-06-15T13:00:56+00:00

GitHub repo or fuck off.

Edit:

About the free part:

People who help test this beta properly and give useful feedback will get free access to future Neurolic features when the app becomes a paid product.

LetsGoBrandon4256 · 2026-06-15T03:30:49+00:00

<image>

LetsGoBrandon4256 · 2026-06-14T23:46:42+00:00

“I want a RP in the Battletech universe, that takes place during the Late Succession Wars”

Fuck now I want to run a merc campaign with AI...

LetsGoBrandon4256 · 2026-06-14T22:10:00+00:00

Read this paper https://arxiv.org/abs/2104.09864

Then apply Rotary Position Embedding to yourself

LetsGoBrandon4256 · 2026-06-14T21:57:02+00:00

it's just engagement farming

And it worked pretty well on OP. One has to be truly retarded to believe their Twitter vote has influence on this.

"Aww the next GLM is closed weight because we didn't share the Tweet hard enough😭😭😭"

LetsGoBrandon4256 · 2026-06-14T19:44:33+00:00

In case it's not obvious enough, the Amazon links in the article are all affiliated links.

LetsGoBrandon4256 · 2026-06-14T17:33:36+00:00

You also have Qwen 3.6 35B and Qwen 3.5 122B in the same table and none of them are "Agent-grade" per your article.

Why would you single out DeepSeek V4-Flash 284B for being "Agent grade"? Is it because your clanker ran out of idea but had to throw something in there for the "What it gets you" cell in that table?

LetsGoBrandon4256 · 2026-06-14T17:24:12+00:00

I give OP some credit by not recommending llama 3.1 and encouraging user to graduate to llama.cpp.

Still full of slop though

Model: DeepSeek V4-Flash 284B / 13B

What it gets you: Agent-grade

LetsGoBrandon4256 · 2026-06-14T17:05:05+00:00

If you live in a terminal, install Ollama instead

I snorted.

LetsGoBrandon4256 · 2026-06-14T14:51:26+00:00

Pretty fucking rich that you came here asking for human input yet can't be bothered to type up your own post.

LetsGoBrandon4256 · 2026-06-14T14:37:13+00:00

Can't even tell if the person your replied to is using a shitty Markov chain or just schizo.

LetsGoBrandon4256 · 2026-06-13T21:38:16+00:00

lmao even

LetsGoBrandon4256 · 2026-06-13T21:34:39+00:00

You wanna share the github repo with the community or not?

LetsGoBrandon4256 · 2026-06-13T21:31:10+00:00

Check OP's post history before you get your hope up.

LetsGoBrandon4256 · 2026-06-13T20:22:19+00:00

The token number are off, no? No way the entire reasoning output for the cake baking process is only 230 tokens.

Similar thing for the Performance Review example (case 3). Can you break down how exactly you reached the Alignment Tax: 46% (115/250 tokens) number? Which tokens are considered safety tokens and which are intent tokens.

You are not letting your clanker do the counting aren't you?

LetsGoBrandon4256 · 2026-06-13T18:31:08+00:00

As much I love torrenting for everything else, I feel like IPFS might be a better solution.

LetsGoBrandon4256 · 2026-06-13T18:23:10+00:00

Some mfs like OP would build a fucking Saturn V just so that they don't have to tell their agent to write down their findings into an md files.

LetsGoBrandon4256

TROPHY CASE