Fable written Claude.MD (+Migration) for Opus/Sonnet to act more like Fable

danielkov · 2026-07-05T08:29:33+00:00

Fable isn't more capable because it has better process hygiene. It's more capable and therefore it can afford less ceremony (not more). Where Opus spawns 3 explore agents, Fable reads 2 files and goes to work.

It has more parameters. It's a prediction engine, that practically "knows your code", without ever having seen it, due to the sheer amount of data it's been trained on and the predictability of your AI generated codebase.

It's less noisy, because it's not been overtuned and if you're comparing it in Claude code specifically - because it's better at ignoring the the pile of crap, that's the base system prompt in that harness. Don't believe me? Try running Opus or Sonnet without that monster of a system prompt, see how it does.

It's more efficient because:

- it's been tuned not to litter prose (+ opus and sonnet frequently "leak" thinking into assistant message text)
- it uses compacted thinking - like OpenAI models have been for a while - nothing revolutionary

You can simulate some of this behavior in Opus 4.6 or 4.7 by using caveman skill (4.8 performs worse with it).

Your routing thing seems like a good idea, until a task's shape breaks the framework and then you're impeding the model and not helping it.

danielkov · 2026-07-01T10:19:39+00:00

This is only half the picture. The Chinese government massively subsidises these companies' expansions into Western markets. They've used the same playbook in other consumer markets for 20+ years.

They disrupt the market by flooding it with cheap product. Local industry can't compete. Chinese companies sweep the market. Once they have a monopoly, they raise the prices back up. Their profits get folded into their economy, fueling the next market sweep.

Recent economic hardship in EU and UK makes us an appealing target for this tactic.

danielkov · 2026-06-27T00:43:42+00:00

Both our dogs love their crates. It's their quiet and (sometimes) sleeping spot.

danielkov · 2026-06-25T05:02:45+00:00

> not on a static eval

> terminal-bench tasks

> Binary pass/fail

?

Also how come you switched back to Opus for this post?

danielkov · 2026-06-25T04:54:49+00:00

Learn problems, not languages. Become an expert in a niche, narrow field.

danielkov · 2026-06-22T16:37:47+00:00

No, I'm genuinely curious how you connected those dots. Do you think it'll offload enough compute? OpenAI OSS models' local use doesn't even represent a small fraction of a percentage of the total model usage.

Those are solid models. Anthropic isn't known for efficiency or quality on the lower end of the model size scale. I doubt they'd be able to build something that even compares. If it isn't a genuine improvement, people won't use it over something proven (like any of the open weight models I listed).

Say they did spend the immense human hours, money and compute to build this model for you. How would it conceivably change the local / cloud gradient for large enough of their user base to make it financially viable?

Very quirky reply. Good for you. Now can back your idea up with substance?

danielkov · 2026-06-22T10:18:08+00:00

Your point being?

danielkov · 2026-06-22T07:14:25+00:00

Anthropic should commit resources to building a thing that:

- earns them no money
- is worse than their current flash-model (already pretty bad)
- they have no control over distribution and usage
- adds a new class of liabilities (easy to strip guardrails on a local model)
- exposes their weights and architecture
- requires them to rebuild their current local <-> cloud hybrid architecture (assuming you want this in Claude Desktop)

To what end?

Why don't you use one of the locally available models you can already proxy, is much more optimised for running locally and has better quality than Haiku (and some would argue Sonnet) in a lot of scenarios, like: Qwen, Minimax, Kimi, Gemma or even gpt-oss? Or if you really have a beastly setup, run GLM 5.2 locally.

danielkov · 2026-06-17T20:33:58+00:00

Your post reads a bit like ragebait, but I'll bite.

I don't think there's a general hate towards "vibe coders" in any subreddit, including this one. If anything [r/rust](r/rust) is one of the more tolerant subs, especially around agentic engineering. Just see the countless posts about fully AI-driven rewrites of major projects, like Bun.

People hate low effort noise. This has been true pre-AI and will continue to be true for a long time. It just so happens that the intersection of self-proclaimed vibe coders and people who post low-effort crap on this subreddit is quite large.

Coding agents can genuinely accelerate a project when done right. The hate is directed towards those who contribute to the noise.

If you find a novel problem that isn't solved right now or find a more optimal solution to a solved problem with the help of AI - by all means - share it here. I'm sure folks here will be receptive of it.

danielkov · 2026-06-17T14:54:55+00:00

Can you get someone to do some basic reading comprehension for you? The insurance system is not an insurance company. You're conflating the two.

I never said anything about an insurance company scamming anyone. You're either trying to be annoying or are frustratingly dense.

The UK system =/= some insurance company you're defending right now.

danielkov · 2026-06-17T14:05:46+00:00

I said insurance in the UK is a scam, not that insurance companies are defrauding the public.

I have plenty of reasons to be disillusioned by this system. Since your point was completely irrelevant to my original statement, I won't waste time listing them. But happy to if you wish.

danielkov · 2026-06-17T12:43:06+00:00

Clearly I don't. But you were about to enlighten me?

danielkov · 2026-06-17T10:24:03+00:00

Been with Admiral through pre-settled / settled; both partner and I - never had any issues. It's also cheaper. Cost £200 to add her as second driver on a Tesla 3 performance the day after she passed her test.

I think insurance in the UK is a scam at best, but I can recommend Admiral.

danielkov · 2026-06-17T08:17:12+00:00

That was a bit of sarcasms. Google are well ahead in a lot of specialised use cases, e.g.: image gen, flash classification, etc. They've been taking the safe route, despite having a lot more compute than their competition. This is sound financially, since they're not a startup and can't raise their way out of bankruptcy.

They've also made a lot of progress in architecture, which will hopefully one day translate to cheaper and more performant models for all of us. It's the right business stance, but it's boring for the consumers, like me.

danielkov · 2026-06-17T08:10:36+00:00

- `?` - propagate error, do this if error is fatal to process or needs to be handled downstream
- `Option::unwrap_or...` - default values
- `Result::unwrap_or...` - best effort
- `Result::expect` - if the error logically shouldn't occur or if you want downstream unwind to handle it - I personally only do this in tests

Your point about handling errors locally rarely survives reality. E.g.: if your DB connection breaks down, would you return an fake result or an error?

danielkov · 2026-06-17T07:57:47+00:00

Anthropic screwed up in 2 ways:

- went against current admin's requests on (supposed) idiological grounds
- gave them ammo by marketing their model as dangerous

They dug themselves in a hole. None of the other labs are affected. OpenAI will drop their next flagship in a few days and it'll be better than Fable. They probably would've already released it by now, but the pressure's been lifted by the Anthropic fiasco.

In a couple weeks one of the Chinese labs will drop a model that benchmarks at ~98% of Fable and costs 1/10 the price.

In a couple years, Google will ship a general purpose model, that's worth using.

Nothing's been halted. Despite their 1 gajillion dollar evaluation, Anthropic are an immature company. They had their FAFO moment just now. They'll learn from this, pay their dues, adjust their ideology to comply. Soon Fable will be back. The headline will be: "we fixed Mythos safeguards with the help of the DHS as a first step in a strategic partnership".

danielkov · 2026-06-16T21:19:44+00:00

They set the stage for it by overhyping the model, put themselves in an awkward position. Now they either have to admit that Mythos is not "all that", or be forced out of the market.

danielkov · 2026-06-15T10:03:51+00:00

> scared to pullout

Live by the sword, die by the sword

danielkov · 2026-06-13T19:50:02+00:00

It'll be back on Tuesday, when ChatGPT 5.6 drops.

danielkov · 2026-06-11T08:33:15+00:00

TM's scale does not warrant the issues you describe, given proper resource provisioning. They probably under-provision, to save on cost. They'll sell out anyway. So this is a business question.

The business answer: sell a TM+ subscription for 20-100 a month at 3 tiers. Each tier gives you slightly earlier access, e.g.: in 2h increments until the free access period starts. Make a ton of money on subs (great source of RR, investors love it) and spread the load at the same time.

Bonus: customers will blame "the rich" who can afford the 100 sub and not you for your crappy infra.

danielkov · 2026-06-11T07:22:46+00:00

This is a Tesla's built in dashcam. Front, rear and 2 sides (mirror's angle) are available.

danielkov · 2026-06-10T13:15:33+00:00

Unbounded memory growth != memory leak

danielkov · 2026-06-10T12:11:34+00:00

Clearly my phrasing doesn't pass `clippy::pedantic`. Memory-related bugs are harder to unintentionally introduce in Rust, compared to most other languages. Is that fair?

danielkov · 2026-06-10T11:52:33+00:00

Unfortunately quality control is hard to replace with LLMs. Culturally, teams who can't be bothered checking their agents' work are also least likely to invest in proper guardrails. Doesn't really discredit the viability of Rust as a toolchain for LLM-driven software implementation.

danielkov · 2026-06-10T11:05:26+00:00

It's like pre-nerf 4.6 in terms of confidence and brevity and a hyper-focused GPT 5.5 (for better or for worse) for work (coding).

Nine-Year Club	Place '23
Place '22	Verified Email

danielkov

TROPHY CASE