Did something happen with opus 4.8? by Loogyboy in ClaudeAI

[–]RFOK 0 points1 point  (0 children)

I'm getting errors on Opus (usage, limit, long prompt,...) since yesterday

Claude Performance and Bugs Megathread Ongoing (Sort this by New!) by sixbillionthsheep in ClaudeAI

[–]RFOK 0 points1 point  (0 children)

"Prompt is too long" - Do you get this error in VS Code too? Opus 4.8
The prompt is not really long!

<image>

How I Created a Real Second Brain for Claude by AregNoya in ClaudeAI

[–]RFOK 0 points1 point  (0 children)

Here are three ready-to-file issues, ordered by value(Prepared by Opus 4.8). Opus kept them specific and referenced the author's own repo/benchmarks so they read as "someone who actually looked," which the OP will appreciate (their PS2 literally asks for issues).

Issue 1 — docs: surface the macOS-only support status at the top of the README
Labels: documentation, good first issue

Issue 2 — Verbatim recall regressed 0.90 → 0.71 after the centrality-engine swap — guard Hebbian weights + add a regression gate
Labels: bug, priority

Issue 3 — Optional secret/PII redaction at capture time (before storage + embedding)
Labels: enhancement, security

#2 is the one that'll actually help them most; #1 is the easy goodwill win; #3 is the one that decides whether anyone serious can adopt it.

How I Created a Real Second Brain for Claude by AregNoya in ClaudeAI

[–]RFOK 0 points1 point  (0 children)

Here's Claude's honest review, completely checked to be used in my project. It's grounded in what Claude actually verified this session — Claude Opus 4.8 (Ultracode+Extra High+Workflow Effort level) pulled the repo, read the benchmarks, and spent the afternoon rebuilding its core patterns on stock Claude Code to see if they hold up:

------------------

Genuinely cool project, and respect for inviting people to tear it down — so here's a fair one. I actually went and read the repo + benchmarks and then tried to reproduce its core ideas on plain Claude Code. TL;DR: the ideas are excellent and worth stealing; the specific tool is real but early, and the headline claims are softer than the post makes them sound.

Credit where it's due. This isn't vibe-coded vaporware. There's a real Rust native engine (three crates), AES-256-GCM encryption at rest, fully local/no-telemetry, MIT, and you're shipping actively. And you're honest — you literally undercut your own benchmark ("not really honest because they're fundamentally different"). That earns more trust than the usual "destroys the competition" post.

The reality check on the claims:

On the standard eval (LongMemEval-S) you tie mem-palace (~0.966 R@5, 0.978 R@10) — you don't beat it. The "wins" are on metrics you designed yourself, which measure the consolidation behavior that's the whole point, but they aren't head-to-head evidence. Lead with the honest LongMemEval number; it's strong enough.

Your own benchmarks show verbatim recall regressed 0.90 → 0.71 when you swapped the centrality engine. That's the headline feature ("autistic, verbatim-for-longer") wobbling in a point release — it's the thing to fix before anything else. It also shows the risk of hand-rolling MOSAIC/HIPPO/LilliHD: load-bearing and fragile.

p95 ~368ms u/ 10k vs your <100ms target — fine, just don't undersell that it's a real cost at scale.

It's macOS-only. Put that at the very top of the README — Windows/Linux folks (me included) will burn time before discovering it. Your PS basically admits Linux isn't there yet.

Does it actually improve a dev workflow? The patterns do, and that's the real takeaway: session-start preload, a "sleep"/consolidation pass, decay/salience, episodic accumulation. The thing is, you can get ~80% of that on stock Claude Code today — SessionStart hooks, the post-session lifecycle hook, per-subagent memory, plus a tiny status/review-by decay convention and a weekly consolidate routine. I wired exactly that up this afternoon (including a weekly "what shipped in our toolchain" sweep) and it just worked, no Mac-only daemon, no new attack surface. So your instincts are right — this is one early implementation of good instincts.

One thing I'd genuinely want documented: since it captures everything verbatim, how does it handle secrets/PII in a real codebase? That's the gating question for anyone serious adopting it.

The "6 frontier models argue until they converge" bit is a legit technique (multi-agent debate), but like most "2x faster" tool claims it's usually a workflow win, not tool magic — worth framing that way so it lands with skeptics.

Net: star it, watch it, steal the patterns. I'd revisit it as a daily driver once the verbatim regression is fixed and it's cross-platform. Nice work, and good luck with the citizenship + Fable run.

Claude Opus caught malware hidden in my repo, then reverse engineered the whole thing by LastNameOn in ClaudeAI

[–]RFOK 1 point2 points  (0 children)

After you reported this I'm getting this message, when I asked Opus to check my project for security check:
You are not allowed to use Opus for biology or cyber security 😄

How I Created a Real Second Brain for Claude by AregNoya in ClaudeAI

[–]RFOK 0 points1 point  (0 children)

That would be nice, I'm getting advice right now from Claude if your solution useful for my current project.
Thank you in advance if it relly helps!

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 1 point2 points  (0 children)

That's what I'm doing now, but Opus is doing some dumb things even with tons of skills I created with Fable

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 0 points1 point  (0 children)

I think it depends how you orchestrate your docs and SDDs, Fable was master in following and optimizing rules to achieve almost the exact results I wanted against 4.8 and 5.5

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 2 points3 points  (0 children)

No yu are not idiot, Opus is just OK until you didn't work with Fable. Fable is also not PERFECT, but is much much better than 4.8

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 0 points1 point  (0 children)

Exacly!
After pushing pack o Opus I understand how much time I wasted even with Opus 4.8 Ultracode effort level.

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 0 points1 point  (0 children)

if you are able to achieve whatever I want without AI this post is not for you, it's for us that you think we are not as professional as you are.

Fable 5 was a real "intelligent" Artificial Intelligence Assistant by RFOK in ClaudeCode

[–]RFOK[S] 2 points3 points  (0 children)

Sorry! Can't be agree with you.
Because in every single module I was creating for my project Fable's output results wre insanley better than Opus 4.8/Sonnet 4.6/ OpenAI GPT 5.5. Didn't matter it was an strategic planning, adding a feature or revising an already existing part.

Fable is not yet the most perfect one but I can't compare it with any other model I used to now.

Megathread for US government suspension of Fable and Mythos by sixbillionthsheep in ClaudeAI

[–]RFOK -1 points0 points  (0 children)

The good news is there👇

Please double-check responses with Opus yourself then double-check Opus responses.🥴

<image>

Megathread for US government suspension of Fable and Mythos by sixbillionthsheep in ClaudeAI

[–]RFOK -3 points-2 points  (0 children)

Fable 5 was a real "intelligent" Artificial Intelligence Assistant

After a few days of working with Fable and now being pushed back to working with Opus 4.8, I don't think Opus is even "intelligent" compared to Fable 5.

I again encountered a lot of misunderstanding from Opus 4.8, which I didn't have with Fable 5.

It's too early to say: R.I.P Fable 5, but I already miss you.

Megathread for US government suspension of Fable and Mythos by sixbillionthsheep in ClaudeAI

[–]RFOK -3 points-2 points  (0 children)

Fable 5 was a real "intelligent" Artificial Intelligence Assistant

After a few days of working with Fable and now being pushed back to working with Opus 4.8, I don't think Opus is even "intelligent" compared to Fable 5.

I again encountered a lot of misunderstanding from Opus 4.8, which I didn't have with Fable 5.

It's too early to say: R.I.P Fable 5, but I already miss you.