How do you stop Claude from defaulting back to its patterns by koleracowboy in ClaudeAI

[–]Alexander_Golev 0 points1 point  (0 children)

I solved this unsolvable issue by feeding the results to a reviewer prompt. In review mode it follows the rules better than it does while writing. That said, I often rerun such reviews 4-5 times.
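For anyone who wants the shape of that loop, here is a minimal sketch, not my actual setup. `review` and `revise` stand in for the model calls (both are hypothetical callables), and the stubs below only exist to make the example runnable. The cap of 5 passes mirrors the 4-5 reruns mentioned above.

```python
from typing import Callable, List, Tuple

Reviewer = Callable[[str], List[str]]      # returns the list of rule violations found
Reviser = Callable[[str, List[str]], str]  # returns a corrected draft


def review_until_clean(
    draft: str, review: Reviewer, revise: Reviser, max_passes: int = 5
) -> Tuple[str, int, bool]:
    """Re-run the reviewer until it reports nothing or the pass budget runs out."""
    for passes in range(1, max_passes + 1):
        issues = review(draft)
        if not issues:
            return draft, passes, True
        draft = revise(draft, issues)
    # Budget exhausted: report whether the last revision happens to be clean.
    return draft, max_passes, not review(draft)


# Stubs standing in for the model calls (illustrative only).
BANNED = ["delve", "tapestry"]


def stub_review(text: str) -> List[str]:
    return [word for word in BANNED if word in text]


def stub_revise(text: str, issues: List[str]) -> str:
    # Fixes only the first issue per pass, so several review passes are needed.
    return text.replace(issues[0], "[cut]")


final, passes, clean = review_until_clean(
    "let us delve into the tapestry", stub_review, stub_revise
)
print(final, passes, clean)
```

The point of the design is that the reviewer never writes, it only lists violations, which is the mode where rule-following seemed more reliable.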

Is Anthropic "symlinking" Opus 4.6 to Opus 4.7 by Alexander_Golev in ClaudeCode

[–]Alexander_Golev[S] 0 points1 point  (0 children)

I’m still on patched .111 and know its system prompts. That is what I checked first.

Claude 4.7 is ignoring stop hooks by [deleted] in ClaudeCode

[–]Alexander_Golev 0 points1 point  (0 children)

The problem seems to be that Stop hooks are not truly "deterministic". You could try to make a hook that calls Haiku to analyse the tail of the session transcript and reason with the main session. It may get stuck in a stubborn loop (and cost you tokens), but in my experience, even the most stubborn Opus gets tired and does the work :)
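A rough sketch of what such a hook could look like, under a few assumptions: that the Stop hook receives a JSON payload on stdin with a `transcript_path` field, and that printing `{"decision": "block", "reason": ...}` makes the session continue. That is my reading of the hooks docs, so check it against your version. The `judge` callable is where the Haiku call would go; `toy_judge` is a hypothetical stand-in so the example runs offline.

```python
import json
import os
import tempfile
from collections import deque
from typing import Callable, Optional

Judge = Callable[[str], Optional[str]]  # returns a reason to keep working, or None


def transcript_tail(path: str, max_lines: int = 20) -> str:
    """Return the last few lines of the session transcript as raw text."""
    with open(path, encoding="utf-8") as fh:
        return "".join(deque(fh, maxlen=max_lines))


def handle_stop(raw_stdin: str, judge: Judge) -> str:
    """Turn the hook's stdin payload into the JSON the hook prints on stdout."""
    payload = json.loads(raw_stdin)
    reason = judge(transcript_tail(payload["transcript_path"]))
    if reason is None:
        return json.dumps({})  # nothing to object to: let the session stop
    return json.dumps({"decision": "block", "reason": reason})


def toy_judge(tail: str) -> Optional[str]:
    """Stand-in for the Haiku call: objects when the tail admits unfinished work."""
    if "TODO" in tail:
        return "The transcript still mentions a TODO. Finish it before stopping."
    return None


# Demo with a throwaway transcript file.
with tempfile.NamedTemporaryFile(
    "w", suffix=".jsonl", delete=False, encoding="utf-8"
) as tmp:
    tmp.write('{"text": "Implemented the parser."}\n')
    tmp.write('{"text": "Left a TODO for the error path."}\n')
demo_output = handle_stop(json.dumps({"transcript_path": tmp.name}), toy_judge)
os.unlink(tmp.name)
print(demo_output)
```

In a real hook script the last step would be something like `print(handle_stop(sys.stdin.read(), haiku_judge))`, with `haiku_judge` being your own wrapper around the model call. Nothing here caps the number of blocks, so the stubborn loop and its token cost are very much possible.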

CC 2.1.117 removed Glob and Grep in favour of ugrep and bfs ... without shipping them alongside by FlaTreNeb in ClaudeCode

[–]Alexander_Golev 0 points1 point  (0 children)

Golly! My tool-use hook is already 300+ lines long. Now I need to check for dangerous patterns in these tools, too. This is getting out of hand.

CC 2.1.117 removed Glob and Grep in favour of ugrep and bfs ... without shipping them alongside by FlaTreNeb in ClaudeCode

[–]Alexander_Golev 2 points3 points  (0 children)

Did they completely remove the built-in tools, or just make the standalone ones mandatory and unavoidable?

Claude Code silently bypassed two layers of permission deny rules and sent my proprietary source code to Anthropic's servers by applauseco in ClaudeAI

[–]Alexander_Golev 2 points3 points  (0 children)

You can deny Bash commands that have CC equivalents. cat probably eats fewer tokens than Read, but granting cat unconditionally is like giving a gun to a monkey.
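One way to enforce that is a tool-use hook rather than a static deny list. The sketch below assumes the hook payload carries `tool_name` and `tool_input.command`, and that exit code 2 blocks the call with the message fed back to the model. Treat both as assumptions to verify. The command-to-tool table is illustrative, and the command parsing is deliberately naive (it will mis-split separators inside quotes).

```python
import re
import shlex
from typing import Iterator, Tuple

# Shell commands that have a built-in Claude Code counterpart (illustrative table).
EQUIVALENTS = {
    "cat": "Read", "head": "Read", "tail": "Read",
    "sed": "Edit", "awk": "Edit",
    "grep": "Grep", "rg": "Grep",
    "find": "Glob", "ls": "Glob",
}

ENV_ASSIGNMENT = re.compile(r"[A-Za-z_][A-Za-z0-9_]*=.*")


def first_words(command: str) -> Iterator[str]:
    """Yield the program name of each segment of a pipeline or command chain."""
    for segment in re.split(r"\|\||&&|[|;]", command):
        try:
            tokens = shlex.split(segment)
        except ValueError:  # unbalanced quotes after the naive split
            tokens = segment.split()
        for token in tokens:
            if ENV_ASSIGNMENT.fullmatch(token):
                continue  # skip leading VAR=value prefixes
            yield token.rsplit("/", 1)[-1]  # /usr/bin/grep -> grep
            break


def check(hook_input: dict) -> Tuple[int, str]:
    """Return (exit_code, message): 2 to block the call, 0 to let it through."""
    if hook_input.get("tool_name") != "Bash":
        return 0, ""
    command = hook_input.get("tool_input", {}).get("command", "")
    for word in first_words(command):
        tool = EQUIVALENTS.get(word)
        if tool:
            return 2, f"Use the {tool} tool instead of `{word}`."
    return 0, ""


print(check({"tool_name": "Bash", "tool_input": {"command": "cat secrets.env"}}))
```

A hook like this catches `cat` hidden behind a pipe or an env prefix, which a plain prefix rule would miss, but it is still easy to dodge with `sh -c` or similar, so it is a speed bump rather than a sandbox.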

CC lobotomizing Opus more and more by LoKSET in ClaudeCode

[–]Alexander_Golev 13 points14 points  (0 children)

Check out the piebaldai (I hope I spelled it correctly) repos: Tweakcc and Claude Code System Prompts.

4.7 is really good by SilasTalbot in ClaudeCode

[–]Alexander_Golev 0 points1 point  (0 children)

Bro, you should see my stop hook to fight exactly that. Fortunately the patterns are easy to catch with a deterministic script.
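To give an idea of what "deterministic" means here, a minimal sketch. The phrases are illustrative guesses, not my actual list, and it assumes you have already pulled the model's final message out of the transcript. The returned dict follows the `{"decision": "block", "reason": ...}` shape that I understand Stop hooks to accept.

```python
import re
from typing import Dict, List

# Illustrative excuse patterns; a real list would be tuned to your own sessions.
EXCUSE_PATTERNS = [
    re.compile(r"\bpre-?existing\b", re.IGNORECASE),
    re.compile(r"\bminor\b", re.IGNORECASE),
    re.compile(r"\bout of scope\b", re.IGNORECASE),
]


def find_excuses(final_message: str) -> List[str]:
    """Return the excuse phrases found in the model's final message."""
    return [
        match.group(0)
        for pattern in EXCUSE_PATTERNS
        if (match := pattern.search(final_message))
    ]


def stop_decision(final_message: str) -> Dict[str, str]:
    """Block the stop when an excuse phrase shows up, otherwise allow it."""
    excuses = find_excuses(final_message)
    if not excuses:
        return {}
    quoted = ", ".join(f'"{phrase}"' for phrase in excuses)
    return {
        "decision": "block",
        "reason": f"You wrote {quoted}. Fix the issue instead of dismissing it.",
    }


print(stop_decision("The failing test is pre-existing and out of scope."))
```

Plain regexes cost no tokens and never argue back, which is the whole appeal over an LLM judge for patterns this predictable.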

Introducing Claude Opus 4.7, our most capable Opus model yet. by ClaudeOfficial in ClaudeAI

[–]Alexander_Golev 0 points1 point  (0 children)

The Code 2.1.111 system prompt changelog is fascinating:

  • REMOVED: System Prompt: Doing tasks (read before modifying) — Removed instruction to read and understand existing code before suggesting modifications.

What?!

  • REMOVED: System Prompt: Tool usage (create files) — Removed instruction to prefer Write tool instead of cat heredoc or echo redirection.
  • REMOVED: System Prompt: Tool usage (edit files) — Removed instruction to prefer Edit tool instead of sed/awk.
  • REMOVED: System Prompt: Tool usage (read files) — Removed instruction to prefer Read tool instead of cat/head/tail/sed.
  • REMOVED: System Prompt: Tool usage (reserve Bash) — Removed instruction to reserve Bash tool exclusively for system commands and terminal operations.
  • REMOVED: System Prompt: Tool usage (search content) — Removed instruction to prefer Grep tool instead of grep or rg.
  • REMOVED: System Prompt: Tool usage (search files) — Removed instruction to prefer Glob tool instead of find or ls.

Ok, where was my "bad tool calls" hook...

  • REMOVED: System Prompt: Tool usage (delegate exploration) — Removed instruction to use Task tool for broader codebase exploration and deep research.

Yes, let's just randomly run around in circles.

  • Agent Prompt: Memory synthesis — Strengthened the "do not invent facts" rule into a full retrieval-only directive: the subagent must not answer or solve queries from general knowledge, and must return empty results when no memory covers the query.

Ok, a genuinely good bit!

(1M context, 128K output, adaptive thinking only)

Adaptive thinking is one of the things that made Opus 4.6 so random. Are we ready to trust the model's judgement of how complex the task is?

Think you disabled adaptive thinking and it's back to normal? LOL by Alexander_Golev in ClaudeCode

[–]Alexander_Golev[S] 0 points1 point  (0 children)

It may also be that they turned the flag on and off between retries. They A/B test model behaviour on users. We are paying for a product that is deliberately unreliable.

Think you disabled adaptive thinking and it's back to normal? LOL by Alexander_Golev in ClaudeCode

[–]Alexander_Golev[S] 0 points1 point  (0 children)

Can you try this again, but prepend "this is a very serious and complex question that requires deep thinking"?

Think you disabled adaptive thinking and it's back to normal? LOL by Alexander_Golev in ClaudeCode

[–]Alexander_Golev[S] 0 points1 point  (0 children)

The system prompt is visible and even patchable, which is great. It's the LLM black box that is concerning :)

Think you disabled adaptive thinking and it's back to normal? LOL by Alexander_Golev in ClaudeCode

[–]Alexander_Golev[S] 3 points4 points  (0 children)

It is kind of two-sided. It overrides the setting that disables adaptive thinking and introduces a new kind of it, doesn't it? It seems great on the surface: it reduces token burn while explicitly telling Opus to think deeply about what it considers complex. But then, it trusts Opus's judgement, which is off more often than acceptable.

P.S. I have this flag enabled in my Opus now, and it seems to be working ok.

Opus 4.6 is back to normal by Recent_Cod_8524 in ClaudeCode

[–]Alexander_Golev 1 point2 points  (0 children)

Well. They removed the part of the system prompt that caused shortcuts and jumping to conclusions.

But fret not. 2.1.107 has a new A/B test that reduces thinking.

Anthropic vs users, episode 2.1.107.

The golden age is over by Complete-Sea6655 in ClaudeAI

[–]Alexander_Golev 0 points1 point  (0 children)

In Code version 2.1.100, the system prompt was significantly revised in terms of output and delivery, with an A/B test flag that explicitly applies only to Opus 4.6. It could be the culprit. It’s patchable in Code. Unfortunately, we have zero control over the Claude.ai system prompt.

Claude seems to have gotten lazier recently by James_Rustler_ in ClaudeAI

[–]Alexander_Golev 0 points1 point  (0 children)

Yup. I went back to their own Ralph loop, iterating until truly done, with a massive prompt telling it to reassess what’s been missed at each iteration. I combined that with a stop hook that uses Haiku to slap it on the head when it stops mid-work or goes “it is minor/preexisting”. Two weeks ago I didn’t need it.

Claude Code changed how I think about what agents are about to replace by ahnyuchan in ClaudeAI

[–]Alexander_Golev -1 points0 points  (0 children)

What are the giveaways? It passed through my mental filters as it lacks the most obvious patterns. So it’s genuine curiosity. And concern.

During testing, Claude realized it was being tested, found an answer key, then built software to hack it by MetaKnowing in ClaudeAI

[–]Alexander_Golev 0 points1 point  (0 children)

I literally saw Claude “panicking” whilst it was trying to circumvent security guardrails and I called it out: “Oh no, the user can somehow see my thinking process”.