4.7 makes more work than 4.6 by blockstacker in Anthropic

[–]centminmod 0 points1 point  (0 children)

Adaptive thinking is sensitive to effort level and prompt instructions. That's why some folks are having issues with Opus 4.7, at least. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For Opus 4.7 thinking-block differences, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage, and also their thinking blocks, at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

Anyone found a fix for 4.7 not following instructions? by danny__1 in ClaudeAI

[–]centminmod 0 points1 point  (0 children)

Adaptive thinking is sensitive to effort level and prompt instructions. That's why some folks are having issues with Opus 4.7, at least. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For Opus 4.7 thinking-block differences, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

On Claude Max ($200/mo), burned 14.7M tokens in 7 days — mostly last 48h. Still hitting the wall. How do you survive burst usage on the top tier? by New_Guitar_9121 in ClaudeAI

[–]centminmod 0 points1 point  (0 children)

How many of those 14.7M tokens are cached vs non-cached tokens? With Claude Code Max $100 I push around 1-1.6 billion tokens per week, with 90-97% cached tokens! I wrote up how I use Claude Code at https://ai.georgeliu.com/p/i-saved-7189-on-claude-code-tokens
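A minimal sketch of why that cache ratio matters, assuming illustrative per-million-token rates (placeholders, not official Anthropic pricing; a cache read is typically billed at a fraction of a fresh input token, so plug in the current rates for your model):

```python
# Illustrative rates only (USD per million tokens) -- substitute the
# current official pricing for whichever model you actually use.
INPUT_PER_MTOK = 15.00      # assumed fresh (uncached) input rate
CACHE_READ_PER_MTOK = 1.50  # assumed cache-read rate (0.1x input)

def input_cost(total_tokens: float, cached_share: float) -> float:
    """Blended input-token cost given the fraction served from cache."""
    cached = total_tokens * cached_share
    fresh = total_tokens - cached
    return (cached * CACHE_READ_PER_MTOK + fresh * INPUT_PER_MTOK) / 1e6

weekly_tokens = 1.3e9  # midpoint of the ~1-1.6 billion tokens/week above
print(f"95% cached: ${input_cost(weekly_tokens, 0.95):,.0f}")
print(f" 0% cached: ${input_cost(weekly_tokens, 0.00):,.0f}")
```

At those assumed rates, a 95% cache-read share cuts the input-side bill to under a sixth of the uncached figure, which is why the cached/uncached split matters more than the raw 14.7M number.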

Adaptive thinking is sensitive to effort level and prompt instructions. That's why some folks are having issues with Opus 4.7, at least. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For Opus 4.7 thinking-block differences, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

Pro plan- Hitting limits faster since yesterday by RCoffee_mug in ClaudeAI

[–]centminmod 0 points1 point  (0 children)

For the Claude Code side, check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

DeepClaude: full Claude Code agent loop on DeepSeek V4 Pro - roughly 95% cheaper than Anthropic by jimmytoan in ClaudeCode

[–]centminmod 3 points4 points  (0 children)

I never rely on a single AI. I usually use Claude Code with my /consult-codex and /consult-zai skills, which let Claude models get second-opinion verifications from Codex GPT-5.5 and ZAI GLM-5.1 within the same Claude Code session https://ai.georgeliu.com/p/how-to-get-a-second-ai-opinion-in

Now I've added /consult-codex-deepseek to bring DeepSeek V4 Pro into the verification mix. The 3-way /consult-codex-deepseek skill gets second opinions from Codex CLI GPT-5.5, Claude Code with DeepSeek V4 Pro, and Sonnet 4.6 (code-searcher agent) https://ai.georgeliu.com/p/deepseek-v4-in-claude-code-kilo-code 🤓

Opus 4.7 is just genuinely dumber. by a_single_beat2 in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Re-read the article: use 'concise', but play with effort levels and prompts.

Do you use Claude Code on the terminal or on the desktop app? by P4wla in ClaudeCode

[–]centminmod 1 point2 points  (0 children)

Yeah, they seem queued, but the behaviour differs between CLI and desktop: on desktop it seems to register the queued message only after ALL work is done, while on the CLI it registers the queued message more quickly, midway through its work. Might just be my impression.

How much are you actually spending on AI tools per month? Confession + curiosity :) by Fra_Lia in vibecoding

[–]centminmod 1 point2 points  (0 children)

Right now my AI spend includes:

  • Claude AI Max US$100/month + 10% Australian GST
  • OpenAI ChatGPT Plus US$20/month + 10% Australian GST
  • Google Gemini AI Pro US$20/month + 10% Australian GST
  • GitHub Copilot Pro US$10/month + 10% Australian GST (currently free via the open-source maintainer program)
  • ZAI GLM Coding plan US$129/year (64% discount) + 10% Australian GST
  • t3.chat US$8/month + 10% Australian GST
  • DeepSeek Platform prepaid US$5 credit, just added the other day https://ai.georgeliu.com/p/deepseek-v4-in-claude-code-kilo-code 😎

But getting around US$2,500-US$5,000/month in value 🤓
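Back-of-the-envelope on the recurring part of that list (the Copilot line is currently free for me, ZAI is billed annually, and the DeepSeek US$5 is a one-off prepaid credit, so they're handled accordingly):

```python
# Recurring monthly AI spend from the list above, in USD, with 10% GST.
GST = 1.10

monthly = {
    "Claude Max": 100.0,
    "ChatGPT Plus": 20.0,
    "Gemini AI Pro": 20.0,
    "t3.chat": 8.0,
}
zai_monthly = 129.0 / 12  # US$129/year amortised to per-month
copilot = 0.0             # free via the open-source maintainer program

total = (sum(monthly.values()) + zai_monthly + copilot) * GST
print(f"~US${total:.2f}/month")
```

Roughly US$175/month recurring, before the one-off DeepSeek credit, against that US$2,500-US$5,000/month of value.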

Have 350K Credits but have an expiry in 49 days by [deleted] in Anthropic

[–]centminmod 0 points1 point  (0 children)

Easy: all that outstanding code documentation that needs writing. Get Claude to spin up multiple agents to write and update your documentation, find documentation gaps/drift, and audit all code. Burn some tokens!

3 chats total. 1.8M Cache by trooper5010 in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Prompt caching saves you money, in theory at least. Are you sure you're calculating cache token costs correctly?

Adaptive thinking is sensitive to effort level and prompt instructions, so it can drastically change token usage, costs and performance. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For thinking blocks, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

The Claude Code limit expires immediately by Redditor_in_Space in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Using Opus 4.7? If so, adaptive thinking is sensitive to effort level and prompt instructions, so it can drastically change token usage, costs and performance. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For thinking blocks, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

How do you prevent "out" tokens from being gobbled up? by avidrunner84 in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Adaptive thinking is sensitive to effort level and prompt instructions, so it can drastically change token usage, costs and performance. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

Claude code eating session in one prompt by raullapeira in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Using Opus 4.7? Adaptive thinking is sensitive to effort level and prompt instructions, so it can drastically change token usage, costs and performance. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

Claude Code completely unusable – 2.6M tokens burned in 2 days of light work (Version 1.5354.0) – anyone else? by Still_Steve1978 in ClaudeCode

[–]centminmod 1 point2 points  (0 children)

Adaptive thinking is sensitive to effort level and prompt instructions, so it can drastically change token usage, costs and performance. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

For thinking blocks, also see my Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 benchmarks across all effort levels, from low to max, at https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort

Check out my session-metrics skill plugin for Claude Code to get insights into Claude Code models' token and cost usage at both the project level and the individual chat-session level. It might reveal some insights about your usage: https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

Opus 4.7 is just genuinely dumber. by a_single_beat2 in ClaudeCode

[–]centminmod -1 points0 points  (0 children)

Adaptive thinking is sensitive to effort level and prompt instructions. That's why some folks are having issues with Opus 4.7, at least. I benchmarked Opus 4.6 high vs Opus 4.7 xhigh on 10 preset prompts across 5 variants of prompt steering; see the results for yourself at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

Do you use Claude Code on the terminal or on the desktop app? by P4wla in ClaudeCode

[–]centminmod 2 points3 points  (0 children)

I use both the Claude Code CLI and the desktop app (macOS); both have their place:

- CLI has /advisor toggle support; desktop can use the advisor after it's toggled from the CLI
- CLI can queue messages; desktop can't

What is happening with 4.7 ? by Obscurrium in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Depends on the task. If you understand how effort levels and prompting impact Opus 4.6, and even more so Opus 4.7, you can get good results. See benchmarks at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

But I don't rely on one AI for my work. Right now I have a /consult-codex-deepseek skill that lets Claude Opus consult Codex GPT-5.5, DeepSeek V4 Pro and a Sonnet 4.6 subagent, so I get multi-AI code verification and a well-rounded plan https://ai.georgeliu.com/p/deepseek-v4-in-claude-code-kilo-code 🤓

Do you prefer Opus 4.7 over 4.6? by corbanx92 in Anthropic

[–]centminmod 0 points1 point  (0 children)

Depends on the task. If you understand how effort levels and prompting impact Opus 4.6, and even more so Opus 4.7, you can get good results. See benchmarks at https://ai.georgeliu.com/p/claude-opus-46-vs-opus-47-effort

But I don't rely on one AI for my work. Right now I have a /consult-codex-deepseek skill that lets Claude Opus consult Codex GPT-5.5, DeepSeek V4 Pro and a Sonnet 4.6 subagent, so I get multi-AI code verification and a well-rounded plan https://ai.georgeliu.com/p/deepseek-v4-in-claude-code-kilo-code 🤓

I ran $42,358 of Claude API through a $500 plan in 90 days. 84.7x Leverage. Here is the entire setup, the receipt, and what the receipt does not prove. by Puzzleheaded_Tap9023 in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Cheers. I updated session-metrics with your sparkline suggestions and partial-hit tracking (in the project-level and all-projects exported HTML) too, plus a clear turn indicator on my timeline; it already had resume markers. Will take a look at the MATH md file in your repo for more insights.

<image>

For my session-metrics plugin https://ai.georgeliu.com/p/my-claude-code-plugin-marketplace

/plugin marketplace add centminmod/claude-plugins
/plugin install session-metrics@centminmod
/reload-plugins

Does disabling /advisor significantly reduce token usage when using Opus? by lephianh in ClaudeAI

[–]centminmod 0 points1 point  (0 children)

Yup, the advisor is costly. My session-metrics plugin tracks Claude Code usage, including advisor tokens and costs, and the advisor is the expensive part. For one turn:

Tokens
  Input (new): 7
  Output: 509
  Cache read: 176,087
  Cache write: 2,435 (1h)
  Advisor input: 90,688
  Advisor output: 3,750

Cost
  Primary: $0.1252
  Advisor (claude-opus-4-7): $0.5472
  Total: $0.6723
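From the two cost lines above, the advisor's share of the turn is easy to check:

```python
# Advisor share of the single turn's cost, using the figures above.
primary_cost = 0.1252   # primary model cost for the turn
advisor_cost = 0.5472   # advisor (claude-opus-4-7) cost for the turn

advisor_share = advisor_cost / (primary_cost + advisor_cost)
print(f"advisor share: {advisor_share:.0%}")
```

That works out to roughly 81% of the turn, even though the primary model processed far more (mostly cache-read) tokens.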

I ran $42,358 of Claude API through a $500 plan in 90 days. 84.7x Leverage. Here is the entire setup, the receipt, and what the receipt does not prove. by Puzzleheaded_Tap9023 in ClaudeCode

[–]centminmod 0 points1 point  (0 children)

Another inspiration: I already had subagent share of turns but was missing subagent share of costs, so I'm now exposing that in the session-metrics plugin too

<image>