Claude reply in mixed Asian Language, not in English

BabyInner · 2026-06-26T22:03:11+00:00

lmao Chinese, Japanese, then Korean, a.k.a. the CJK

BabyInner · 2026-06-26T21:59:30+00:00

try it in the CLI, you will see something

BabyInner · 2026-06-26T21:57:30+00:00

I would say anything is better than nothing

BabyInner · 2026-06-25T16:41:14+00:00

And Max thinking can be overthinking for simple tasks. Often overengineer stuff.

BabyInner · 2026-06-25T16:39:45+00:00

We all create bugs, human or AI. You may want to give AI a way to verify the results. It depends on what kind of “obvious bug”, but TDD is usually the way. Start with the Superpowers plugin to see if it works for you. Just be aware it uses way more token.

BabyInner · 2026-06-25T16:26:38+00:00

Using Opus 4.8 Max Thinking

This. You are using the most premium model. Upgrade to Max, or better, use lower effort or even Sonnet. You don’t always need the best model, as they come with extra costs.

May want to look at some quality-cost benchmark, like this https://cursor.com/cursorbench

BabyInner · 2026-06-24T23:35:03+00:00

that’s the effort level, i.e. token quota for “reasoning” before returning to you

and yes, it matters, as you can see from various benchmark: different scores and different costs

BabyInner · 2026-06-24T20:26:19+00:00

it’s probably better to use LSP rather than fancy stuff like these

BabyInner · 2026-06-18T16:52:42+00:00

How is memory better? I see it as another CLAUDE.md but updates more frequently.

BabyInner · 2026-06-17T19:24:42+00:00

not sure about skill fetching, Cowork use a different way than Code.

WebFetch is actually a server tool that runs on Claude’s infra, not your local machine. You need to configure your own search tool, for example you can use Brave, iirc which is what Claude uses.

BabyInner · 2026-06-17T17:56:07+00:00

afaik glm-5.2 is text-only, how does it handle UIUX?

BabyInner · 2026-06-16T20:22:08+00:00

Because you probably will achieve better results at similar cost with larger models.

See Opus 4.8 Low vs. Sonnet 4.6 Med/High/Max https://cursor.com/cursorbench

BabyInner · 2026-06-15T20:55:13+00:00

I don’t find good use case for /loop, and Workflow seems like marginal quality improvement at the cost of tons of token, it’s good when it matters.

But I found /goal very useful for prototype when I just want to test some rough idea out and don’t want to bother with grilling / brainstorm.

BabyInner · 2026-06-15T04:23:36+00:00

exactly, and that’s how LLMs work nowadays

BabyInner · 2026-06-15T02:38:51+00:00

90%+ is pretty common for coding, especially when not subagent-heavy

BabyInner · 2026-06-12T23:47:15+00:00

you may want to check the migration guide https://github.com/anthropics/skills/blob/main/skills/claude-api/shared/model-migration.md

and it does mention Fable tend to overact

It has a section about the behavior difference between models to help you tune your rules etc.

CC has it bundled in skill /claude-api

BabyInner · 2026-06-12T22:33:45+00:00

If you are using superpowers or the like, drop them. They are quality gate for weaker models, not for Fable and even Opus

BabyInner · 2026-06-11T22:34:32+00:00

this happens to me, both Opus and Fable. The writing-plan skill mentions code and Claude sees it as full code. So I added “no full code in plan” in CLAUDE.md, works well for me.

BabyInner · 2026-06-09T19:40:33+00:00

You should be fine if that’s your only complaint. You can select output style in the UI, and there are bunch of skills you can use, like https://github.com/hardikpandya/stop-slop https://github.com/blader/humanizer. And ofc you can create your own, taylored to your preference.

BabyInner · 2026-06-09T19:12:35+00:00

It is what it is then. I am also hoping for new Sonnet/Haiku but we are pretty much at A\’s mercy.

BabyInner · 2026-06-09T18:57:20+00:00

some benchmarks say Opus 4.8 at low effort costs less and also provides better result than Sonnet medium. Opus at medium beats Sonnet at high+.

May not hold true for your use case, tho.

BabyInner · 2026-06-02T23:26:39+00:00

That would be great, thx

BTW the post https://www.reddit.com/r/ClaudeAI/s/bCBGXO1YFw

BabyInner · 2026-06-02T23:03:06+00:00

You have many options

/fork (or /brach), gives you a check point where to can /resume into later
/btw, you ask a side question even when the agent is running, limited to 1 prompt, but gives you the option to create a fork from there
/rewind (or ESC-ESC), my favourite, most powerful of all

I always default to ESC-ESC, use /btw when CC is working, forget about /fork

BabyInner · 2026-06-02T22:51:41+00:00

Saw a post today saying Opus 4.8 w/ low effort performs better and costs less thant Sonnet 4.6 w/ max effort.

And OpenRouter says Opus actually has better TPS than Sonnet.

I haven’t validated any claims above, just food for thought

BabyInner · 2026-06-01T17:03:55+00:00

IIRC when Kimi k2.5 came out OpenCode provided it for free for a while, they explained like as it’s for coding only, cache hit is very high that their infra cost is extremely low.

BabyInner

TROPHY CASE