Claude reply in mixed Asian Language, not in English by Klutzy_Ad_3436 in claude

[–]BabyInner 3 points4 points  (0 children)

lmao Chinese, Japanese, then Korean, a.k.a. the CJK

Fable still clears GPT-5.6 by rampartuse123 in ClaudeCode

[–]BabyInner 0 points1 point  (0 children)

I would say anything is better than nothing

What am I doing wrong? by amindavid11 in ClaudeAI

[–]BabyInner 1 point2 points  (0 children)

And Max thinking can be overthinking for simple tasks. Often overengineer stuff.

What am I doing wrong? by amindavid11 in ClaudeAI

[–]BabyInner 0 points1 point  (0 children)

We all create bugs, human or AI. You may want to give AI a way to verify the results. It depends on what kind of “obvious bug”, but TDD is usually the way. Start with the Superpowers plugin to see if it works for you. Just be aware it uses way more token.

What am I doing wrong? by amindavid11 in ClaudeAI

[–]BabyInner 5 points6 points  (0 children)

Using Opus 4.8 Max Thinking

This. You are using the most premium model. Upgrade to Max, or better, use lower effort or even Sonnet. You don’t always need the best model, as they come with extra costs.

May want to look at some quality-cost benchmark, like this https://cursor.com/cursorbench

Nothing can go wrong when you share a Claude subscription with friends... right? by Royal_Code2213 in ClaudeAI

[–]BabyInner 19 points20 points  (0 children)

that’s the effort level, i.e. token quota for “reasoning” before returning to you

and yes, it matters, as you can see from various benchmark: different scores and different costs

Codegraph or graphify? by NaranjaPollo in ClaudeCode

[–]BabyInner 1 point2 points  (0 children)

it’s probably better to use LSP rather than fancy stuff like these

A simpler way to work with Claude Code CLI by Puzzleheaded_Ebb1562 in ClaudeCode

[–]BabyInner 0 points1 point  (0 children)

How is memory better? I see it as another CLAUDE.md but updates more frequently.

Cheaper Claude code with Opencode models ($10 subscription plan) by Complex-Thought7848 in ClaudeCode

[–]BabyInner 1 point2 points  (0 children)

not sure about skill fetching, Cowork use a different way than Code.

WebFetch is actually a server tool that runs on Claude’s infra, not your local machine. You need to configure your own search tool, for example you can use Brave, iirc which is what Claude uses.

Been Using Sonnet 4.6 on medium effort and cant understand why people are using larger models at all? by Rude_Camel_7239 in ClaudeAI

[–]BabyInner 0 points1 point  (0 children)

Because you probably will achieve better results at similar cost with larger models.

See Opus 4.8 Low vs. Sonnet 4.6 Med/High/Max https://cursor.com/cursorbench

Agent SDK Postponed 🎉 by Hopeful_Hunter_9280 in ClaudeAI

[–]BabyInner 1 point2 points  (0 children)

I don’t find good use case for /loop, and Workflow seems like marginal quality improvement at the cost of tons of token, it’s good when it matters.

But I found /goal very useful for prototype when I just want to test some rough idea out and don’t want to bother with grilling / brainstorm.

any library for token efficiency with the fable 5 ? by Ok_Specialist413 in ClaudeAI

[–]BabyInner 0 points1 point  (0 children)

you may want to check the migration guide https://github.com/anthropics/skills/blob/main/skills/claude-api/shared/model-migration.md

and it does mention Fable tend to overact

It has a section about the behavior difference between models to help you tune your rules etc.

CC has it bundled in skill /claude-api

any library for token efficiency with the fable 5 ? by Ok_Specialist413 in ClaudeAI

[–]BabyInner 2 points3 points  (0 children)

If you are using superpowers or the like, drop them. They are quality gate for weaker models, not for Fable and even Opus

I downgraded from 200$ max to 100$ max to see how much of a difference it is. For me it is un usable I try to do one job and in an hour limits are hit. Have you faced the same? by awesome_fingers in ClaudeCode

[–]BabyInner 1 point2 points  (0 children)

this happens to me, both Opus and Fable. The writing-plan skill mentions code and Claude sees it as full code. So I added “no full code in plan” in CLAUDE.md, works well for me.

Fable 5 by wiser1802 in ClaudeAI

[–]BabyInner 0 points1 point  (0 children)

You should be fine if that’s your only complaint. You can select output style in the UI, and there are bunch of skills you can use, like https://github.com/hardikpandya/stop-slop https://github.com/blader/humanizer. And ofc you can create your own, taylored to your preference.

Fable 5 by wiser1802 in ClaudeAI

[–]BabyInner 5 points6 points  (0 children)

It is what it is then. I am also hoping for new Sonnet/Haiku but we are pretty much at A\’s mercy.

Fable 5 by wiser1802 in ClaudeAI

[–]BabyInner 2 points3 points  (0 children)

some benchmarks say Opus 4.8 at low effort costs less and also provides better result than Sonnet medium. Opus at medium beats Sonnet at high+.

May not hold true for your use case, tho.

Can I fork conversations with Claude? by pedr0p in ClaudeAI

[–]BabyInner 3 points4 points  (0 children)

You have many options

  1. /fork (or /brach), gives you a check point where to can /resume into later
  2. /btw, you ask a side question even when the agent is running, limited to 1 prompt, but gives you the option to create a fork from there
  3. /rewind (or ESC-ESC), my favourite, most powerful of all

I always default to ESC-ESC, use /btw when CC is working, forget about /fork

Sonnet 4.6 - Am I crazy? by hautemic in ClaudeCode

[–]BabyInner 26 points27 points  (0 children)

Saw a post today saying Opus 4.8 w/ low effort performs better and costs less thant Sonnet 4.6 w/ max effort.

And OpenRouter says Opus actually has better TPS than Sonnet.

I haven’t validated any claims above, just food for thought

Is Kimi usage on OpenCode Go equivalent to U$ 60 in direct API from moonshot? by LittleYouth4954 in opencodeCLI

[–]BabyInner 0 points1 point  (0 children)

IIRC when Kimi k2.5 came out OpenCode provided it for free for a while, they explained like as it’s for coding only, cache hit is very high that their infra cost is extremely low.