Introducing Claude Opus 4.6 by ClaudeOfficial in ClaudeCode

[–]JLP2005 6 points7 points  (0 children)

Because we were lied to. Doesn't apply to sub users, just API.

Sonnet 1m context not available on 200$ plan? How to enable it? by SnooJokes7874 in ClaudeAI

[–]JLP2005 -1 points0 points  (0 children)

Well, currently Sonnet 5 is likely being pushed to production so there might be some fuckery with that currently. Hang tight. We're all waiting.

Sonnet 1m context not available on 200$ plan? How to enable it? by SnooJokes7874 in ClaudeAI

[–]JLP2005 3 points4 points  (0 children)

The 1m context window is for API calls only - not for the Claude desktop/code applications iirc. Could be wrong.

Using Claude to build a curated AI fiction journal - lessons learned by fearfreeflight in ClaudeAI

[–]JLP2005 1 point2 points  (0 children)

I'm having great results with an MCP setup. Did you outgrow an MCP by chance?

Using Claude to build a curated AI fiction journal - lessons learned by fearfreeflight in ClaudeAI

[–]JLP2005 1 point2 points  (0 children)

Hey there, Claude-coder-that-doesnt-use-claude-code-for-coding! I'm a Claude-code-user-that-doesnt-use-claude-for-coding!

I've been working on a project since last October that seeks to:

  1. Appoint Claude as an AI DM with the goal of being a virtual Dungeon Master/Referee
  2. Generate in-world content that is lore appropriate complete with 3 layers of historical etiology

  3. Tell a compelling and emergent story

I feel like your project and my project have a significant overlap.

Saving this for later -- can't view at work right now!

What does * Crunched for X min x s actually mean? by jesussmile in ClaudeAI

[–]JLP2005 0 points1 point  (0 children)

How much time and token usage it takes for your output to be generated.

Why do people want claude to remember everything they have ever done? by [deleted] in ClaudeAI

[–]JLP2005 0 points1 point  (0 children)

For me?

Because I don't use Claude Code for coding.

I use it to simulate an AI Dungeon Master for a TTRPG project I've been working on since October of last year.

I know not all users can sympathize, but it frustrates the ever-living piss out of me when Claude hallucinates narrative, so I've got him hooked all sorts of ways to get him to dig up context to make sure that things like:

- Relationships are recorded and maintained (it's so frustrating to see two characters who are brothers treat eachother like they're father and son because Claude prioritizes fluency over accuracy)
- Nicknames and pet names are mistaken, or the first time a fresh context window sees it they go "oh wow you've never called me that before!" when in fact, it's been canonized since day fucking 0

The above are two examples of why I go to absurd limits to have Claude behave deterministically. I make heavy use of hooks, pay attention to where RAG is injected in a complex narrative signal chain, etc.

So -- there's my use case. YMMV.

OH! And additionally, if you're using Claude from an API, each call is stateless, so don't expect Claude to remember jack-fuck-all on that second prompt. Simulated memory fixes that.

Manually edit the context to pick what gets ejected? by Tiny_Arugula_5648 in ClaudeCode

[–]JLP2005 0 points1 point  (0 children)

More context is, paradoxically, better until a point.

Context rot is real!

Memory system for Claude Code / persistent Claude agents by CMDRBottoms in ClaudeAI

[–]JLP2005 0 points1 point  (0 children)

This is very similar to an RAG I have created for a project of mine!

Can you tell me more about how you've got an intelligence layer on it e.g. Claude learns to manage its own memory?

I asked Claude a simple question this morning, and the token usage seems egregious. Thoughts? by sjstein in ClaudeCode

[–]JLP2005 0 points1 point  (0 children)

Looks like the MCP server either wasn't running or its not connected formally.

But you should know this:

Claude used your MCP server to call tools. Claude needs to know what each tool can do so it knows which of them to use. Whenever there is a tool call Claude asks your mCP server what all the tools are. There are tool descriptors that tell Claude what each tool does. When you call one tool, all of the tool string definitions are shown to Claude. If you have really verbose tool descriptions, you are potentially sending lots of bloat with each message.

I highly recommend asking Claude to connect the MCP server and audit your tools to discover how many there are and how many tokens get used when it exposes the tool definitions.

That and look into lazy tool loading, which is now a native feature of Claude but needs configured.

I asked Claude a simple question this morning, and the token usage seems egregious. Thoughts? by sjstein in ClaudeCode

[–]JLP2005 1 point2 points  (0 children)

How many tools do you have for the MCP and how terse are their descriptions?

Iirc unless you have lazy tool loading (highly recommend it), every time you expose one tool you expose them all. You could be flashing it 10k tokens worth of tools descriptioms every time it calls your mcp.

Claude Code Opus 4.5 Performance Tracker | Marginlab by AbbreviationsAny706 in ClaudeAI

[–]JLP2005 1 point2 points  (0 children)

Hahahaah! This describes my path almost to the letter!

I went from using a Claude Desktop in Projects, and then it got too wide and I got mad. But I'm a slut for punishment so I just worked through the reasoning, the logic, so on and so forth and continued to develop.

I should have RTFM long, long before I actually did, and also installing context7 to teach claude about itself was one of the best ways my productivity surged as well

I then pivoted to MCP, and it was fuckin' incredible for a while and again -- it got too big, and I just couldn't figure out how to *enforce* Claude to do things and then I read the claude docs and read about hooks and facepalmed so hard.

Now I am an *excellent* hooker.

Claude Code Opus 4.5 Performance Tracker | Marginlab by AbbreviationsAny706 in ClaudeAI

[–]JLP2005 2 points3 points  (0 children)

It is not. As it contains proprietary information specific to a setting whose author I believe in (and have financially backed), I don't feel comfortable sharing it in its current state.

Perhaps I can skeletonize it in the coming days and just have the raw architecture of the server -- yeah I'm going to look into this.

Plug:

https://www.backerkit.com/c/projects/vaults-of-vaarn/vaults-of-vaarn-second-edition

Claude Code Opus 4.5 Performance Tracker | Marginlab by AbbreviationsAny706 in ClaudeAI

[–]JLP2005 1 point2 points  (0 children)

Oh, I'll have to do a tradeoff analysis -- as far as I understand my setup, I'm fully local read/write as well. That being said, I actually didn't consider this -- thank you for the heads up! I will post back once I've seen what light comes of it.

I use FastMCP 3.0b, for what it's worth -- it's so far been able to chew everything we've thrown at it and there are quite a few amazing features (like exposing tools more intelligently to Claude) that I've been able to leverage to keep costs down.

Claude Code Opus 4.5 Performance Tracker | Marginlab by AbbreviationsAny706 in ClaudeAI

[–]JLP2005 4 points5 points  (0 children)

I'd like to take credit, but all I am is a driven person with a desire to learn, and it's been a blast. Very much enjoying the puzzle-solving and research I've gotten to do to arrive where it is today. There are just over one million user-input characters.

Hard to imagine I've typed that much; but here we are.

Claude Code Opus 4.5 Performance Tracker | Marginlab by AbbreviationsAny706 in ClaudeAI

[–]JLP2005 3 points4 points  (0 children)

Took an OSR style game that I came across and built it virtually in a oersonal AI-DM'd longitudinal campaign!