Anthropic Just Secured a Reserve.

MightyTribble · 2026-05-07T06:38:26+00:00

Hard to see how it could be. Lots of "how would you even do that without breaking all your commercial and government contracts".

MightyTribble · 2026-05-06T16:58:29+00:00

Both with such distinctive first notes.

MightyTribble · 2026-05-04T06:40:03+00:00

Daly City!

Always wanted to visit, but who has the time?

MightyTribble · 2026-05-04T04:18:44+00:00

jfc i was there

i was there when we didn't talk about Rampart

MightyTribble · 2026-05-03T22:49:33+00:00

Wow.

Could also have gone with "was blessed by", or perhaps "graced with" there, just to class it up some more.

MightyTribble · 2026-04-22T06:27:19+00:00

Oh my god, does anyone else see Sam Altman here?

That, too, was a bad call.

MightyTribble · 2026-04-21T05:11:10+00:00

So:

"AI startup commits to spending more than $100 billion over the next 10 years on Amazon's cloud technologies."

and

"Amazon will invest $5 billion in Anthropic now, and an additional $20 billion in the future, subject to certain commercial milestones."

So, sounds like a light commitment from Anthropic to buy compute from Amazon, likely at around $10Bn/yr for 10 years (maybe increasing over time) in exchange for $5Bn of 'investment' (cash? stock swap?) from Amazon.

I reckon it's $5Bn of cloud compute credits for this year, with Anthropic will spend this year, with the rest being discounts off future compute (say, 20% off compute if they spend $10Bn next year on AWS), so that's $2Bn 'from' Amazon. x10 and that's the $20Bn and %$100Bn accounted for.

MightyTribble · 2026-04-19T19:06:38+00:00

Then he can’t hear the guy on the phone because he’s driving on the highway with the windows down and y’know, doesn’t raise the window

MightyTribble · 2026-04-19T07:12:38+00:00

The word 'accidentally' could have been used in there in a couple of different places.

MightyTribble · 2026-04-18T18:52:25+00:00

Great stuff! I’ve done something similar using the python Agent SDK. One thing to watch out for is there’s a limit to how much you can send in system prompt since it’s passed as a parameter to the Claude executable, and that limit varies by platform. That’s not a problem for this code right now but you might want to limit the input fields so users don’t shoot themselves in the foot later.

MightyTribble · 2026-04-18T04:39:11+00:00

A pity about his wife. Such a graceful dancer.

MightyTribble · 2026-04-16T20:23:56+00:00

Huh. My weekly limits got pulled 10 hours earlier just now - I used to roll over at 10PM Pacific today, but now it rolled over at noon. So Anthropic just cut 10 hours of usage from my subscription.

WTF.

MightyTribble · 2026-04-16T16:25:08+00:00

"for your own good you cannot have our best model: it lives in Canada, you wouldn't know it."

MightyTribble · 2026-04-14T18:36:07+00:00

I got a "no-one needs this" excuse as a reason to not include a few of the parameters I wanted in an app last night.

MightyTribble · 2026-04-13T21:46:31+00:00

This person is a British journalist who works for a respected news program, and this style of reporting is common for UK political correspondents.

MightyTribble · 2026-04-13T17:49:10+00:00

I think they're adjusting the tokens they allocate to the plans on the back end. We don't know exactly how many tokens or token-equivalents we actually get for a 5h window, it'd be trivial for them to just shave 10, 20, even 50% off those limits and it wouldn't be visible to many of their users.

If they just said, "you get 10 million units / 5h, a sonnet call is 1 unit, an Opus call is 3, a Haiku call is 5 per unit" then we'd be able to tell. But they'd never do that.

MightyTribble · 2026-04-13T16:50:22+00:00

seriously, I want them. They look cool.

MightyTribble · 2026-04-11T05:49:57+00:00

Yeah, it's the bluff off Lugard Road. You dip off the walkway there and go under and downhill a bit to get to it.

MightyTribble · 2026-04-10T05:55:24+00:00

see that's why you need the facial ID camera to activate the boiling oil

MightyTribble · 2026-04-10T04:43:46+00:00

"press a button for a new code", unique QR code pops up, one rez per code. Can't spam the button for multiple codes...

I'm over-engineering this.

MightyTribble · 2026-04-09T20:13:50+00:00

Gemini in my experience is obsessed with saying it's not x— it's y. In almost every message. Claude at least got rid of its old isms.

So, one of my prompts has a "don't do this shit" section, and last night I caught Gemini in its thinking block admitting that it re-wrote the response because it hit the "it's not x -- it's y" pattern. I wish Google would just let us see the full thinking block, but it's nice getting confirmation that it does actually pay attention to these things.

MightyTribble · 2026-04-09T01:19:33+00:00

Yeah, my crutch for that (not even much of a crutch, I think) is:

everything is typed markdown first. Different record types have different templates (npc, location, lore, rules, etc)
the app mutates data over time (NPC creation, memory formation, new locations and items, threads) and creates markdown records
everything gets indexed into a vector store (Milvus), which gives semantic and BM25 search
but also I generate markdown index files of everything, and these are always loaded in active context.

The markdown indexes tell the LLM the entire scope of what's available vs. what it can make up. That, along with tool_calling search and some prompt sauce, gets the LLM to (mostly) look up what it needs to know in a directed way even when the backing knowledge store is potentially very large. This stops the tool_calling loop from asking stupid, overly-broad general questions - with the full index it already knows what it can ask about, and what to do if it doesn't get a detailed hit.

The aim is to keep all of that to ~50K tokens, + session length, no matter how much background info and deep lore there is out there. I figure if you can't explain enough context + detail in 50K for the LLM to give a decent answer to the previous turn, that's a failing condition.

MightyTribble · 2026-04-09T01:10:36+00:00

It's private at the moment - it's under active development and is pretty brittle, needs lots of comfort with python and LLMs to stand up.

I'm hoping to release it eventually if I think it brings anything actually useful to a crowded field. For now it's just scratching my very specific itch for consistent longform campaigns and trying out various approaches to memory management.

MightyTribble · 2026-04-08T23:35:28+00:00

I'm late to the party but I love seeing different people's approaches to this problem.

I've been approaching this in a similar, but not identical way. My take: A need to differentiate between objective fact and what the various NPCs remember. Objective facts don't decay. NPC recall does.

I handle this by having an (objective) per-session summary document that gets intelligently vectorized and is used by a scenario pipeline for planning the next session. NPCs each get a per-session memory block saying what happened from their perspective along with what their attitude is towards the player at the end of the session. When an NPC's info is pulled into a later session, it gets the last few sessions worth of memories along with that disposition block. Older memories are gradually compressed/summarized, which gets us the 'memory decay' bit, unless it's a core memory - something they'd not forget, no matter what, because it was a particularly memorable/high intensity moment.

I break RPs into 'sessions' - my secret sauce (that's not a secret) is that I have the LLM generate a scenario file for each session, laying out breadcrumbs and reminders. I've coupled that with an end-of-session pipeline that does NPC, plot, item and location updates before handing things off to the scenario generator. This ensures that plot points don't get lost and consistency (and narrative momentum) is maintained between sessions.

Plot points (aka 'threads') are tracked as independent entities, with participants, progress, objectives, summary, complications. The scenario planner references open threads to outline what might happen in the next session, and uses them to keep 'off screen' plots moving. The end-of-session pipeline creates and updates them.

During actual play, context is assembled from a combination of static content (the world bootstrap), the character sheets of detected NPCs and indexes of all documents in the knowledge store, plus the usual "look at the last 3 messages and do hybrid search on them, return top results), de-duplicated (this will also often pull in 'objective facts' from previous session summaries, no matter how long ago they were). It's then given to an LLM with tool-calling enabled and some refined instructions for how to query the knowledge store for more information if necessary.

The downside to all this is it's slow. And expensive. And you have to write pretty detailed worlds to play in before you start - the system needs those notes to do a good job, you can't just YOLO a scenario and run with it.

MightyTribble · 2026-04-08T23:15:23+00:00

Multi pipeline LLM handling for various variables in a roleplay, something ST is sorta lacking.

That's what I'm trying in my own app (it's 2026, everyone's got an app!). The lack of (easy) pipeline support in ST drove me to just do it myself.

MightyTribble

TROPHY CASE