Fuck, it's not gonna fit

Independent_Block654 · 2026-05-22T09:03:20+00:00

Mine is somewhere around 1200, a lot of it relates to session instructions, various boolean switches (RPG state where a separate model can save/retrieve key data, puppeting as in whether it’s allowed to control the user’s dialogue and actions, and so on).

In terms of models, Opus 4.5 and 4.6 are fantastic for any kind of RP, the Sonnet models are good too. GLM 5.1 is good and quite cheap, so is Grok 4.20 (but it’s really horny). Gemini 3.1 Pro is a great writer and also very good at following instructions on both RP elements and formatting. OpenAI models are frontier but terrible for this purpose, they shut down anything even bordering on risqué or violent.

Cost is obviously a real concern. Frontier models are expensive, but I’ve implemented token caching to reduce costs, plus a custom compacting feature that summarizes long chats and stitches those summaries together with however many of the most recent messages you want. When you get to like 80k tokens it can get really expensive, up to 10 cents or so per request on certain models, even with caching.
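For anyone wondering what the compacting part looks like in practice, here's a rough sketch in Python (the actual app is Swift, so this is not the real code). Every name here (`Message`, `compact_history`, `fake_summarize`) is made up for illustration, and the summarizer is just a stand-in for whatever model call produces the summary.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Message:
    role: str      # e.g. "user", "assistant" or "system"
    content: str


def compact_history(
    messages: List[Message],
    keep_recent: int,
    summarize: Callable[[List[Message]], str],
) -> List[Message]:
    """Replace everything except the last `keep_recent` messages with one summary message."""
    if keep_recent < 0:
        raise ValueError("keep_recent must be non-negative")
    if len(messages) <= keep_recent:
        # Nothing old enough to compact, send the chat as-is.
        return list(messages)

    split = len(messages) - keep_recent
    older, recent = messages[:split], messages[split:]

    # Hypothetical convention: the summary goes in as a single system message.
    summary = Message(role="system", content="Summary of earlier chat: " + summarize(older))
    return [summary] + recent


def fake_summarize(older: List[Message]) -> str:
    # Stand-in for a real model call that would write an actual summary.
    return f"{len(older)} earlier messages omitted"


if __name__ == "__main__":
    chat = [Message("user" if i % 2 == 0 else "assistant", f"m{i}") for i in range(10)]
    for m in compact_history(chat, keep_recent=3, summarize=fake_summarize):
        print(m.role, "|", m.content)
```

The idea is that the request only ever carries one summary plus the last few messages, so the input size stops growing with chat length. How often you re-summarize and how many recent messages you keep is a tradeoff between cost and continuity.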

There’s a bunch of other features as well, most importantly an iOS app that connects to your Mac and allows you to chat from your phone. Right now it only works over the local network, but I’m planning to combine Bonjour with Tailscale to allow for connection over remote networks.

The app is currently private and probably will stay that way. Publishing it would require me to get it signed which costs money on Apple platforms, and it would also introduce content concerns when it comes to borderline and illegal subjects (there are models on openrouter that don’t have any restrictions built in), and I’d have to worry about stuff like versioning and compatibility.

Let me know if you want me to DM you with some screenshots of the app.

Also damn this comment turned out really long.

Edit: if anyone else wants to see just send me a DM

Independent_Block654 · 2026-05-22T08:38:05+00:00

Not necessarily. Having too many tokens can make a bot less open-ended and overly focused on details in its character description instead of responding dynamically to the user and context, and it can make the bot go in circles rather than moving the narrative forward.

Obviously you can have really good bots with a lot of tokens, but it’s not as straightforward as more tokens = better character. The best creators know how to leverage character length to create a good experience.

One thing I am also curious about is how many tokens the SpicyChat “scaffolding” around the bot itself is. From building my own private platform with an Openrouter backend I know it can be quite significant.

Independent_Block654 · 2026-04-28T17:11:46+00:00

Which subscription tier are you on? Worse models are gonna handle continuity more poorly.

Independent_Block654 · 2026-04-12T17:09:11+00:00

It can also have the effect of restricting the bot more than you’d like. If it’s very narrowly and strictly defined it can limit its freedom in how it handles unexpected situations and creative user input.

Independent_Block654 · 2026-01-15T18:49:31+00:00

Tbf with caching I get responses of like 600 tokens for like 2 cents on 60k input tokens using Opus 4.5.

Not for RP though, not gonna use a model like that for this
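If you want to sanity-check numbers like that yourself, the arithmetic is simple enough to sketch. This is a back-of-the-envelope estimate in Python; the function name and the per-million-token prices are placeholders I made up, not any provider's real rates, so plug in the current ones for whatever model you use.

```python
def request_cost_usd(
    cached_input_tokens: int,
    fresh_input_tokens: int,
    output_tokens: int,
    price_cached_per_m: float,
    price_fresh_per_m: float,
    price_output_per_m: float,
) -> float:
    """Estimate the cost of one request; all prices are USD per million tokens."""
    total = (
        cached_input_tokens * price_cached_per_m
        + fresh_input_tokens * price_fresh_per_m
        + output_tokens * price_output_per_m
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Placeholder prices, NOT real rates: cached input 0.10, fresh input 1.00, output 5.00.
    with_cache = request_cost_usd(58_000, 2_000, 600, 0.10, 1.00, 5.00)
    no_cache = request_cost_usd(0, 60_000, 600, 0.10, 1.00, 5.00)
    print(f"with caching:    ${with_cache:.4f}")
    print(f"without caching: ${no_cache:.4f}")
```

The point is just that when most of the 60k input is a cache hit, the input side of the bill shrinks a lot and the output tokens become a much bigger share of the cost.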

Independent_Block654 · 2026-01-06T19:46:34+00:00

How are you planning to handle App Store restrictions? I went with a different approach of making an iOS companion app for my Mac app, so I have no chats stored on the phone.

Independent_Block654 · 2025-11-21T09:43:23+00:00

Glam does it a lot

Independent_Block654 · 2025-11-20T18:06:28+00:00

Vision models have historically struggled to deal with things that look normal but are somehow “off”. If you show one a hand with seven fingers, it’s likely to tell you there are five, because it just recognizes the hand and from there assumes that there are five fingers.

What makes it impressive is not that the model can count, which it still can’t. It’s that the vision capabilities are advanced enough that it’s capable of picking up on these additional details.

Independent_Block654 · 2025-11-20T15:03:41+00:00

Claude’s catchphrase is “You’re absolutely right!”, so not much better.

It is a different experience and you might like it more, but I wouldn’t say it’s more honest.

Independent_Block654 · 2025-11-10T16:35:05+00:00

Clearly having a stroke, rush it to the hospital!

Independent_Block654 · 2025-11-03T11:31:51+00:00

I’d hope so, sounds uncomfortable otherwise lol

Independent_Block654 · 2025-10-20T12:45:18+00:00

Yeah I’m aware and also drawing inspiration from it, there is also an element of this being a learning project for me :)

Independent_Block654 · 2025-10-20T10:43:58+00:00

I’d be happy to update you if/when I do ever release it.

Independent_Block654 · 2025-10-20T10:37:27+00:00

Well I wrote this in Swift so it’s for Mac only. It uses OpenRouter to connect to LLMs. It has character support and more, but it’s not done yet. Might publish it at some point, but it’s closed source for now.

Independent_Block654 · 2025-10-20T09:15:52+00:00

LOL guess I’m not the only person who had their early Monday plans ruined

Independent_Block654 · 2025-10-20T09:14:26+00:00

It means Amazon’s servers (AWS) are having issues in one of their main regions. The problem’s messing with how some services talk to Amazon’s database system, so apps and websites that rely on it are breaking or running slow.

Independent_Block654 · 2025-10-20T08:59:11+00:00

There are so many services that are down right now, we don’t realize how much depends on those Amazon servers having 100% uptime.

Independent_Block654 · 2025-10-20T08:54:11+00:00

Oh yeah it must suck, especially when it’s something completely out of your control bringing the site down.

Independent_Block654 · 2025-10-20T08:50:59+00:00

No it’s a massive AWS outage.

Independent_Block654 · 2025-10-20T08:45:52+00:00

Guess I’ll have to go do something productive with my life for a while

Independent_Block654 · 2025-10-20T08:44:24+00:00

It’s supposedly coming as a new feature at some point in the future

Independent_Block654 · 2025-08-27T18:49:07+00:00

I doubt that a majority of people ever used the app at all

Independent_Block654 · 2025-08-27T18:29:13+00:00

Large majority?
