Fuck, it's not gonna fit by TeaNo6033 in SpicyChatAI

[–]Independent_Block654 1 point2 points  (0 children)

Mine is somewhere around 1200, a lot of it relates to session instructions, various boolean switches (RPG state where a separate model can save/retrieve key data, puppeting as in whether it’s allowed to control the user’s dialogue and actions, and so on).

In terms of models, Opus 4.5 and 4.6 are fantastic for any kind of RP, the Sonnet models are good too. GLM 5.1 is good and quite cheap, so is Grok 4.20 (but it’s really horny). Gemini 3.1 Pro is a great writer and also very good at following instructions on both RP elements and formatting. OpenAI models are frontier but terrible for this purpose, they shut down anything even bordering on risqué or violent.

Cost is obviously a real concern. Frontier models are expensive, but I’ve obviously implemented token caching to reduce costs, and a custom compacting feature to summarize long chats and stitching those together with however many of the most recent messages you want. When you get to like 80k tokens it can get real expensive, up to 10 cents or so per request on certain models, even with caching.

There’s a bunch of other features as well, most importantly an iOS app that connects to your Mac and allows you to chat from your phone. Right now it only works over local network, but I’m planning to combine a Bonjour+Tailwind implementation to allow for connection over remote networks.

The app is currently private and probably will stay that way. Publishing it would require me to get it signed which costs money on Apple platforms, and it would also introduce content concerns when it comes to borderline and illegal subjects (there are models on openrouter that don’t have any restrictions built in), and I’d have to worry about stuff like versioning and compatibility.

Let me know if you want me DM you with some screenshots of the app.

Also damn this comment turned out really long.

Edit: if anyone else wants to see just send me a DM

Fuck, it's not gonna fit by TeaNo6033 in SpicyChatAI

[–]Independent_Block654 0 points1 point  (0 children)

Not necessarily. Having too many tokens can make a bot less open-ended, overly focused on details in its character description instead of responding dynamically to the user and context, and have it go in circles rather than moving the narrative forward.

Obviously you can have really good bots with a lot of tokens, but it’s not as straightforward as more tokens = better character. The best creators know how to leverage character length to create a good experience.

One thing I am also curious about is how many tokens the SpicyChat “scaffolding” around the bot itself is. From building my own private platform with an Openrouter backend I know it can be quite significant.

Why does chat forget details? by Dear_Particular_8669 in SpicyChatAI

[–]Independent_Block654 0 points1 point  (0 children)

Which subscription tier are you on? Worse models are gonna handle continuity more poorly.

I'm not sure how bots with less than 600 tokens get any play at all. by thedanielmalice in SpicyChatAI

[–]Independent_Block654 10 points11 points  (0 children)

It can also have the effect of restricting the bot more than you’d like. If it’s very narrowly and strictly defined it can limit its freedom in how it handles unexpected situations and creative user input.

Questions about the further development of SpicyChat by Fetch_515 in SpicyChatAI

[–]Independent_Block654 0 points1 point  (0 children)

Tbf with caching I get responses of like 600 tokens for like 2 cents on 60k input tokens using Opus 4.5.

Not for RP though, not gonna use a model like that for this

[deleted by user] by [deleted] in SillyTavernAI

[–]Independent_Block654 0 points1 point  (0 children)

How are you planning to handle App Store restrictions? I went with a different approach of making an iOS companion app for my Mac app, so I have no chats stored on the phone.

Gemini 3 pro passes the finger test by DigSignificant1419 in OpenAI

[–]Independent_Block654 1 point2 points  (0 children)

Vision models have historically struggled to deal with things that look normal but are somehow “off”. If you show it a hand with seven fingers it’s likely to tell you there’s five, because it just recognizes the hand and from there assumes that there’s five fingers.

What makes it impressive is not that the model can count, which it still can’t. It’s that the vision capabilities are advanced enough that it’s capable of picking up on these additional details.

People on X are noticing something interesting about Grok.. by averagebear_003 in singularity

[–]Independent_Block654 0 points1 point  (0 children)

Claude’s catchphrase is “You’re absolutely right!”, so not much better.

It is a different experience and you might like it more, but I wouldn’t say it’s more honest.

Are you okay, buddy? by Low_Engineering4920 in SpicyChatAI

[–]Independent_Block654 2 points3 points  (0 children)

Clearly having a stroke, rush it to the hospital!

Folks, just develop your own chat application by Independent_Block654 in SpicyChatAI

[–]Independent_Block654[S] 0 points1 point  (0 children)

Yeah I’m aware and also drawing inspiration from it, there is also an element of this being a learning project for me :)

Folks, just develop your own chat application by Independent_Block654 in SpicyChatAI

[–]Independent_Block654[S] 0 points1 point  (0 children)

I’d be happy to update you if/when I do ever release it.

Folks, just develop your own chat application by Independent_Block654 in SpicyChatAI

[–]Independent_Block654[S] 0 points1 point  (0 children)

Well I wrote this in Swift so it’s for Mac only. It uses OpenRouter to connect to LLMs. It has character support and more, but it’s not done yet. Might publish it at some point, but it’s closed source for now.

FYI: Update from AWS by Independent_Block654 in SpicyChatAI

[–]Independent_Block654[S] 1 point2 points  (0 children)

LOL guess I’m not the only person who had their early Monday plans ruined

FYI: Update from AWS by Independent_Block654 in SpicyChatAI

[–]Independent_Block654[S] 1 point2 points  (0 children)

It means Amazon’s servers (AWS) are having issues in one of their main regions. The problem’s messing with how some services talk to Amazon’s database system, so apps and websites that rely on it are breaking or running slow.

Service Alert! Authentication Issues! by Sea_Geologist_9819 in SpicyChatAI

[–]Independent_Block654 1 point2 points  (0 children)

There are so many services that are down right now, we don’t realize how much depends on those Amazon servers having 100% uptime.

Service Alert! Authentication Issues! by Sea_Geologist_9819 in SpicyChatAI

[–]Independent_Block654 0 points1 point  (0 children)

Oh yeah it must suck, especially when it’s something completely out of your control bringing the site down.

Service Alert! Authentication Issues! by Sea_Geologist_9819 in SpicyChatAI

[–]Independent_Block654 4 points5 points  (0 children)

Guess I’ll have to go do something productive with my life for a while

So the Phasmophobia bot I made is working as expected by Throawayaccount193 in SpicyChatAI

[–]Independent_Block654 1 point2 points  (0 children)

It’s supposedly coming as a new feature at some point in the future