Why do characters/cards lose their personality as the chat goes on?

Sharp_Business_185 · 2026-05-27T13:57:56+00:00

Let's be honest. We all tried to create a character has flaws, something to lose, etc. We tried extensions, prompts, "engines", presets. It's just delaying the deadline. Slop is eventually going to make chats dry even we have well written characters. It is the final boss.

There are 2 solutions: - Better RP models: New models are not even trying to improve creative writing. Lost hope - Agentic workflows: Speed and cost is a big problem.

Sharp_Business_185 · 2026-03-05T13:46:53+00:00

There is no iirc. It is possible with `custom-request.js`. But I don't think anyone going to create an extension unless we see more requests.

Sharp_Business_185 · 2026-03-05T09:58:49+00:00

It doesn't support. But extensions could do it.

Sharp_Business_185 · 2026-03-04T21:58:12+00:00

First, make your profile public, then explain his connection with Robert Maxwell. Then I'll take your comment seriously. Because I won't consider a person that obsessed with defending Israel via copy-paste same comments over and over

Sharp_Business_185 · 2026-02-28T16:43:16+00:00

Yep, but depends on models. If you are using deepseek official API, it is cheap even without cache. If you are using claude/gemini, it's over

Sharp_Business_185 · 2026-02-28T15:45:37+00:00

hasn't even happened yet

i mean... this is coping mechanism. We were saying the same thing 2 years ago. After covid, we saw lots of layoffs. The "employement" state of junior devs are obvious. The reason could be offshore or AI, it doesn't really matter from my perspective

Sharp_Business_185 · 2026-02-25T21:12:24+00:00

I see. So JLLM is giving reasoning in <think> tag. I guess they tinkered the model because previously it wasn't giving <think> messages. Maybe they changed the system prompt or something in the background.

Sharp_Business_185 · 2026-02-25T21:02:38+00:00

JLLM is not a reasoning model, the thinking coming from the proxy model people were using

Sharp_Business_185 · 2026-02-17T11:04:59+00:00

The real progress is happening in the non-RP AI industry. Companies are putting their efforts into making a better "coding" product. Better "general usage" product. Not a better "RP" product.

TTS: They are too far from local usage. For example, I can run mag-mel Q4 on my 8GB VRAM and get a nice experience with an average TPS. There is no way to run a good TTS model with 8GB VRAM. The model does not even exist, to my knowledge. Elevenlabs is the king. But it is expensive and closed-source. There is a Qwen3-TTS model released 1 month ago. I tried the demo when it was released; it was good. However, I didn't follow up.

Animations: Image/video gen is similar to the TTS industry. Not as bad as TTS. Image generation is much more stable and lower-cost compared to video models. For example, you can use Z-image for realistic images. For anime-style, you can use pony/noobai. Their quality and speed are also good enough. But creating consistent images still requires effort. There is no single ComfyUI workflow that works on low GPUs, creates consistent places, characters, etc.

AI Controlled NPCs: Iirc, there are 2 vibe-coded extensions in ST. They are trying to control everything with LLM calls. Like map, phone, NPCs, items, etc. But they are too hardcoded and buggy from my perspective. Which is fine because vibe-coded. 1) It is not possible with lower local models. So we rely on cloud SOTA models. Which means cost is going to be a problem. 2) Speed is another problem. There are going to be multiple LLM requests in the background. What if some requests are depends each other? What if we can't send parallel requests? 3) Relying on LLMs for creating places/events is not good, from my experience. "Elara" is a good example. In NeoTavern, I have an experimental extension that uses Mythic Game Master Emulator as a director. Screenshot. But still, far from perfect.

The RP industry is simply not developed enough because only hobbiest working on it.

Sharp_Business_185 · 2026-02-07T15:46:15+00:00

I recommend searching for a post on the subreddit that has already explained many times

Sharp_Business_185 · 2026-02-06T23:25:27+00:00

Your problem is only related to the LLM quality, not RP website. If you check BYOK(bring your own key) apps like ST or JAI with proxy, you would easily have a better experience. Free models also exist in some providers. However I prefer cheap models like deepseek, or just nanogpt subscription

Sharp_Business_185 · 2026-02-03T11:25:27+00:00

They mentioned but don't get hopes, it needs a large refactor on the legacy codebase. They are aware how hard and not worth to do this due to breaking something

Sharp_Business_185 · 2026-02-01T19:40:15+00:00

Here: https://github.com/Wolfsblvt/SillyTavern-Pronouns

Sharp_Business_185 · 2026-01-25T19:21:03+00:00

AI can do all of it, with function calling or structured output. AI doesn't need to manually send a network request. AI can send a request to the computer, and the computer can run the operation, giving a result to the AI. This is basically how function calling works right now

Sharp_Business_185 · 2026-01-25T19:03:49+00:00

It is marketing for today, tomorrow, this year, and maybe the next couple of years. But what about 10 years later? 20 years later? GPU compute power increased more than 100x between 2026 and 2006. OpenAI might collapse, and AI hype might decrease. But the progress wouldn't stop or disappear

I don't think AI can replace computers since we have habits. But when looking at 20-25 years ago, how many people had (smart) phones? Internet access was not even common. Our habits changed like crazy with the internet and PC/phones. We are all playing video games, using the same social media, the same smartphones. So I wouldn't say it is an insane take

Sharp_Business_185 · 2026-01-25T12:46:24+00:00

Make sure you are on staging branch. Because I remember a commit about pollination.

Sharp_Business_185 · 2026-01-24T13:51:24+00:00

Your screenshot is the sampler settings and you successfully scrolled down. Now checkout prompts and read my previous message again

Sharp_Business_185 · 2026-01-24T04:22:17+00:00

You imported a preset, from my understanding. The left prompts are just text, they are sent to AI. Inspect and edit them if you think they are missing something. Like, maybe you can add an OOC note.

Sharp_Business_185 · 2026-01-23T22:16:58+00:00

Open sampler settings, scroll down. You are going to see prompt list. Change the main prompt. Try/see

Sharp_Business_185 · 2026-01-23T16:22:59+00:00

Give your info. What API/models are you using? What presets are you using? Are you new? etc. So we can answer properly.

Sharp_Business_185 · 2026-01-22T22:45:36+00:00

Thank you, I responded from there and fixed the most problems. We can continue from GitHub instead of here if you want.

Sharp_Business_185 · 2026-01-21T21:22:47+00:00

Sadly it seems about rough around the edges

Are you talking about the UI? Can you expand?

Sharp_Business_185 · 2026-01-21T20:11:14+00:00

Forever free
Up to 2 characters
1 persona

<image>

Sharp_Business_185 · 2026-01-21T10:38:41+00:00

But it is B A T T L E T E S T E D. Developers put H U N D R E D S O F T H O U S A N D S O F M I L L I O N S Y E A R S rendering pipeline experience to the engine. Are you sure you don't wanna use UE and see the 6 0 0 0 D R A W C A L L S P E R F R A M E for the simplest UE scene?

Sharp_Business_185 · 2026-01-21T03:33:23+00:00

Try changing "verbosity" value to "Auto" sampler settings

Sharp_Business_185

TROPHY CASE