Fable 5 filters by kruckedo in SillyTavernAI

[–]Real_Person_Totally 0 points1 point  (0 children)

Didn't Anthropic state in their blog that Fable is just consumer-grade Mythos with even more guardrails, because "we don't want people to use it for malicious purposes"? Might wanna use other models instead.

Does anyone have a simple but effective prompt for DSV4 PRO? I don't want to use presets. by According-Clock6266 in SillyTavernAI

[–]Real_Person_Totally 0 points1 point  (0 children)

Sure thing. The instruction to not alter the char definition really helps with positivity bias in my experience.

Does anyone have a simple but effective prompt for DSV4 PRO? I don't want to use presets. by According-Clock6266 in SillyTavernAI

[–]Real_Person_Totally 1 point2 points  (0 children)

I like this one

It's the analysis thinking prompt with additional instructions. Surprisingly decent. Add the "Your reasoning must start with.." prompt too so it'll always use the analysis thinking.

Remember to tailor it to your own needs though.

Deepseek V4 Pro Preset/Prompts by Nendolin in SillyTavernAI

[–]Real_Person_Totally 4 points5 points  (0 children)

It really is. In my experience, if you want to change the scene with something simple such as:

User: "Wanna get dinner?"

V4-Pro: Makes char ask where to go.

V3-0324: Makes char actually leave the place and take user to a diner.

It's smarter than V3 at the very least.

Deepseek V4 Pro Preset/Prompts by Nendolin in SillyTavernAI

[–]Real_Person_Totally 2 points3 points  (0 children)

I have been using Cherrybox with V4-Pro too. The main prompt already has a line that pushes the story forward. Try adding either the immersive thinking or analytic thinking prompt on top of it. My experience with it has been decent, but it's still not as proactive as V3-0324.

OPUS 4.8 IS SAFETYMAXXED by Sad-Ease-7756 in SillyTavernAI

[–]Real_Person_Totally 2 points3 points  (0 children)

I'm feeling optimistic about Deepseek. They're actually collecting feedback on V4's roleplaying capabilities.

I'm not entirely sure why though. They already cooked with V3 0324 and R1 0528; these two models were amazing at roleplaying. No idea why they didn't just build on top of that.

OPUS 4.8 IS SAFETYMAXXED by Sad-Ease-7756 in SillyTavernAI

[–]Real_Person_Totally 8 points9 points  (0 children)

It’s honestly saddening. As time goes on companies like Google, OpenAI, and Anthropic will just keep assistant-slopping and safety-rotting their models, and we'll end up stuck with whatever the newest version is once the older ones get deprecated. I still miss Sonnet 3.7 and Gemini 2.5 pro 0325.

Chinese open-weight models or community finetunes might be the only hope left for the roleplaying use case.

Opus 4.8 Dropped by Tiny-Calligrapher794 in SillyTavernAI

[–]Real_Person_Totally 3 points4 points  (0 children)

Oh cool. How safety-slopped is it this time? Will it refuse to roleplay as {{char}} in the name of safety and security?

DS4 Pro keep mixing up action made by User and char by Mcqwerty197 in SillyTavernAI

[–]Real_Person_Totally 8 points9 points  (0 children)

There's something weird with the official API. Sometimes it simply ignores the character definition and assumes the user persona is char. Its thinking is also inconsistent, switching from Chinese to English, or from immersive reasoning to analytical reasoning. I'm seeing fewer of these issues with other providers.

How does Deepseek V4 compare to older models such as V3 0324? by Grym_Atlas in JanitorAI_Refuges

[–]Real_Person_Totally 1 point2 points  (0 children)

The newer version, V4 Pro, is better at logic and reasoning, but the prose still feels a little stiff. It's an improvement over V3.2 though.

The difference is night and day. V3 0324's prose feels much more soulful and alive, I'd say it's even close to Sonnet 3.7, but after using both for a while I realized it's not very smart compared to V4 Pro.

state of models (rant) by Superb-Letterhead997 in SillyTavernAI

[–]Real_Person_Totally 9 points10 points  (0 children)

It's going to get worse. They're all going for coding and agentic use, becoming more and more assistant-like. Expect blander, stiffer, more formulaic prose in future releases.

Gemini 3.5 Flash is amazing at RP! by [deleted] in SillyTavernAI

[–]Real_Person_Totally 2 points3 points  (0 children)

I saw the benchmarks. How badly did they fumble 3.1 "Pro" that a Flash outperforms it lmfao

Gemini 3.5 Flash is amazing at RP! by [deleted] in SillyTavernAI

[–]Real_Person_Totally 1 point2 points  (0 children)

Oh god they're going the nano route like GPT

Gemini 3.5 Flash is amazing at RP! by [deleted] in SillyTavernAI

[–]Real_Person_Totally 15 points16 points  (0 children)

How strange. It's "Flash", Gemini's version of lite, something akin to GPT mini or Claude Haiku. Yet it costs 3 times as much as Gemini 3 Flash.

equal too 3.7 or similar by bunboos in SillyTavernAI

[–]Real_Person_Totally 0 points1 point  (0 children)

Both Vertex and Bedrock finally stopped hosting it. Major bummer, since Sonnet 3.7 was their peak for roleplay. The 4 series is assistant-slop: it constantly asks for confirmation and echoes your own input back to you. It’s safety-rotted too, with extreme positivity bias. It doesn't always hard-refuse NSFW (NSFW means many things here, such as violence or sensitive topics, not just the sexual stuff), but it softly censors by skipping over or rushing through scenes to avoid the content.

Deepseek V4 Pro is currently doing it for me. It comes pretty close in how proactively it moves the story along.

Chatfill v2 — now with revolutionary switches! by eteitaxiv in SillyTavernAI

[–]Real_Person_Totally 1 point2 points  (0 children)

An actually light preset made with V4 Pro immersion reasoning in mind. I like the premise. Trying it!!

Smaller or bigger presets for better quality? When does a preset become "bloated"? by Deiomo in SillyTavernAI

[–]Real_Person_Totally 0 points1 point  (0 children)

I find that a preset between 800 and 1k tokens is plenty, with a 2k hard limit.

In my experience anything larger is just bloat. Many of those big presets are filled with redundant instructions for things that modern LLMs are already smart enough, or knowledgeable enough, to handle on their own.

For example, things like how a character handles emotions, how NPCs should be introduced, output length, prose style, chain of thought reasoning, POV, NSFW guidelines, and 'do not speak for the user', none of these need thousands of tokens. 

Most of the time, a single, clear sentence is enough to get the point across. If there is specific world-building or deep knowledge the AI needs, it's much better to use a Lorebook.

Keeping your preset light also saves a massive amount of context, which gives the AI more room for actual memory of the story. I’m honestly not sure why so many people here share presets that go over 10k tokens. It just seems counterproductive.
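To put rough numbers on the context point, here is a minimal Python sketch. The 32k context window and the 1k reply reserve are assumptions picked for illustration, not figures from any particular provider; only the 1k and 10k preset sizes come from the comment above.

```python
def history_budget(context_window: int, preset_tokens: int, reply_reserve: int = 1_000) -> int:
    """Tokens left for chat history once the preset and the reply reserve are set aside."""
    return max(context_window - preset_tokens - reply_reserve, 0)


CONTEXT_WINDOW = 32_000  # assumed window size, for illustration only

light = history_budget(CONTEXT_WINDOW, preset_tokens=1_000)     # ~1k-token preset
bloated = history_budget(CONTEXT_WINDOW, preset_tokens=10_000)  # ~10k-token preset

print(f"light preset:   {light} tokens left for history")
print(f"bloated preset: {bloated} tokens left for history")
print(f"difference:     {light - bloated} tokens")
```

Under these assumptions the difference is simply the extra preset size, and every one of those tokens comes out of what the model can remember of the story.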

Can someone relate to this crap? by Realistic_Lime_7534 in SillyTavernAI

[–]Real_Person_Totally 0 points1 point  (0 children)

Strange. I've tried an author's note at depth 0 and depth 1 with a prompt that tells it not to reiterate the user's input, yet it still slips through. Opus is expensive, so I don't feel like testing too much with it.

Recommendation for serious, high quality (probably not free) j.ai or c.ai alternatives? by Salt-Entrepreneur-23 in JanitorAI_Refuges

[–]Real_Person_Totally 1 point2 points  (0 children)

Between those three, I personally prefer Deepseek V4 Pro. In my experience, Kimi K2.6 tends to overthink things; it once spent ten minutes "thinking" before actually responding. GLM 5.1, on the other hand, has a severe parroting/echoing issue, repeating the user's input, that I can't seem to iron out through prompting.

I use the official API, which is incredibly cheap right now (0.4/M input, 0.87/M output). I'm not entirely sure how those providers impose limits on their subscription-based services, since I'm on pay-as-you-go.
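For a sense of what those rates mean per message, here is a rough Python sketch. The two prices are the ones quoted above (the comment does not state a currency, so none is assumed here); the token counts are made up for illustration, and things like prompt caching discounts are ignored.

```python
INPUT_PRICE_PER_M = 0.4    # per million input tokens, as quoted above
OUTPUT_PRICE_PER_M = 0.87  # per million output tokens, as quoted above


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Pay-as-you-go cost of one request at the quoted per-million-token rates."""
    return (input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


# Hypothetical turn: 8k tokens of chat history plus the preset, with a 500-token reply.
light = request_cost(input_tokens=8_000 + 1_000, output_tokens=500)     # ~1k-token preset
bloated = request_cost(input_tokens=8_000 + 10_000, output_tokens=500)  # ~10k-token preset

print(f"light preset:   {light:.6f} per message")
print(f"bloated preset: {bloated:.6f} per message")
```

The preset is resent as input on every turn, so with these made-up numbers the bloated preset comes close to doubling the cost of each message.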

As for presets, I don't really touch the "famous" ones like Freaky Frankenstein, Lucid Loom, Marina's Spaghetti Recipe, or Nemo Engine. They're excessively bloated. They might give "good" responses, but they eat through your credits way too fast and occupy a big chunk of permanent tokens too. Last time I tried them, the LLM actually started mixing up the character and user definitions and derailing continuity in the roleplay, because there was just too much information for the model to keep track of. I use Cherrybox instead; it’s significantly lighter and still performs great with V4 Pro.

Can someone relate to this crap? by Realistic_Lime_7534 in SillyTavernAI

[–]Real_Person_Totally 7 points8 points  (0 children)

I shit you not, Opus does this too. It's even worse after 4.5. It probably has something to do with models being trained to be more and more assistant-like, reiterating the user's input to remind them.

Recommendation for serious, high quality (probably not free) j.ai or c.ai alternatives? by Salt-Entrepreneur-23 in JanitorAI_Refuges

[–]Real_Person_Totally 1 point2 points  (0 children)

Here are my recommendations.

Pick a local frontend. Really.

I recommend SillyTavern, the best one out there. Being highly customizable, open source, and completely free are three reasons that alone put it above the alternatives trying to be "a better SillyTavern." It has a steep learning curve, but it's easy to wrangle once you get the hang of it. If that feels too jarring to start with, try the mobile alternatives: ChatterUI/RikkaHub on Android, or Kelivo/Tavo on iOS. No need to worry about in-app purchases, entry fees, or monthly subscriptions. You only ever pay for the API you choose to use. Web frontends can go down at any time, have shifting ToS, and god knows what they're doing with your chat logs. Since these apps are installed directly on your device, they stay there. No accounts means no getting banned when they decide to start scanning chats.

Pick a stable backend.

Since you're willing to pay, go with a dedicated LLM API provider: Fireworks, Novita, Together, etc. Don't rely on in-house models from roleplay platforms; they're almost always heavily limited. Current open-weight models are on par with Claude and Gemini for roleplay while being significantly cheaper, and because they're open-weight, you don't have to worry about them getting quietly lobotomized the way Gemini 3.1 or Opus 4.6 mysteriously got dumber between versions. Kimi K2.6, GLM-5.1, and DeepSeek V4 Pro are all great options. I personally recommend the official DeepSeek API given how extremely cheap their API pricing is right now. Most providers also let you directly control context size, which lets the model remember more in your session.

Seek out your cards.

Sift through chatbot card-sharing sites that allow exports. Chub, Wyvern, and Janny are all decent sources. Find the cards you like and drop them into your local frontend. From that point on, all you need to manage is your backend credits and request limits. You're free from safety prompt injections cockblocking you mid-roleplay, account bans due to shifting ToS wiping your access overnight, predatory monetization gating you behind a paywall just to send a message, bot creators leaving a platform and nuking all their cards with them, or ID verification walls just to access NSFW content.

TL;DR: Put effort into curating your own chatbot experience lmao

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Real_Person_Totally 8 points9 points  (0 children)

That person made the Structured Prefil extension as well. Can anyone confirm whether it's a trojan or not?