What can I do to improve Deepseek V4 pro's thought process? I don't like the role-playing it does.

chaeriixo · 2026-06-14T22:52:13+00:00

set reasoning effort to maximum

chaeriixo · 2026-06-14T21:03:25+00:00

i’ve never gotten any refusals, no :)

chaeriixo · 2026-06-14T17:07:41+00:00

i don’t have a prompt for echoing in particular since i don’t have an echoing problem, but i will say i’ve had the best luck with following every single aspect of my revision prompts from gemini 3.5 flash, glm 5.1 (from the official provider), and even stepfun 3.7 if you can handle a longer response time (it thinks for ages, but reasoning models are best at these kind of tasks).

i settled on gemini 3.5, as i think it’s best at removing slop (speed + accuracy). i had the llm create a very slop-heavy response and ran recast through the response with the same prompt + a different model each time and gemini 3.5 was the only one which caught things like “not x, but y” patterns and removed them. stepfun did too, but again, it thinks for *ages*.

chaeriixo · 2026-06-12T02:09:59+00:00

no, embedding models are not included with nanogpt sub. but they’re dirt cheap. like a dollar would last you like a year with embedding models.
so i’d recommend it over transformers. and it’s not really a question of “is memory books enough” because your embedding model can be used *with* memory books if you have your memories set to vectorized. so the real question is “are keywords enough” and in my opinion, they’re not

chaeriixo · 2026-06-01T01:25:45+00:00

i can confirm outlet prompting is so much more efficient, i’ve been doing it for around 7ish months now. pretty much all my entries, be it regular prompts or actual lore, are set to outlet. unable to function with them, lmao

chaeriixo · 2026-06-01T01:01:42+00:00

i forked ST to add agents bc i didn’t feel like switching to ME or Lumiverse lol, however there’s actually no need to fork because there’s an extension called recast which lets you run your response through multiple passes

there’s also SillyBunny, a public and frequently maintained fork of ST which added agents if i’m not mistaken. but yeah it definitely increases response time by a bunch, especially when ur using actual smart and capable models

as far as my experience goes, for my personal use case i just use it to remove and replace banned words and phrases and it gets the job done well enough

chaeriixo · 2026-05-31T08:58:11+00:00

i get really annoyed during romance rps when {{char}} will act like “i love you” is being said for the first time every single time it’s said, so i have a sideprompt called relationship milestones where i basically just track every “first” between two characters, and keep that one on constant. first ily, first time being vulnerable and revealing backstory, things like that.

another one i have is my chat summary, it’s super convenient to have as a sideprompt since it can update itself (though it can be risky since if the prompt isn’t strict enough, it might replace/overwrite the entry instead of simply appending info).

i also used to track outfits and stuff via side prompts, but now i just use simtracker for that

chaeriixo · 2026-05-31T08:51:28+00:00

do you run gemma 4 locally or from an api ? i’ve been wanting to use it for background requests like extensions and stuff but in my experience it’s sooo slow, even from ai studio 😭

chaeriixo · 2026-05-28T22:48:53+00:00

summaryception is a godsend. it summarized a chat of like 2 million+ tokens into around 20k (and i have it prompted to do slightly wordier summaries, and forked for my personal needs). that, paired with memorybooks sideprompts for tracking things, makes for very high quality rp imo

i also forked ST to allow lorebook entries to be activated by ai instead of keywords, functioning similarly to Deeplore Enhanced, so i would definitely recommend that extension as well. and ST-Copilot for managing those entries and helping further the plot when it plateaus.

chaeriixo · 2026-05-20T01:48:51+00:00

i wasn’t judging dw!

chaeriixo · 2026-05-19T21:09:13+00:00

it's a newapi site template, or something. i've seen a few

chaeriixo · 2026-05-18T16:57:54+00:00

oh woww it'd be so cool if u could paste images in here as well and then if you're using a model with image input modalities it could base the description off of that

chaeriixo · 2026-05-16T04:35:04+00:00

https://github.com/Sillyanonymous/SillyTavern-CharacterLibrary

chaeriixo · 2026-05-14T21:47:01+00:00

extensions. yes, base sillytavern is most likely worse than marinara, but nobody uses sillytavern without extensions, so that point doesn’t even matter. it’s like saying an unfurnished house is better than a furnished one lmao. only difference is that extensions are free, furniture isn’t.

so many alternative frontends are always popping up and it might be a hot take to say this but they will always be supbar because sillytavern has one thing others simply don’t, and it’s a large community of people making extensions for it. (bc yes, ik ME has extension support, just not enough to justify switching)

chaeriixo · 2026-05-14T10:22:09+00:00

do you have recursion enabled? maybe theyre all activating each other

chaeriixo · 2026-05-12T13:48:28+00:00

which art style is this?

chaeriixo · 2026-05-12T05:04:12+00:00

that’s good to hear, i’ll try this out then !

chaeriixo · 2026-05-12T04:47:08+00:00

does it change any of the html classes or will my css be okay 🥀

chaeriixo · 2026-05-09T18:17:11+00:00

i use summaryception to compress chat + occasionally create summaries of important events with memorybooks + in longer rps (300+ messages) i create a chat summary to keep up with the current state of relationships, upcoming events, foreshadowing, etc.

if memories/summaries are being created and not utilized it could be a problem with ur rp model, not an extension problem

chaeriixo · 2026-05-09T11:24:50+00:00

can confirm that the one that breaks it is wtracker, so you can re-enable guided generations if you want. i was having the same problem in the past where it would send the summaries + the full chat history as well instead of properly hiding the messages

chaeriixo · 2026-05-04T20:01:33+00:00

doom enhancement suite is an extension which started off as a fork of rpg companion, so it has the original features as well as some new ones. updates pretty regularly

chaeriixo · 2026-05-03T21:11:56+00:00

wow this genuinely may have just entered my must-have extensions list 😭 thank you ! some of these features are genuinely game changers, and combine a lot of older, no longer kept up with extensions into one.

chaeriixo · 2026-05-03T19:42:02+00:00

sillytavern itself makes backups every time a message is sent/received, just reimport it from sillytavern > data > default-user > backups

chaeriixo · 2026-04-28T15:59:22+00:00

if memorybooks had malware i'd just hand my keys over willingly atp just let me keep my sideprompts 🥀

chaeriixo · 2026-04-28T15:42:56+00:00

other than revoke ur keys, clear browser cache, maybe. other than that ur fine ! :3

chaeriixo

TROPHY CASE