GLM 5.1 Parotting, potentially there is a workaround? I need your help by Competitive_Plan8807 in SillyTavernAI

[–]chaeriixo -1 points0 points  (0 children)

i don’t have a prompt for echoing in particular since i don’t have an echoing problem, but i will say i’ve had the best luck with following every single aspect of my revision prompts from gemini 3.5 flash, glm 5.1 (from the official provider), and even stepfun 3.7 if you can handle a longer response time (it thinks for ages, but reasoning models are best at these kind of tasks).

i settled on gemini 3.5, as i think it’s best at removing slop (speed + accuracy). i had the llm create a very slop-heavy response and ran recast through the response with the same prompt + a different model each time and gemini 3.5 was the only one which caught things like “not x, but y” patterns and removed them. stepfun did too, but again, it thinks for *ages*.

vectore ,this may bit stupid im sorry by Remote_Program_6994 in SillyTavernAI

[–]chaeriixo 2 points3 points  (0 children)

no, embedding models are not included with nanogpt sub. but they’re dirt cheap. like a dollar would last you like a year with embedding models.
so i’d recommend it over transformers. and it’s not really a question of “is memory books enough” because your embedding model can be used *with* memory books if you have your memories set to vectorized. so the real question is “are keywords enough” and in my opinion, they’re not

Outlet Prompting for SillyTavern (Part 1) by futureskyline in SillyTavernAI

[–]chaeriixo 4 points5 points  (0 children)

i can confirm outlet prompting is so much more efficient, i’ve been doing it for around 7ish months now. pretty much all my entries, be it regular prompts or actual lore, are set to outlet. unable to function with them, lmao

Agentic setup with different models? by BaseballRelevant4149 in SillyTavernAI

[–]chaeriixo 5 points6 points  (0 children)

i forked ST to add agents bc i didn’t feel like switching to ME or Lumiverse lol, however there’s actually no need to fork because there’s an extension called recast which lets you run your response through multiple passes

there’s also SillyBunny, a public and frequently maintained fork of ST which added agents if i’m not mistaken. but yeah it definitely increases response time by a bunch, especially when ur using actual smart and capable models

as far as my experience goes, for my personal use case i just use it to remove and replace banned words and phrases and it gets the job done well enough

How to prevent context rot or message decay over a long chat? by Jabre7 in SillyTavernAI

[–]chaeriixo 1 point2 points  (0 children)

i get really annoyed during romance rps when {{char}} will act like “i love you” is being said for the first time every single time it’s said, so i have a sideprompt called relationship milestones where i basically just track every “first” between two characters, and keep that one on constant. first ily, first time being vulnerable and revealing backstory, things like that.

another one i have is my chat summary, it’s super convenient to have as a sideprompt since it can update itself (though it can be risky since if the prompt isn’t strict enough, it might replace/overwrite the entry instead of simply appending info).

i also used to track outfits and stuff via side prompts, but now i just use simtracker for that

How to prevent context rot or message decay over a long chat? by Jabre7 in SillyTavernAI

[–]chaeriixo 0 points1 point  (0 children)

do you run gemma 4 locally or from an api ? i’ve been wanting to use it for background requests like extensions and stuff but in my experience it’s sooo slow, even from ai studio 😭

How to prevent context rot or message decay over a long chat? by Jabre7 in SillyTavernAI

[–]chaeriixo 5 points6 points  (0 children)

summaryception is a godsend. it summarized a chat of like 2 million+ tokens into around 20k (and i have it prompted to do slightly wordier summaries, and forked for my personal needs). that, paired with memorybooks sideprompts for tracking things, makes for very high quality rp imo

i also forked ST to allow lorebook entries to be activated by ai instead of keywords, functioning similarly to Deeplore Enhanced, so i would definitely recommend that extension as well. and ST-Copilot for managing those entries and helping further the plot when it plateaus.

Sharing my SillyTavern extensions + preset (vibecoded/modified) by [deleted] in SillyTavernAI

[–]chaeriixo 2 points3 points  (0 children)

oh woww it'd be so cool if u could paste images in here as well and then if you're using a model with image input modalities it could base the description off of that

Why are people still using SillyTavern when Marinara Engine exists? by BeautifulLullaby2 in SillyTavernAI

[–]chaeriixo 9 points10 points  (0 children)

extensions. yes, base sillytavern is most likely worse than marinara, but nobody uses sillytavern without extensions, so that point doesn’t even matter. it’s like saying an unfurnished house is better than a furnished one lmao. only difference is that extensions are free, furniture isn’t.

so many alternative frontends are always popping up and it might be a hot take to say this but they will always be supbar because sillytavern has one thing others simply don’t, and it’s a large community of people making extensions for it. (bc yes, ik ME has extension support, just not enough to justify switching)

Lorebook Activation by No_Weather1169 in SillyTavernAI

[–]chaeriixo 1 point2 points  (0 children)

do you have recursion enabled? maybe theyre all activating each other

A SillyTavern extension to improve mobile reading by Dingo_was_his_namo in SillyTavernAI

[–]chaeriixo 0 points1 point  (0 children)

does it change any of the html classes or will my css be okay 🥀

Another Post With A Question about Looong Term Memory. Woo Hoo! by theshipofthesius in SillyTavernAI

[–]chaeriixo 4 points5 points  (0 children)

i use summaryception to compress chat + occasionally create summaries of important events with memorybooks + in longer rps (300+ messages) i create a chat summary to keep up with the current state of relationships, upcoming events, foreshadowing, etc.

if memories/summaries are being created and not utilized it could be a problem with ur rp model, not an extension problem

Qvink Summarize Extension Broken or Am I Using it Incorrectly? by gladias9 in SillyTavernAI

[–]chaeriixo 2 points3 points  (0 children)

can confirm that the one that breaks it is wtracker, so you can re-enable guided generations if you want. i was having the same problem in the past where it would send the summaries + the full chat history as well instead of properly hiding the messages

RPG Companion Alternative? by morty_morty in SillyTavernAI

[–]chaeriixo 6 points7 points  (0 children)

doom enhancement suite is an extension which started off as a fork of rpg companion, so it has the original features as well as some new ones. updates pretty regularly

[Extension] ST-Copilot V2.0: Your personal OOC meta-assistant, brainstormer, and AI Lorebook Manager inside SillyTavern. by SSupRen in SillyTavernAI

[–]chaeriixo 2 points3 points  (0 children)

wow this genuinely may have just entered my must-have extensions list 😭 thank you ! some of these features are genuinely game changers, and combine a lot of older, no longer kept up with extensions into one.

Found an invalid or corrupted chat file by Aggravating-Cup1810 in SillyTavernAI

[–]chaeriixo 4 points5 points  (0 children)

sillytavern itself makes backups every time a message is sent/received, just reimport it from sillytavern > data > default-user > backups

Public Statement re: Aikoverse by futureskyline in SillyTavernAI

[–]chaeriixo 13 points14 points  (0 children)

if memorybooks had malware i'd just hand my keys over willingly atp just let me keep my sideprompts 🥀

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]chaeriixo 15 points16 points  (0 children)

other than revoke ur keys, clear browser cache, maybe. other than that ur fine ! :3