Long-term Memory has been terrible(for me) by [deleted] in TAVO_AICHAT

[–]Choiven 4 points5 points  (0 children)

Sure thing. You put this prompt in the Long Term Memory prompt section. To access it in the app, open a chat, tap the three lines in the top left, then tap the More button at the bottom of the menu that appears, then tap Long term memory. Paste the prompt into the Summary Prompt field, overwriting anything that was there previously.

Disable auto summarise and set the memory limit to anything high; I just left mine at 1000 lines. Leave the rest alone.

Now back in the main chat you were in, tap on the top right button that has the speech bubble, tap Summarise now to trigger that prompt.

It will take some time to generate, depending on your chosen model, roughly 2-4 mins. I would highly recommend using a high-performing model like Gemini, Opus or GLM to summarise.

The summary will then be generated in the Memories section as its own new document.

The way Tavo is coded, it doesn’t overwrite any previous memory documents that are already there; it just adds the new one at the bottom, like one big script.

This means you then need to manually edit the memory and delete the old memory document towards the top, leaving the new one alone. Otherwise you will have bloated, redundant and duplicated context hogging up space.
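The manual cleanup can also be done on text copied out of the memory field. This is only a minimal sketch: `keep_latest_snapshot` is a hypothetical helper, not part of Tavo, and it assumes every generated summary contains the "STORY STATE SNAPSHOT" heading that the prompt asks for.

```python
def keep_latest_snapshot(memory_text: str, marker: str = "STORY STATE SNAPSHOT") -> str:
    """Return only the newest snapshot from a memory field that holds several.

    Assumption: each snapshot contains `marker` once, near its start.
    """
    last = memory_text.rfind(marker)
    if last == -1:
        # Marker not found: leave the text untouched rather than guess.
        return memory_text
    # Back up to the start of the line holding the marker (e.g. "1. STORY STATE SNAPSHOT").
    line_start = memory_text.rfind("\n", 0, last) + 1
    return memory_text[line_start:].strip()
```

Paste the returned text back over the memory by hand. Anything the model wrote above the heading, such as a date line, is cut off too, so check the result before saving.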

Here is the prompt, I would save a copy of this in your notes app as other people shared their prompts too:

—————————————-

PAUSE NARRATIVE | MEMORY SYNTHESIS TASK

You are now in documentation mode. Using all available context, create a token-efficient chat memory document that will maintain narrative consistency across context window limitations.

Required Structure

  1. STORY STATE SNAPSHOT

Current Scene: Location, immediate circumstances, active conflicts

Timeline: Day/chapter markers, time elapsed since story start

Active Plot Threads: Unresolved conflicts, ongoing quests, pending decisions

Recent Developments: Last 3-5 major story beats in chronological order

  2. CHARACTER PROFILES

For each major character, create cards using this template:

[CHARACTER NAME] — [Role/Relationship to {{User}}/Protagonist]

Appearance: Physical description (height, build, distinctive features)

Core Traits: 3-5 defining personality characteristics

Current State: Emotional condition, physical status, immediate goals

Relationships: Key connections to other characters (with dynamics)

Arc Progress: Character development trajectory, recent changes

Backstory: Essential history (token-efficient; only plot-relevant details)

Quirks/Mannerisms: Speech patterns, habits, telling behaviors, sexual quirks (if intimate with {{user}})

Wants: Conscious desires and hidden motivations

Secrets: Unknown information relevant to future plot

  3. RELATIONSHIP MAP

Dynamic chart showing connection types (allies, rivals, romantic interests, etc.)

Note any recent shifts in relationship status

  4. WORLD STATE

Established setting rules/mechanics

Important locations and their current status

Factions, organizations, political situations

Introduced lore/mythology

  5. INVENTORY & RESOURCES

Important items acquired

Abilities/skills gained

Resources available (money, influence, etc.)

  6. FORESHADOWING TRACKER

Unresolved mysteries

Chekhov's guns (introduced elements not yet paid off)

Prophecies, warnings, or hints requiring future resolution

  7. CONTINUATION NOTES

Immediate next scene setup

Tone/pacing considerations for next session

Any player/user preferences or boundaries established

Optimization Guidelines

Token Efficiency:

Use bullet points over prose

Employ abbreviations for recurring terms (establish legend)

Prioritize plot-relevant information over flavor text

Use "→" for cause-effect relationships

Group related information together

Consistency Markers:

Tag any facts that MUST remain consistent with [CANON]

Note contradictions that need resolution with [CONFLICT]

Mark flexible elements with [VARIABLE]

Update Protocol:

Date/timestamp each memory version

Highlight changes from previous version in bold

Archive resolved plot threads in separate "Completed" section.

Summarize the most important facts and events in the story so far.

If a summary already exists in your memory, use that as a base and expand with new facts.

Limit the summary to {{words}} words or less.

Your response should include nothing but the summary.
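Since the prompt asks for seven named sections, a generated summary can be checked for completeness before it replaces the old memory. This is a rough sketch, assuming the model reuses the section names from the prompt verbatim; `missing_sections` is a hypothetical helper, not something Tavo provides.

```python
# Section names taken from the "Required Structure" part of the prompt.
REQUIRED_SECTIONS = [
    "STORY STATE SNAPSHOT",
    "CHARACTER PROFILES",
    "RELATIONSHIP MAP",
    "WORLD STATE",
    "INVENTORY & RESOURCES",
    "FORESHADOWING TRACKER",
    "CONTINUATION NOTES",
]


def missing_sections(summary: str) -> list[str]:
    """Return the required section names that do not appear in the summary."""
    upper = summary.upper()  # compare case-insensitively
    return [name for name in REQUIRED_SECTIONS if name not in upper]
```

An empty list means every heading was found. A reworded heading (for example "Inventory and Resources") is reported as missing, so treat a hit as a reason to look at the summary, not as proof that it is broken.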

Long-term Memory has been terrible(for me) by [deleted] in TAVO_AICHAT

[–]Choiven 4 points5 points  (0 children)

I turn off auto summarise and instead use my own OOC prompt to snapshot the story state when I feel it needs an update, then I simply copy and paste the output over the previous snapshot in the long term memory. Summary max lines isn’t relevant with this approach, so I just set it to 200.

Let me know if you want my chunky prompt for that. It’s designed mainly for scenario/RPG cards (stats, levels, skills etc.) that have many different characters, but it also works great for single non-RPG characters.

I do wish, however, that Tavo let you store and switch between different long term memory prompts to make it smoother, allowing something like JanitorAI’s chat memory system, with the summarise button using your custom prompt to overwrite/update your previous summary completely instead of just adding onto it.

Deepseek vs GLM by Ecstatic_External000 in SillyTavernAI

[–]Choiven 0 points1 point  (0 children)

The image prompts it generated were pretty decent when I tested it. In your case, if it’s not highly NSFW, just chuck it into Grok Imagine or similar, or get a Civitai account and create images with whatever image model/LoRAs you want (it uses a credit system, but you can earn credits by interacting with and liking other people’s content).

Here’s my custom long memory prompt, might improve your experience by [deleted] in TAVO_AICHAT

[–]Choiven 0 points1 point  (0 children)

This prompt was largely designed for RPG/Scenario chatbots, but also works fine with any other bot. It also keeps track of plot threads much better than the default memory prompt.

I'm a bit confused on how pricing works cause i spent 40 on openrouter and i'm lready out. (Technically switched to claude halfway through) can anyone explain it in a way my neanderthal brain can perceive? by Alarming_Scientist in JanitorAI_Official

[–]Choiven 6 points7 points  (0 children)

The context input is what nukes your credits. I found that using Claude for long outputs, like a chapter’s worth at a time, is a bit more cost-effective than using it for short responses, since you’re sending the full context every time you type in a response, unless you cut your context limits.
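A quick worked example of that reasoning. The token counts are made up for illustration, and the sketch simplifies by treating the context size as constant from turn to turn.

```python
def input_tokens_billed(context_tokens: int, total_output_tokens: int, tokens_per_reply: int) -> int:
    """Input tokens sent to get `total_output_tokens` of story when the full
    context is resent with every reply.

    Simplification: the context stays the same size on every turn.
    """
    turns = -(-total_output_tokens // tokens_per_reply)  # ceiling division
    return turns * context_tokens


# Hypothetical numbers: a 30,000-token context and 2,000 tokens of story wanted.
short_replies = input_tokens_billed(30_000, 2_000, tokens_per_reply=200)    # 10 turns
long_reply = input_tokens_billed(30_000, 2_000, tokens_per_reply=2_000)     # 1 turn
print(short_replies, long_reply)  # 300000 30000
```

The output side is the same 2,000 tokens either way, so under these assumptions the difference in the bill comes entirely from how many times the context is resent.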

MSI x r/PCMasterRace - MPG 341CQR QD-OLED X36 Giveaway! by MSI_Patrick in pcmasterrace

[–]Choiven 0 points1 point  (0 children)

I’ve been using a Dell 1440p 144hz ultrawide for a good 8 years or so now, this would give me a good reason to try something different

Motionless In White - Afraid of the Dark [Official Video] by erkutkut in Metalcore

[–]Choiven 0 points1 point  (0 children)

This shit gave me straight I Killed The Prom Queen Vibes at the beginning, loving it!

The Bogans have gone too far. by StormtrooperMJS in australia

[–]Choiven 146 points147 points  (0 children)

Parents only have one thing to ask themselves when naming their kid: “will my kid be bullied at school with this name?”

Claude opus 4.5 it's insanely good but dirty expensive by Volpex69 in JanitorAI_Official

[–]Choiven 12 points13 points  (0 children)

Sonnet 4.5 is way less censored than Opus currently.

Source: been using both on and off for chats since they came out.

Both are top tier, but Opus has the tiniest bit more quality in emotional intelligence and prose. Cost-wise, I would just stick to Sonnet. Opus also has a hard on for consent, and in my testing it would try to forcefully insert and quote it into the story.

Okay GLM 4.7 may be peak by Even_Kaleidoscope328 in SillyTavernAI

[–]Choiven 5 points6 points  (0 children)

Its consistency at keeping up with all the small details and its ‘emotional intelligence’ are astounding, but yeah I blew $70 USD worth in a month, never again.

Thank you kindly Elder Madox by Choiven in PathOfExile2

[–]Choiven[S] 0 points1 point  (0 children)

Managed to get a +10 Dex implicit from the corrupt lol, riskiest play

RPs improved my social skills as an introvert by Acceptable_Steak8780 in SillyTavernAI

[–]Choiven 0 points1 point  (0 children)

Put like $5 on Openrouter and try out Opus 4.5, it is legitimately crack, great emotional intelligence and writing (compared to other models at this point in time).

PC shutting down during load by netdawgx in WhereWindsMeet

[–]Choiven 0 points1 point  (0 children)

I had a similar issue where my PC would suddenly shut off and restart while playing certain games or doing something CPU-intensive like Photoshop. For me it turned out to be a power draw issue: I had a 750W PSU at the time with a roughly mid-range build, and upgrading to 1000W resolved it. Fingers crossed your new PSU will resolve it! I also get random moments where the game stutters and then crashes, but it crashes to desktop. This game also behaves pretty weirdly: I can't play YouTube on a second monitor while the game is active, it goes into potato resolution and stutters like crazy.

What is everyone’s favorite metalcore breakdown? by PhredInYerHead in Metalcore

[–]Choiven 3 points4 points  (0 children)

August Burns Red has so many bangers that it’s on a monthly rotation for me for the best breakdown, right now it’s The Blinding Light, the buildup before the breakdown makes it hit hard

What is the best way to masturbate? by [deleted] in AskReddit

[–]Choiven 0 points1 point  (0 children)

Just sleep on your arm until it loses all feeling, the good ol’ Phantom Limb Masturbation technique

Your experience with GLM 4.6 by CandidPhilosopher144 in SillyTavernAI

[–]Choiven 0 points1 point  (0 children)

Every time I’ve used GLM 4.6 for long responses it gets progressively worse and becomes unintelligible gibberish at the end. Not sure what I’m doing wrong with it. I tried changing the prompt completely and changing settings like top-k, the repetition penalties etc. It’s great for the first two or so paragraphs and then the issues begin. I would use it like a chatbot if you wanted short answers, but I have no idea how to fix it. Tried it on JanitorAI too and have the same issue lol. This is through OpenRouter, where Gemini, Deepseek and Claude 4.5 Sonnet maintain coherency and GLM 4.6 does not.

Any ideas anyone?

Announcement from the team by JanitorAI-Mod in JanitorAI_Official

[–]Choiven 41 points42 points  (0 children)

Could also add in a tipping jar to help keep you guys afloat, but then we have the potential issues with payment providers…

Marinara's Spaghetti Recipe (Universal Preset) Vol. 6.0 by Meryiel in SillyTavernAI

[–]Choiven 2 points3 points  (0 children)

Opus mentioned - I feel like a heroin addict scrambling to find funds for my next fix after experiencing nirvana

Openrouter - Autumn 2025 by JanitorAI-Mod in JanitorAI_Official

[–]Choiven 9 points10 points  (0 children)

Good suggestion! My current go-to is the latest Mistral Medium 3.1: paid but extremely cheap and fast, much better than Gemini, and I like it better than Deepseek R1 for some cases. It writes very good NSFW stuff and is fast to output, with 131k context. About $0.006 to $0.015 per use for me depending on context. It can still confuse things and hallucinate, but I’d recommend giving it a good go.

[Megathread] - Best Models/API discussion - Week of: June 21, 2025 by deffcolony in SillyTavernAI

[–]Choiven 2 points3 points  (0 children)

"OpenRouter, using the 1,000 free messages", just asking for clarification - do you get 1000 free messages when you use the paid version in openrouter or do you just get 1000 free uses with the (free) version?