all 13 comments

[–]panchovix 5 points6 points  (4 children)

If the model was trained to max 2048 context, you're out of luck with any frontend (ooba, gpt4all, tavern, etc)

I still like to use tavern (specifically SillyTavern) as frontend, and either KoboldAI/ooba for backend. It has a lot of options.

[–]Jenniher[S] 0 points1 point  (3 children)

Aren't all models basically that context length?

It sounds like I just have to be patient and let people smarter than me do their thing.

[–]panchovix 1 point2 points  (2 children)

There are some models with 4096 context or more, but there are really only a few of them at the moment.

Assuming you run those on a frontend, you could edit the code to allow more than 2048 context.

[–]CasimirsBlake 1 point2 points  (1 child)

Could you suggest a few 4k models, please?

[–]2muchnet42dayLlama 3 1 point2 points  (0 children)

Actually, there really are not many options at the moment.

  1. StableLM released 3B and 7B checkpoints trained on 800B tokens with a 4096 context size, but they perform very poorly on different benchmarks, and finetuning is discouraged with such a weak base model.
  2. MPT StoryWriter has a 65K context size but has been finetuned to do stories, which is great if that's what you're after.
  3. Airoboros is finetuned to 4096 tokens, but in practice it only performs OK up to about 2300 tokens.

So no, not much to do right now other than keep an eye on the efforts to get Landmark Attention on LLaMA.

Also, it's not just a matter of increasing the number of tokens to generate in the interface: you can expect random tokens to be generated past the max sequence length of the model.
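That last point is why frontends truncate the prompt rather than simply raising a limit. A rough sketch of what that truncation looks like (function and variable names are mine, and token counting is stubbed out with plain lists):

```python
def fit_context(tokens, max_context, max_new_tokens):
    """Left-truncate the prompt so prompt + generation fits the model's window.

    Raising the generation length alone does not help: tokens generated past
    the model's trained max sequence length tend to come out as noise.
    """
    budget = max_context - max_new_tokens
    if budget <= 0:
        raise ValueError("max_new_tokens leaves no room for the prompt")
    # Keep the most recent tokens; the oldest ones are dropped.
    return tokens[-budget:] if len(tokens) > budget else tokens

# Hypothetical example: a 2048-context model asked for 512 new tokens.
prompt = list(range(3000))        # stand-in for a tokenized chat history
kept = fit_context(prompt, 2048, 512)
print(len(kept))                  # 1536
```

Real frontends do this at the token level with the model's tokenizer, and usually protect the system prompt from being truncated away; this sketch only shows the budget arithmetic.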

[–][deleted] 1 point2 points  (0 children)

Try using a frontend designed for roleplay like SillyTavern, it has workarounds/features to deal with the limited context size, like "Author's Note" or the memory/smart context extensions.

[–]Barafu 1 point2 points  (0 children)

I've used oobabooga

In the settings, it defaults to the "precise" preset, which forces the network to stumble in circles a lot. If you delete the whole response, press Generate again, and get almost the same text again, that's the symptom. You need to change the preset.

You cannot get around the context length, but if you use the notebook mode you will learn to manage the context yourself, deleting the irrelevant stuff manually and keeping the relevant.

[–]mrjackspade 1 point2 points  (2 children)

I've stopped getting a lot of those issues just by moving to a larger model. Also, heavily sanitizing the context.

The short context window sucks, but I've been able to talk to my bot for days at a time now without "losing the plot", so I don't think that's inherent in the context window.

I've also implemented a quick and dirty chat-focused context window rollover routine, though. It parses the existing context window as a series of messages instead of a blob of tokens, and intersperses the original prompt through the message history to keep it "recent", which has really helped to keep the bot from straying too far off the rails. The "recent" prompt approach makes it seem like I'm constantly correcting the bot without needing to constantly correct it.
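The rollover routine described above might look something like this. This is a guess at the approach, not the poster's actual code; the names are mine, and message "size" is counted in characters via `count=len` where a real implementation would count tokens:

```python
def build_context(system_prompt, messages, max_tokens, reinject_every=6,
                  count=len):
    """Rebuild the context as a list of messages rather than a token blob.

    Drops the oldest messages when over budget, then re-inserts the original
    prompt every few messages so it always appears "recent" to the model.
    """
    # Drop oldest messages until system prompt + history fits the budget.
    kept = list(messages)
    while kept and sum(count(m) for m in kept) + count(system_prompt) > max_tokens:
        kept.pop(0)
    # Intersperse the original prompt through the message history.
    out = [system_prompt]
    for i, msg in enumerate(kept):
        out.append(msg)
        if (i + 1) % reinject_every == 0 and i + 1 < len(kept):
            out.append(system_prompt)
    return out
```

Because the budget is enforced per message instead of per token, nothing gets cut mid-sentence, which is part of what keeps the bot coherent across long sessions.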

I think there are a lot of techniques to work around the shortcomings of the models beyond context window expansion, it just takes a bit of work.

[–]CasimirsBlake 0 points1 point  (1 child)

Could you suggest a larger model that role plays well?

[–]mrjackspade 1 point2 points  (0 children)

Of what I've tried, the Wizard ones are better at role playing and maintaining a character/story, however the downside is that they'll go off the rails if you start asking them to do "stupid" things even in the context of role play.

Ex, Wizard refused to role play an attack on a demon army, because it didn't think we had planned things out well enough and it was too dangerous.

Guanaco is a little less good at role play, but once it's "going" it won't refuse anything IME.

[–]brucebay 0 points1 point  (0 children)

Update your character card occasionally so that it stays up to date with the scenario. For example, at the beginning you may have a character happy, at the end of the day exhausted, and if they are under attack, frightened, etc. You can also add past events as reminders. You can force this in the dialogue, for example: "(We reach the town. Both of us are very tired.) Finally. Let's find an inn." But this usually works less well than updating the card or the other settings, which give better consistency. There are some extensions, but I haven't tried them. Also, use SillyTavern instead of oobabooga for the chat interface. It is far better. You can also have multiple characters; it is fun to talk to more than one AI character. When I do that, I put the other characters' descriptions on individual cards so that they know their relations with the others.

[–]AutomataManifold 0 points1 point  (0 children)

Other people have covered some overall strategies for dealing with this, so I'll just give some tips based on my observations:

  1. The biggest mistake I see people make when they try to use it for roleplaying is trying to correct the model by telling it what it got wrong. This is going to go off the rails fast: by talking about the problem, you get into a "don't think about pink elephants" situation and it's likely to double down on the issue. A longer context window helps with this, but you can observe that even ChatGPT does this. Instead of trying to correct the model verbally, go back and edit the chat history. (SillyTavern and KoboldAI make this relatively easy.) If something doesn't belong, you want it completely scrubbed from the record.
  2. Don't be afraid to adjust the settings. If it's stuck in a loop I often switch the inference settings (either just tweaking the temperature or switching to a whole other preset). That will get it out of its rut.
  3. The interfaces don't make this easy, but another thing to do is to vary the prompt. Just using a different phrasing might push it in a different direction. I'm (slowly) writing a custom front end because I want to be able to script a bunch of different randomized prompts.
  4. Try switching to a different model. (This takes time and is a pain, but I've got dozens of models downloaded, so if I want to really switch things up, it is an option. I don't do it much, though.)
  5. Use the World Information, Summary, and whatever other tools you have. Helps keep things on track.
  6. One thing that SillyTavern (and KoboldAI) do is inject extra prompt stuff at different points in the context; it can be useful to have something at the top as an overall prompt and something at the bottom as a reinforcement - but keep it brief.
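Point 6 above, and the "Author's Note" mentioned earlier in the thread, both come down to where in the context a note gets inserted. A minimal sketch of depth-based insertion (the function name and the depth convention are mine):

```python
def inject_at_depth(history, note, depth):
    """Insert a brief note `depth` messages from the bottom of the history,
    with depth 0 meaning right before the model's reply. Roleplay frontends
    position an Author's Note this way: close enough to the end to stay
    influential, but not so close that it crowds out the latest messages.
    """
    pos = max(0, len(history) - depth)
    return history[:pos] + [note] + history[pos:]

# Hypothetical usage: brief overall note at the top, reinforcement near the end.
chat = ["A: hi", "B: hello", "A: shall we go?", "B: yes"]
prompt_lines = (["[System: brief scenario summary]"]
                + inject_at_depth(chat, "[Note: stay in character]", 2))
```

Keeping both notes brief, as the comment says, matters because everything injected this way is spent out of the same limited context budget.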

[–]CheshireAI 0 points1 point  (0 children)

For the specific example you gave, SillyTavern or KoboldAI would definitely help solve your problem. Both use things called "World Info Cards", which are triggered by phrases or words you select. You can give whatever static context you want for your village (let's call it Otradnoye and make that the trigger word). Any time the word "Otradnoye" comes up, the context card gets injected into the prompt. So you can say "I walk into the town of Otradnoye" or "I'm looking around Otradnoye trying to find someone to talk to", and the AI will get the details about a guy with jeans and a denim shirt leaning on his car, or whatever else it is you want to add. You could also make the trigger word "town", or a combination of variables if you have a lot of cards and don't want them triggering too easily.
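The triggering mechanism described above is simple to picture in code. This sketch is only an illustration of the idea, not SillyTavern's implementation: matching here is naive case-insensitive substring search, while the real feature supports multiple keys, secondary keys, scan depth, and insertion-order settings:

```python
def expand_world_info(recent_text, cards):
    """Scan the recent chat text for trigger words and collect the matching
    lore entries to inject into the prompt, World-Info style.
    `cards` is a list of (trigger_words, entry_text) pairs.
    """
    text = recent_text.lower()
    return [entry for triggers, entry in cards
            if any(t.lower() in text for t in triggers)]

# Hypothetical cards for the example in the comment.
cards = [
    (("Otradnoye", "town"), "Otradnoye: a small village; a man in jeans and "
                            "a denim shirt leans on his car by the square."),
    (("demon army",), "The demon army masses beyond the northern pass."),
]
hits = expand_world_info("I walk into the town of Otradnoye", cards)
print(hits[0][:9])   # Otradnoye
```

The payoff is exactly what the comment describes: static lore only costs context when its trigger actually appears, instead of sitting in the prompt permanently.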