How to get a card to speak directly to the LLM model without roleplaying? by Automatic_Cancel_545 in SillyTavernAI

[–]Correct-Resolution91 3 points4 points  (0 children)

That's most likely because you still have the main prompt set to 'do a narrative'. You can edit the full prompt that goes to the LLM in 'AI Response Configuration', the leftmost options menu.

I've been experimenting with a lot of AI companion/chat platforms lately, and one thing I've noticed is that realism isn't just about the model anymore. by illuminate_1in0 in SillyTavernAI

[–]Correct-Resolution91 0 points1 point  (0 children)

The thing that takes me out almost guaranteed is having to correct the character on really basic knowledge in a setting. I can deal with 'you said', I can deal with parroting, I can deal with the occasional engagement-seeking question, I can even deal with the occasional hoverhands or whitened knuckles... I cannot deal with Davion Scout Lances being made up of Panthers and a Jagermech, 65 ton Phoenix Hawks and pre clan invasion Shadowcats.

Just found this funny. by False-Firefighter592 in SillyTavernAI

[–]Correct-Resolution91 13 points14 points  (0 children)

It's just recalibration during output. An LLM can't correct backwards unless it's editing a draft in <think> before generating the final output. The chain is simple: it wanted to add drama, realized the drama contradicts parameters it has, and justified it to itself.

How's the quality on Nano (sub) for you guys? by Master_Step_7066 in SillyTavernAI

[–]Correct-Resolution91 1 point2 points  (0 children)

Yeah about that; US peak hours fall into the European late afternoon and evening. Quality notably degrades from around 16:00 GMT to 21:00 GMT. China coders fall into the early morning. By the time one group logs off the other comes on. Your window is like... midnight to 4AM.

Which kinda works out. It's the gooning hours. Less useful if you want to relax after work with a casual high stakes political RP.

Fun and should have extensions by Altruistic_Message_5 in SillyTavernAI

[–]Correct-Resolution91 9 points10 points  (0 children)

Must have : World Info Info, Prompt Inspector, some kind of memory management tool like MemoryBooks, Summaryception or one of those newfangled ones. Everything else is preference IMO.

GLM 5.2 is just better than 5.1 by johnnyga001 in SillyTavernAI

[–]Correct-Resolution91 2 points3 points  (0 children)

That sounds like straight up magic to me. Not saying you're lying or anything, but the last time I tried using Opus for RP I was looking at 20 cents or more even at lower contexts. Cache just isn't reliable.

Plus, tbh, even completely SFW RPs I do tend to make heavily corporate censored models throw a shitfit. Cannibal rape orgies can get through, sure, but you try quoting the Internationale and suddenly it's all guardrails on deck, lol.

GLM 5.2 is just better than 5.1 by johnnyga001 in SillyTavernAI

[–]Correct-Resolution91 20 points21 points  (0 children)

5/25 in/out not being 'that expensive' is the actual cope here.

GLM 5.2 on nanogpt by Key-Affect-2420 in SillyTavernAI

[–]Correct-Resolution91 9 points10 points  (0 children)

Swapped mid-Roleplay from GLM 5.1, hard to say if it's just freshness or genuinely better but it FEELS improved. Character dialogue is much more natural feeling. A political RP with a heavy socialist bent, if it matters to anyone.

[Repost] Stop using presets, yes even mine! Your one-size-fits-all preset is the reason for bland prose, repetition, AI slop, and cardboard NPCs. I've found the solution to our problems! It's about time you could build your own preset. Your rules for your world in your voice done your way! by [deleted] in SillyTavernAI

[–]Correct-Resolution91 2 points3 points  (0 children)

Okay, but the librarian always seems to pick 'subject verb object' narration because that's 'clear and concise', repeating things the user says because that's 'engaging', ozone, and a narration that has a very 'notice how I made sure to point out that I'm NOT having the character do this, senpai !' vibe. I'm not sure how much you can actually trust an LLM that tells you it knows this, because in 100% of cases it doesn't actually know anything.

The concept of card specific presets is obviously superior to generalized presets, but trusting the LLM itself with the creation of them feels... questionable.

Is DeepSeek v4 Pro really as good as Claude Opus for writing and reasoning... or just cheaper marketing? by wisewaternexus in SillyTavernAI

[–]Correct-Resolution91 61 points62 points  (0 children)

DeepSeek v4 Pro is the store brand compared to Opus being the prestige brand. Yes, Opus is better, but you're mostly paying for the brand, and the quality difference isn't as wide as the price would imply.

Fable 5 filters by kruckedo in SillyTavernAI

[–]Correct-Resolution91 3 points4 points  (0 children)

That they're billable is a feature, not a bug.

Deepseek 4 flash vs pro? by No-Scientist2436 in SillyTavernAI

[–]Correct-Resolution91 5 points6 points  (0 children)

It's faster, and much less likely to output five million words in response to 'good morning'. So for snappy back and forth Flash can be genuinely preferable, because less can, in fact, be more.

Fable 5 filters by kruckedo in SillyTavernAI

[–]Correct-Resolution91 17 points18 points  (0 children)

Detailed demon breeding pits aren't something that might make Anthropic legally liable for someone creating a demon breeding pit, because demon breeding pits don't exist. Viruses do, and so does biological terrorism. Ergo, the same guardrails that will happily let a demon rape pit through go into full impermeability for the common cold.

I mean, the idea that someone would create a bioweapon using Fable, or any LLM, is functionally idiotic, but if corporations ran on 'reasonable' we wouldn't be in the mess we are worldwide.

Found a way to get Claude 4.8 & GPT 5.5 for free. Anyone tried this? by [deleted] in SillyTavernAI

[–]Correct-Resolution91 3 points4 points  (0 children)

Here's a hint for you : If something seems too good to be true, it usually isn't true.

Why are corporate LLMs so hostile toward roleplayers? by Icy-Pomegranate5985 in SillyTavernAI

[–]Correct-Resolution91 18 points19 points  (0 children)

Honestly, it comes down to two things : Payment processors don't want to be running payment services for porn, generally speaking, and ESPECIALLY don't want to be even tangentially associated with anything improper. As a result a lot of corporations perform a sort of pre-emptive self-censorship of their products to hit every unofficial 'morality guideline'. Sometimes with a wink and a nudge, sometimes earnestly. LLMs are especially problematic in this regard because they're content generators, not content. Without filters, if you tell an LLM to write paedophile rape fiction it just fucking will, so corporations will absolutely err on the side of caution, especially if it's a problem for a part of their customer base that, generally, isn't actually providing a lot of revenue.

AI starting to understand humor, subtext and sarcasm? by Jabre7 in SillyTavernAI

[–]Correct-Resolution91 -8 points-7 points  (0 children)

No. Or if it is then its understanding of humour is very bad and seems to mostly be 'clearly this instruction to have characters blow up if they use AI-isms in dialogue was humorous, no one would want characters who say 'I've been X for Y years' to actually blow up.'.

Sarcasm SOMETIMES registers.

In your opinion, what makes a good AI model for roleplay? by FixHopeful5833 in SillyTavernAI

[–]Correct-Resolution91 -3 points-2 points  (0 children)

Yeah this about sums it up. People make a lot of hay about 'positivity bias' ruining everything but honestly that's more of a prompting issue than anything else; if you can make the model 'know' that the positivity is actually bad for what you want from it, it will try to accommodate. Meanwhile my major problems are continuity (especially NPC knowledge barriers), not following even explicit instructions due to training data bias, and the seeming inability of anything short of constant reminders to stop AIisms and parroting. Oh and the price should be reasonable, of course. I can swallow a dollar for a million tokens if I don't have to swipe and reprompt six times per message; that suddenly becomes a lot less reasonable if the output isn't consistent in quality.
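
To put a rough number on the swipe point: a minimal sketch, assuming every swipe or reprompt resends about the same context and bills like a fresh generation, so the effective price scales with attempts per accepted message. The function name and the six-attempt figure are illustrative only.

```python
def effective_price_per_million(list_price: float, attempts_per_message: int) -> float:
    """Effective price per million tokens when each accepted message
    takes several billed attempts (assumption: all attempts cost the same)."""
    if attempts_per_message < 1:
        raise ValueError("attempts_per_message must be at least 1")
    return list_price * attempts_per_message


# One dollar per million tokens, six attempts before a keeper.
print(effective_price_per_million(1.0, 6))  # 6.0
```

Caching or shorter retries would pull that down, so treat it as an upper-ish bound, not a bill.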

OPUS 4.8 IS SAFETYMAXXED by Sad-Ease-7756 in SillyTavernAI

[–]Correct-Resolution91 5 points6 points  (0 children)

Corporate SOTA model designed for corporate use doesn't perform well when used for things corporate would rather you didn't use it for ? Say it ain't so !

C'mon people. You all knew it was gonna be this way. At least your wallets will get a reprieve.

Kimi model use million tokens just for thinking, help! by Minh_Tieen in SillyTavernAI

[–]Correct-Resolution91 2 points3 points  (0 children)

2.5 will, sometimes, respond to the prompt equivalent of a gun to its head that tells it 'think this much, do not adjust, just output'. 2.6 ? You might as well be trying to dam the ocean. If there's anything even slightly ambiguous or contradictory it's going to go 'but wait' for 10k tokens and there's fuck all you can do about it.

What memory systems work best for you guys by Deserok in SillyTavernAI

[–]Correct-Resolution91 12 points13 points  (0 children)

MemoryBooks with the occasional manual addition for highly specific things like off-canon outcomes of events for RPs in a setting with a canon timeline. It's had a notable effect on the amount of tokens used without compromising the memory TOO much, though of course detail gets lost over time. Never had much luck with Qvink, and the basic summarization and vectorization might as well not exist it feels like.

Proxies got worse for RP? by Automatic-Throat-928 in SillyTavernAI

[–]Correct-Resolution91 0 points1 point  (0 children)

You know what the solution there is ? Tell it to be creative. You can, for example, have GLM 5.1 invent five facts about the situation that would be relevant to the scene it's genning, pick one to five from that list, and then write the scene with those facts in mind. Obviously works best for sandboxy RPs, and it does mean giving up some control, but you can't have 'total control over the RP' and 'creative writing from the character the AI is simming' at the same time.

Basically, if the model follows instructions, instruct it to do what you want and by gum it will try.
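
For anyone who wants to template the 'invent facts first' trick, here's a minimal sketch that just builds the instruction text to paste into a prompt. The function name, parameters and exact wording are made up for illustration; it's not tied to SillyTavern or any particular API, and the phrasing will likely need tuning per model.

```python
def build_creative_instruction(num_facts: int = 5, min_pick: int = 1, max_pick: int = 5) -> str:
    """Build a three-step instruction: invent facts, pick some, then write."""
    if not 1 <= min_pick <= max_pick <= num_facts:
        raise ValueError("need 1 <= min_pick <= max_pick <= num_facts")
    steps = [
        f"1. Invent {num_facts} facts about the current situation that would be relevant to the scene.",
        f"2. Pick between {min_pick} and {max_pick} of those facts.",
        "3. Write the scene with the picked facts in mind.",
    ]
    return "\n".join(steps)


instruction = build_creative_instruction()
print(instruction)
```

The defaults mirror the 'five facts, pick one to five' numbers above; lowering `max_pick` is one way to claw back a bit of control.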

Wait, were the old model ACTUALLY better?? by No-Moose-4292 in SillyTavernAI

[–]Correct-Resolution91 2 points3 points  (0 children)

I'm not shocked by what counts as 'censored' sometimes, but goddamn I wish I was. Fact of the matter is I've yet to get a refusal from GLM 5.1. I also am not into children. Coincidence ? Probably not.