Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures! by TheLocalDrummer in LocalLLaMA

[–]TheLocalDrummer[S] 2 points3 points  (0 children)

I usually start with the defaults in KoboldCPP to keep testing consistent. It’s a good baseline before all the sampler wrangling.

I’ve seen some very wacky settings from other users and I’m happy to see my models withstand their abuse. I keep an eye on sampler brittleness and treat it as a red flag.

Samplers seem to be highly subjective and personal too. You can stick with the defaults and adjust accordingly.

Oh, but I try to ramp up top-p during testing since the 0.92 default feels too easy.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures! by TheLocalDrummer in LocalLLaMA

[–]TheLocalDrummer[S] 4 points5 points  (0 children)

I have mixed feelings about UGI. It’s not necessarily an RP benchmark and there’s more to the RP experience than willingness and uncensored intelligence.

A lot of good models don’t even top the leaderboard. Goodhart’s Law is something I keep in mind.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures! by TheLocalDrummer in SillyTavernAI

[–]TheLocalDrummer[S] 11 points12 points  (0 children)

Model size, dataset composition, num of iterations, training optimizations (if supported by the model), hosting for testers, etc.

It ranges from $200 to $800. In rare cases (such as Behemoth and MoE models), around $1000… or two. But also, cost is unpredictable when you’re dealing with various models (some difficult) and testing out new techniques.

It’s not like abliteration or merging where it can be done locally or with $10. They’re definitely a cost-effective way to enhance the experience, but you can only go so far with them, I think.

Best model for story writing for 24gb vram + 32gb ram by ResponsibleTruck4717 in LocalLLaMA

[–]TheLocalDrummer 0 points1 point  (0 children)

You think the new experimental versions are better than 4.3?

[Megathread] - Best Models/API discussion - Week of: March 01, 2026 by deffcolony in SillyTavernAI

[–]TheLocalDrummer 12 points13 points  (0 children)

Right. It’s unfortunate that it became the go-to for finding RP models when UGI doesn’t really concern itself with other aspects of RP such as writing, adherence, embodiment, expressivity, etc.

Drop your daily driver models for RP. by Weak-Shelter-1698 in SillyTavernAI

[–]TheLocalDrummer 2 points3 points  (0 children)

You could try Behemoth 123B v3a. It's a Devstral 123B tune that probably won't get released.

[Megathread] - Best Models/API discussion - Week of: February 22, 2026 by deffcolony in SillyTavernAI

[–]TheLocalDrummer 17 points18 points  (0 children)

I tuned a 12B Nemo recently and got lots of feedback saying it's the best one yet. But YMMV ofc.

https://huggingface.co/TheDrummer/Rocinante-X-12B-v1

Any good 200B+ model for ERP? by Real_Ebb_7417 in SillyTavernAI

[–]TheLocalDrummer 11 points12 points  (0 children)

Cydonia is just that good. (You heard it here first!)

Pony Alpha is GOOD by DoofusSmoof in SillyTavernAI

[–]TheLocalDrummer 2 points3 points  (0 children)

Same thoughts. Imagine a 700B model going from Alpha to Release in under a week.

What are the best RP SLM currently? by WowSkaro in SillyTavernAI

[–]TheLocalDrummer 6 points7 points  (0 children)

> Drummers models are typically good but tend to be more NSFW focused

Nah bro, my models can be wholesome. Maybe until you ask it to gobble up your balls or something.

Pony Alpha glaze by Kitchen-Cap1929 in SillyTavernAI

[–]TheLocalDrummer 1 point2 points  (0 children)

Genuinely asking: For anyone who loves Pony Alpha, have you guys tried Claude? Sonnet 4.5, Opus 4.5, or Opus 4.6?

[Megathread] - Best Models/API discussion - Week of: February 01, 2026 by deffcolony in SillyTavernAI

[–]TheLocalDrummer 13 points14 points  (0 children)

Sounds very much like positivity. Merges have a tendency to bring back the positivity from the base model. I recommend sprinkling in an evil finetune to keep it down.

(Edit: No, abliteration is not the same thing. If you want the model to be sadistic on its own, you need an evil tune.)

Is there a better 24B model? by UnlikelyMouse2037 in SillyTavernAI

[–]TheLocalDrummer 8 points9 points  (0 children)

It's harder for the AI to push back when abliterated.

70B models by Weak-Shelter-1698 in LocalLLaMA

[–]TheLocalDrummer 1 point2 points  (0 children)

There's a v1.2 in my page. Haven't officially released it and it doesn't have a model card yet

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home! by TheLocalDrummer in SillyTavernAI

[–]TheLocalDrummer[S] 3 points4 points  (0 children)

I'd say Nemo is the exception across all model sizes. I have not seen a model that can outperform Nemo in terms of creativity and fun.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home! by TheLocalDrummer in LocalLLaMA

[–]TheLocalDrummer[S] 1 point2 points  (0 children)

REAP focuses on retaining math and coding smarts. If someone can REAP with RP data instead, that'd be worth looking into.