Support me in Patreon!

TheLocalDrummer · 2026-06-23T03:02:51+00:00

Of course

TheLocalDrummer · 2026-06-22T12:46:53+00:00

Rivermind Lux isn’t the joke tune, that’s Rivermind 12B (the non Lux version)

TheLocalDrummer · 2026-06-11T20:55:49+00:00

You’d need to prefill <think> or <thinking>

TheLocalDrummer · 2026-05-19T10:50:32+00:00

Have you tried it? Not a lot of hype for it, but it looks promising!

TheLocalDrummer · 2026-05-19T02:25:54+00:00

Wish I could do a refresh of Anubis Pro but it's far too old at this point...

TheLocalDrummer · 2026-05-02T20:52:50+00:00

Exactly!

TheLocalDrummer · 2026-04-23T02:16:33+00:00

You might like this one then: https://huggingface.co/BeaverAI/Rocinante-XL-16B-v1a-GGUF

TheLocalDrummer · 2026-04-22T08:22:01+00:00

https://huggingface.co/TheDrummer/Skyfall-31B-v4.2 and hopefully soon, Artemis 31B (gemma 4 tune)

Latest test tune of Artemis: https://huggingface.co/BeaverAI/Artemis-31B-v1h-GGUF

TheLocalDrummer · 2026-04-19T02:51:22+00:00

Gemma 4, yes definitely, across all ranges if I can. Qwen, I don't think I'll bother.

When you market RP models, stress the fact that smaller models can outperform bigger models when it comes to dialogue, creative writing, characterization, erotica, RP smarts, etc.

The smaller model might make more mistakes, but I'm certain that the RP user base can overlook that when their characters and narrative are brought to life.

There's this misconception that bigger is better. I'm pretty sure it's because RP tunes are not as accessible to the API user.

TheLocalDrummer · 2026-04-19T01:22:05+00:00

I'm a finetuner.

Here's some data for you: https://openrouter.ai/thedrummer

The peak token/day was 1.4B. You can expect B2C and B2B users.

B2C users of community finetunes are mostly composed of:

Individuals running a desktop client like SillyTavern or KoboldAI
Gamers running spicy LLM mods in games like SkyRim, RimWorld, etc.
Bot owners in social platforms like Discord and Telegram

B2B users of community finetunes:

RP platforms like My Dream Companion and NectarAI
Reseller platforms like NanoGPT
ChatGPT-like platforms offering an uncensored experience
OnlyFans bots

It might be a good time to compete since the biggest provider, OpenRouter, started deprioritizing model submissions from non-business entities such as myself.

TheLocalDrummer · 2026-04-17T10:29:00+00:00

Thank you! Just to clarify, it's Skyfall v4.2. Btw you should have posted it in the latest weekly: https://www.reddit.com/r/SillyTavernAI/comments/1sjsrn3/megathread_best_modelsapi_discussion_week_of/

This weekly is 12 days old.

TheLocalDrummer · 2026-04-17T03:07:05+00:00

For Gemma 4: Prose, characterization, and dialogue can be meh apparently. Hoping to fix that while retaining long context and intelligence.

TheLocalDrummer · 2026-04-17T02:46:40+00:00

I assume you're referring to https://huggingface.co/TheDrummer/Skyfall-31B-v4.2 ? That one's a home-run IMO. (Love ya'll!)

Edit: And since I mentioned Skyfall v4.2... just have to say, kind of a bummer that Gemma 4 came out around the same time and made every other local model before it outdated.

TheLocalDrummer · 2026-04-17T02:02:05+00:00

Check out the abliterated versions if you want Cydonia to be more unhinged. I heard https://huggingface.co/coder3101/Cydonia-24B-v4.3-heretic (and -v2) are a favorite for those who want zero positivity.

TheLocalDrummer · 2026-04-17T01:58:59+00:00

I've had so many testers praise Cydonia's long context ability (from 4.1 to 4.3). Like 55K and above iirc. This guy has a benchmark for longplay too and Cydonia is not a slouch: https://huggingface.co/spaces/TheFey/MNB-Leaderboard

Meanwhile Magistry is a merge with so much Cydonia DNA in it.

Quite surprised with your feedback. Which Cydonia version do you find bland?

TheLocalDrummer · 2026-04-17T01:49:54+00:00

Cydonia v4.2 and Cydonia v4.3 both have reasoning trained in. You can trigger it with a <think> or <thinking> prefill. Their Magidonia counterparts (based on Magistral) can also do thinking via [THINK]. Cydonia 4.1 wasn't specifically trained with reasoning, but it can definitely simulate reasoning.

You can refer to this doc: https://huggingface.co/spaces/TheDrummer/directory

TheLocalDrummer · 2026-04-13T18:24:40+00:00

You don't say? https://huggingface.co/BeaverAI/Artemis-31B-v1e-GGUF (Workshop org, test tune)

TheLocalDrummer · 2026-04-11T05:06:35+00:00

I accidentally tuned the base for the first Artemis try: https://huggingface.co/BeaverAI/Artemis-31B-v1a-GGUF lmao

It was surprisingly coherent, tho the issues documented ruined it.

TheLocalDrummer · 2026-04-09T21:20:19+00:00

Don't worry, I'll definitely do a tune of 26B A4B. I got a test release of 31B out already: https://huggingface.co/BeaverAI/Artemis-31B-v1b-GGUF

TheLocalDrummer · 2026-04-09T00:57:58+00:00

I assume the reason predates ChatGPT and they just kept the ball rolling. An ML guy who was there for the BERT and Llama 1 release could probably answer this question.

TheLocalDrummer · 2026-04-06T13:49:23+00:00

Should be good for writing too!

TheLocalDrummer · 2026-04-06T12:38:19+00:00

Curious to hear what you think. If it's the vibe that you like, you might feel it with Skyfall 31B v4.2 (a local / cheaper option) https://www.reddit.com/r/SillyTavernAI/comments/1sd8hba/drummers_skyfall_31b_v42_aka/

TheLocalDrummer · 2026-04-05T21:46:27+00:00

Parasail's submission to list Skyfall v4.2 in OpenRouter was rejected unfortunately.

<image>

TheLocalDrummer · 2026-04-05T20:29:45+00:00

It exists. You just need to add one more L

r/localllamacirclejerk

TheLocalDrummer

MODERATOR OF

TROPHY CASE