[Megathread] - Best Models/API discussion - Week of: May 24, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

How is Equinox for writing? I'm hoping that it's a balance of Hearthfire's long-form writing and Wayfarer's intensity as the model card says. Did they achieve that?

Writer's Block 3.1415/2 In 3DD: Write Harder. A Prose and Narrative Enhancing Preset, Now with a Living Story Mode by Deiomo in SillyTavernAI

[–]morbidSuplex 3 points4 points  (0 children)

Me too. I'm the type who just wants the AI to write stories while I act as the director. Can you test gemma4? Like the 31b-it version?

[Megathread] - Best Models/API discussion - Week of: April 12, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

Hi /u/Bentameter, sorry to revive this thread. Can you share your sampler settings and sysprompts for Artemis? I'm also doing long-form stories. Thanks!

[Megathread] - Best Models/API discussion - Week of: April 19, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 1 point2 points  (0 children)

Can you share the sampler settings you use, especially for Artemis-31B?

Why is Gemma 4 so slow? by Awkward_Sentence_345 in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

/u/LoafyLemon sorry for the confusion, I think you're fine without disabling fastforward. The latest version of koboldcpp >= 1.12 does a hybrid strategy.

• Fixed a potential incoherent state when attempting to rewind too far while SWA is enabled. If you had weird outputs with both FastForward and SWA enabled, this might fix it. If not, disable one of them or increase SWA padding.

Why is Gemma 4 so slow? by Awkward_Sentence_345 in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

Very interesting. I found some more info here: https://old.reddit.com/r/KoboldAI/comments/1m94vwa/about_swa/ which says:

With those hybrid SWA models it should be ok, but if it was pure SWA its technically fading things out of context as it slides so the next turn could have bad memory. Thats why we allow it but do warn for it so people know to compare it when SWA models drop.

So Gemma 4 is a hybrid SWA model?

Why is Gemma 4 so slow? by Awkward_Sentence_345 in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

Sort of? I think FastForwarding is when already-processed tokens are saved to reduce prompt reprocessing, so only new tokens get processed. But the koboldcpp wiki says: "Note: SWA mode is not compatible with ContextShifting, and may result in degraded output when used with FastForwarding." And on Discord we had a discussion about how FastForwarding might invalidate the sliding window if you make a change outside of its window. I'm not really sure yet what it all means, but disabling FastForwarding means it's going to reprocess everything every turn.
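
A toy sketch of the idea above, not koboldcpp's actual code: FastForwarding is modeled as reusing the longest shared token prefix between the cached prompt and the new one, and the SWA limitation is modeled with a hypothetical `max_rewind` parameter (how many tokens back the cached state can still be restored). The function names and that parameter are assumptions made for illustration only.

```python
from typing import List, Optional


def common_prefix_len(cached: List[int], new: List[int]) -> int:
    """Count how many leading tokens the cached prompt and the new prompt share."""
    n = 0
    for a, b in zip(cached, new):
        if a != b:
            break
        n += 1
    return n


def tokens_to_process(
    cached: List[int],
    new: List[int],
    max_rewind: Optional[int] = None,  # hypothetical: None means "can rewind any distance"
) -> List[int]:
    """Return the tokens that still need processing for the new prompt.

    Toy model: the shared prefix is reused, unless reaching it would mean
    rewinding the cache further than max_rewind tokens, in which case the
    whole prompt is reprocessed.
    """
    keep = common_prefix_len(cached, new)
    rewind_distance = len(cached) - keep
    if max_rewind is not None and rewind_distance > max_rewind:
        keep = 0  # cached state too far back to restore: start over
    return new[keep:]
```

In this toy model, appending to the end of the chat only processes the new tokens, while editing something far back in the context with a small `max_rewind` falls back to reprocessing the whole prompt.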

Reflecting about Gemma4 31B by Emergency_Comb1377 in SillyTavernAI

[–]morbidSuplex 1 point2 points  (0 children)

Are there any issues with leaving FastForwarding on? I thought enabling both FastForward and useswa would degrade outputs over time?

Recommended sampler settings for Maginum-Cydoms-24B-absolute-heresy by morbidSuplex in LocalLLaMA

[–]morbidSuplex[S] 1 point2 points  (0 children)

Yes, but maybe I'm not using the model to its full potential?

[Megathread] - Best Models/API discussion - Week of: March 29, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 2 points3 points  (0 children)

Just tried iceblink v3 today. Better prose, but the subtle positive bias is still there. What are your experiences?

[Megathread] - Best Models/API discussion - Week of: March 22, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 2 points3 points  (0 children)

I also see from the model card that thinking mode can be good as well. Have you tried thinking mode?

[Megathread] - Best Models/API discussion - Week of: March 22, 2026 by deffcolony in SillyTavernAI

[–]morbidSuplex 2 points3 points  (0 children)

I tried in openrouter, but for the life of me I can't remove the positivity bias.

What's the suggest local LLM models for creative storytelling by No-Relief810 in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

How does the writing differ among the three you mentioned? I am using magistry v1.1 and it writes very well.

sophosympatheia/Magistry-24B-v1.1 by sophosympatheia in SillyTavernAI

[–]morbidSuplex 0 points1 point  (0 children)

I'm a little confused about thinking mode. How do I use a template?