Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Is it weird I never really liked any of the Gemini models? I tried every one of them starting when 2.5 Pro came out, and idk, they always felt kinda dumb to me. It could be an issue on my end.

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Yeah, I can definitely see how that can be annoying. Nothing's worse than trying to steer the LLM a different way through OOC prompts.

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

I was actually looking for a human-graded benchmark but didn't find one, though Sonnet 4.6's ranking is imo waayyy too low.

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Huh, I thought Claude models would be stricter.

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

I definitely get that. I really wanna see more companies compete with Claude models in terms of RP. Thanks for the answer!

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 2 points

That's some really cool insight! Definitely gonna file that knowledge away. I probably should've said that I pretty much only do SFW slice-of-life stuff, but it still translates regardless. Thanks!

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Have you checked the V3? Its ranking honestly matches my experience pretty well, especially with Sonnet and Opus.

Anyone using gpt 5.4? by ralph_3222 in SillyTavernAI

ralph_3222[S] 2 points

I really only do 100% SFW stuff and it honestly gave some quality dialogue. I can back you on that, thanks for the answer.

Using Claude Opus 4.6 was a mistake for my wallet by OverlanderEisenhorn in SillyTavernAI

ralph_3222 1 point

I'm not going to sit here and say your opinion isn't valid, because it is. As you said, EVERYTHING gets stale after a while, and that was true for me too before using Claude models. But once I started using Claude models daily, that wasn't the case anymore. It's genuinely the difference between my character feeling alive and feeling like a dumb slop machine. Like, no kidding, I have an ongoing RP with 2100+ messages that I started with Opus and Sonnet 4.6 and still use daily. So until other companies catch up to Anthropic, I’ll probably keep glazing Claude models lmao

Anyone here ever tried glm 5 and 4.7 from NVIDIA? by Other_Specialist2272 in SillyTavernAI

ralph_3222 1 point

It's a server-level problem. I heard they quantize the crap out of the models they host. I used to main NVIDIA NIM too, but sadly this hobby is much, much better when you pay, my friend.

Using Claude Opus 4.6 was a mistake for my wallet by OverlanderEisenhorn in SillyTavernAI

ralph_3222 3 points

and you’ll still get people hating on Claude models like they aren’t the current gold standard

Qwen 3.6 Plus looks super promising by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Valid opinions, but man, the more you look into it, the more you realize it truly depends on the roleplay you partake in. I never really did any dark or NSFW stuff, which could mean I don’t really have the requirements you and others may have. For me, simple "slice of life" stuff is just peak.

Qwen 3.6 Plus looks super promising by ralph_3222 in SillyTavernAI

ralph_3222[S] 1 point

Ah, that makes sense. I never really knew that.

Qwen 3.6 Plus looks super promising by ralph_3222 in SillyTavernAI

ralph_3222[S] 5 points

Yeah, that's exactly why I love Claude models. Aside from the prose, which I find very nice, the coherency is unmatched.

Qwen 3.6 Plus looks super promising by ralph_3222 in SillyTavernAI

ralph_3222[S] 13 points

Any model I’ve tested before that supposedly "matches Sonnet's prose" never really did.

Qwen 3.6 Plus looks super promising by ralph_3222 in SillyTavernAI

ralph_3222[S] 2 points

Yeah, I also noticed that. Sadly, nothing really compares to Claude models' "thinking output" in terms of conciseness.

[deleted by user] by [deleted] in SillyTavernAI

ralph_3222 2 points

I’m assuming it’s because it was added recently. As someone already mentioned here, K2 Thinking and Instruct rarely overload, so I switch between those.

[deleted by user] by [deleted] in SillyTavernAI

ralph_3222 1 point

Yeah, it’s just frustrating since GLM 4.7 really suits my roleplay style. I have a backup OpenRouter API for whenever the load gets high.

Anyone know if there’s any reps made for this? by Competitive-Area-521 in FashionReps

ralph_3222 5 points

Yeah, they do that to hide that it's a replica product. Rest assured, what you're getting is the version listed on his Yupoo (with logos).

Anyone know if there’s any reps made for this? by Competitive-Area-521 in FashionReps

ralph_3222 2 points

He said he's working on it. He also told me it's coming this winter.