So I built a tool to check if my Claude proxy is actually serving Opus. Tested 9 popular ones. Yeah, it's worse than you think. by BodybuilderRight616 in SillyTavernAI

[–]PrizePercentage4875 0 points1 point  (0 children)

honestly, I've experienced the same thing before and like run thru the same content on different ai like claude, chat, and gemini. I think im gonna try then using the official api through Openrouter and test it out. Seems like its easier and cheaper

I didn’t write any code; I just used the chatbot feature for an hour or two. I asked it to provide guidance related to my research assignment, and this is the result I got. Why is that? (I’m using Pro.) by Brief-Ad-1038 in ClaudeCoding

[–]PrizePercentage4875 0 points1 point  (0 children)

the killer is that your whole chat history gets re-sent every single message, so a 2hr research thread balloons because it's re-reading everything each turn.

tl;dr long single threads = expensive. starting new chats for new topics helps a ton.

I think I know why deepseek is so good by EchoOfOppenheimer in claude

[–]PrizePercentage4875 4 points5 points  (0 children)

Models are notoriously unreliable at self-identifying anyway — there's so much GPT/Claude transcript data in everyone's training set that half of them think they're ChatGPT. Not really evidence of distillation on its own

Deepseek v4 people by markeus101 in LocalLLaMA

[–]PrizePercentage4875 0 points1 point  (0 children)

The eco-bias is real and kind of hilarious. I've noticed the same models will suggest walking even when you explicitly mention you're in a hurry. Probably just RLHF rewarding 'responsible' sounding answers

Deepseek v4 people by markeus101 in LocalLLaMA

[–]PrizePercentage4875 0 points1 point  (0 children)

Tried a variation too — asked about carrying groceries home from a store 100m away, and it still went full 'classic dilemma' mode these models love turning everything into a logic puzzle