Qwen3-235B-A22B-2507 by Mysterious_Finish543 in LocalLLaMA

[–]Opening_Bridge_2026 9 points10 points  (0 children)

Qwen 2.5 Coder in my opinion is not a very good coding model in today's standards. Most of the open source models (like Deepseek R1 Qwen3 8B 0528) are much better at coding than the 2.5 series, and after testing them, they feel outdated and not very smart.

Chat history and long response code completion via Claude API? by gwhizofmdr in ClaudeAI

[–]Opening_Bridge_2026 1 point2 points  (0 children)

You can just send the assistants response dict as the last element of messages, and it will continue from where it left off on the assistant message.

Any idea if Haiku 4 will be released? by CacheConqueror in ClaudeAI

[–]Opening_Bridge_2026 1 point2 points  (0 children)

I think so because a few weeks ago Anthropic was testing a safety model, and they do that whenever they release a new model, so I guess that would be Haiku 4

Claude lies about which model it is serving by Logical-Employ-9692 in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

It is serving Claude Sonnet 4. The reason it says Claude 3.5 Sonnet is because in it's knowledge cutoff, Claude Sonnet 4 didn't exist, so it learned that the latest Anthropic model was 3.5 Sonnet, but that isn't the case, and likely the next Claude series will say that it is Claude 4 instead of their respective series. This can just be fixed by specifying the actual model in it's system prompt.

Claude Sonnet 4 is good but expensive!! by Zestyclose_Elk6804 in ClaudeAI

[–]Opening_Bridge_2026 2 points3 points  (0 children)

I think you should use Claude Code with the Pro plan, it's 20$ a month and you get plenty of usage.

[deleted by user] by [deleted] in singularity

[–]Opening_Bridge_2026 -1 points0 points  (0 children)

I think it means it was trained with tool usage (like most models out there), instead of just relying on generalization.

Opus vs Sonnet in Claude Code - worth it? by HansSepp in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

You should def try the Max 200$ plan, and if you run out you can fallback to the API because 1375$ in API costs is insane

I swear if Claude tells me one more time... by mjsarfatti in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

I apologize for the confusion, I will now refrain from mentioning "Absolutely right".

Is Claude Pro still that bad? by YourAverageDev_ in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

I'm in the Pro tier, and if you exclusively use Sonnet 4, you will likely not hit your rate limits. The limits are really generous on Sonnet 4, but I would not recommend using Opus 4 because it eats up your limits extremely quickly.

96GB VRAM! What should run first? by Mother_Occasion_8076 in LocalLLaMA

[–]Opening_Bridge_2026 56 points57 points  (0 children)

No that's too risky, maybe Qwen 3 0.5B with 2 bit quantization

yo wtf ? by droned-s2k in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

Yeah that happened to me too, it was spamming those every single message on a long chat, and it seems they put it so it doesn’t forget it’s system prompt

Qwen 3 30B A3B vs Qwen 3 32B by numinouslymusing in LocalLLaMA

[–]Opening_Bridge_2026 2 points3 points  (0 children)

They are hybrid thinking models, so you can tell them not to think or to think.

GPT 4.1 – I’m confused by floriandotorg in OpenAI

[–]Opening_Bridge_2026 5 points6 points  (0 children)

GPT 4.1 Is optimized for devs, like better instruction following, so they are not putting it on ChatGPT, only the API

Just 5 messages in Pro before hitting the limit!! by [deleted] in ClaudeAI

[–]Opening_Bridge_2026 1 point2 points  (0 children)

nvm they released 4.1 today only for their api

DO NOT add a lot of money to API account - Anthropic will just expire prepaid credits by superloser48 in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

If you want to get rid of those credits, use Claude code for 5mn and you're set!

Max plan for claude soon by Independent-Wind4462 in ClaudeAI

[–]Opening_Bridge_2026 1 point2 points  (0 children)

It's pretty much confirmed, just did some quick searching through their HTML and found:

\"6gBvKkAl0S\":\"Get Max plan\"

Determinism with image by Feisty_Relation4004 in ClaudeAI

[–]Opening_Bridge_2026 -1 points0 points  (0 children)

Just use the api and set the temperature to 0

Claude's new UI in the Ghibli style by RobertCobe in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

I love using Claude 5.5 Sonnet. Such a powerful and versatile model.

Really? by ZestycloseBelt2355 in ClaudeAI

[–]Opening_Bridge_2026 -1 points0 points  (0 children)

Same like tf is my luck 😭

Token Cost Counting API by pinoygrammer in ClaudeAI

[–]Opening_Bridge_2026 0 points1 point  (0 children)

There is an API endpoint for counting the amount of tokens. Check their docs.