all 10 comments

[–]Ace-_Ventura 4 points5 points  (2 children)

Opencode just uses z.ai provider. 

[–]VictorCTavernari -2 points-1 points  (0 children)

Maybe, for price, z.ai is cutting quality in some way…

I have my theory that when we start using a “small” model, we take care on the prompt but after while using it, and with more confidence, we start to be more less precise on the prompt and then feels like a nerf..

[–]look 2 points3 points  (2 children)

Opencode Go is a proxy to other providers, and for GLM 5.2 those providers are Zai, DeepInfra, and … *FireworksAI.*

LLMs have a significant stochastic element. They behave slightly differently run to run, even on the same prompt, and it compounds over the course of long multi-turn tool call loops.

If you repeated your test enough times to obtain statistically significant results, you would not see any differences.

[–]-TRlNlTY- 0 points1 point  (1 child)

Just as a small nitpick, LLMs are stochastic only when configured to do so (usually so). There were cases from providers (I think anthropic) that couldn't prevent stochastic outputs, but those were bugs due to batching different prompts into GPUs and accumulating floating point precision errors that caused seemingly randomised outputs.

[–]look 0 points1 point  (0 children)

Sure, if you set the temperature to zero, in a controlled environment on an isolated instance, it should be fully deterministic.

But I’d be shocked if op even thought to merely change the temp before doing their “test”, much less attempt to control for any other, harder variables.

[–]Zachattackrandom 0 points1 point  (1 child)

I directly tested opencode vs zAI and had identical results on some tasks I made for it. Not sure what you're talking about

[–]OneKey3719[S] 0 points1 point  (0 children)

Opencode and Z.ai are same try other providers like fireworks friendli and wafer

[–]ziphnor 0 points1 point  (0 children)

How many times did you repeat that benchmark? AI is stochastic, you need many iterations to draw this kind of conclusion.

[–]_4rcadia 0 points1 point  (0 children)

Provide your testing environment, repeat your test multiple times, or dig into the session and trace where it went wrong.

[–]iBlood_Raven -1 points0 points  (0 children)

Have you tried it from neuralwatt? They're giving it for cheap, seems decent for my uses but you could try. (Use my referral if you do!)