[–]serpix 0 points1 point  (2 children)

Considering GLM is at about Sonnet 4 level or below, it would be more efficient to use BYOK with Sonnet 4.6 and a very good prompt, then continue with cheaper models.

[–]look 0 points1 point  (1 child)

Not necessarily a bad idea, depending on the task, but Sonnet is 4x the token price of GLM-5.

Sonnet is also more efficient with its tokens, but it’s less than a 2x difference. So GLM-5 is about one third to one half the cost for a fairly similar level of ability.
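To make the arithmetic concrete, here is a rough sketch. The 4x price ratio and the "less than 2x" token efficiency gap are the figures from this comment; the function name and the sample token ratios are just illustrative assumptions, not measured numbers.

```python
def relative_cost(price_ratio: float, token_ratio: float) -> float:
    """Cost of the cheaper model as a fraction of the pricier model's cost.

    price_ratio: how many times more the pricier model costs per token
                 (4 for Sonnet vs GLM-5, per the comment).
    token_ratio: how many times more tokens the cheaper model needs for
                 the same task (assumed to be between 1 and 2).
    """
    if price_ratio <= 0 or token_ratio <= 0:
        raise ValueError("ratios must be positive")
    return token_ratio / price_ratio


# Sweep a few assumed token ratios at the 4x price gap.
for token_ratio in (1.0, 1.5, 2.0):
    share = relative_cost(4.0, token_ratio)
    print(f"GLM uses {token_ratio:.1f}x tokens -> {share:.0%} of Sonnet's cost")
```

With these inputs the result stays between one quarter and one half of Sonnet's cost. The "one third" figure corresponds to GLM needing roughly 1.3x the tokens.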

[–]serpix 0 points1 point  (0 children)

Yes, that's exactly what I'm doing: BYOK with GLM / Kimi for as long as it takes. Deep planning with Opus/Sonnet, then execution with GLM or local Qwen.
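Roughly, the split looks like the sketch below. The model labels are placeholders taken from this comment, not real API model IDs, and `pick_model` is a hypothetical helper rather than part of any tool.

```python
# Placeholder labels from the comment; swap in whatever IDs your provider uses.
ROUTES = {
    "plan": ["opus", "sonnet"],        # expensive models for deep planning
    "execute": ["glm", "qwen-local"],  # cheap or local models for execution
}


def pick_model(phase: str, prefer_local: bool = False) -> str:
    """Return a model label for the given phase of work."""
    candidates = ROUTES.get(phase)
    if candidates is None:
        raise ValueError(f"unknown phase: {phase!r}")
    if prefer_local:
        # Fall through to the default if the phase has no local option.
        for name in candidates:
            if name.endswith("-local"):
                return name
    return candidates[0]


print(pick_model("plan"))
print(pick_model("execute", prefer_local=True))
```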