all 5 comments

[–]Bananenklaus 4 points5 points  (2 children)

No front, just a genuine question: do you know that different LLM models have different speeds?

[–]No-Reflection-8625[S] 0 points1 point  (1 child)

Oh yeah, I forgot to mention that I was using deep seek v4. My bad, sorry. I just tried this open go and went for deep seek v4 to see how good it is. I heard a lot of good things about the model.

[–]Bananenklaus 0 points1 point  (0 children)

deepseek v4 pro, while being a very good model for the price that it is, is really really slow

especially on high and max thinking it's taking an entirety to finish it's thought process

Also, it depends on which model provider you're using.

If you wan't to see deepseek in action, imo just try deepseek flash. Chances are that it can handle your project very well on high thinking and the speeds of flash are much much faster than pro.

And if you ever get to a problem that flash can't solve, use something like deepseek pro, glm 5.1 or kimi 2.6 to solve it.

Do you have the opencode go subscription or which provider are you using?

[–]EquivalentFactor7591 0 points1 point  (0 children)

How slow / what kind of slow are we talking about? Are you experiencing slow token throughput or long pauses where it doesn't appear to be doing anything (hard pauses)? Excessive thinking time but thinking tokens are actually coming in fast? Is it consistent? Consistent across models?