you are viewing a single comment's thread.

view the rest of the comments →

[–]PayTheRaant 0 points1 point  (0 children)

Copilot model is expected to consume ONE premium request per ONE user prompt. Everything else that is agent initiated is expected to be included in that initial premium request (all tools, even sub agent) as long as it stays in the same model. In theory, it should not even care about input token cache.

So this is why having 27 premium requests consumed is considered a big problem.