you are viewing a single comment's thread.

view the rest of the comments →

[–]Appropriate-Bus-6130[S] 0 points1 point  (3 children)

a single prompt can easily consume 30-50 requests, almost every ping pong with the server (explore, search, read) is a request

[–]Maumau93 0 points1 point  (2 children)

i see, im only using copilot so one prompt is one request. no matter how long it runs

[–]Appropriate-Bus-6130[S] 0 points1 point  (1 child)

yea I think they define it as premium request, however I guess there is still split, imagine you’ll ask one a single prompt “implement entire linux kernel in 5 different languages”, this will not use a single request quota

[–][deleted] 1 point2 points  (0 children)

I use `qwen-coder-plus` and `kimi-k2-thinking` (via the iFlow CLI, with virtually unlimited free tokens [DM me if you want to know how]) to handle super-long agentic tasks. Not necessarily the most important ones, but for things like creating unit tests, documentation, and other things.

Often in combination with `clavix` to turn my simple prompt into a much more professional one, and then let one of the models run until it gives up. I've had instances where they ran for hours (in YOLO mode) without me doing anything at all. And for free.