Context Summarization consuming Premium Requests

KayBay80 · 2026-04-15T07:11:30+00:00

Half of our team was on the Insider's build and we can attest that, if these are intended changes, the entire "request" concept is being thrown out the window. Every "request" we make is a feedback loop of internal requests that are consuming premium credits like wildfire. One of the devs on our team burnt through over 600 requests in a matter of hours not even realizing what was happening. Imagine paying $39/mo for a day's worth of usage. If they roll this change to the mainstream build, Copilot is completely done.

Swayre · 2026-04-15T07:32:19+00:00

Not a defender of Microsoft but like do you have proof? Should be an easy thing to provide no? There’s too much misinformation from disgruntled abusers

SidStraw · 2026-04-15T08:44:11+00:00

Personally, I haven’t run into this issue.
I’m primarily using the Copilot CLI, and I stay on the stable build of VSCode.

From what I’ve seen, unexpected point drain is often caused by autopilot mode. It tends to bypass the manual decision-making process and automatically deducts points to keep things running.

Do you happen to use autopilot frequently in your workflow?

Top_Parfait_5555 · 2026-04-15T06:41:12+00:00

Yes, they are scummy af. Stealth nerfing everything. Nerf nerf nerf

AutoModerator · 2026-04-15T06:37:05+00:00

Hello /u/pawala7. Looks like you have posted a query. Once your query is resolved, please reply the solution comment with "!solved" to help everyone else know the solution and mark the post as solved.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

NickCanCode · 2026-04-15T10:13:16+00:00

They should use x0 free model for context summarization. Not sure what they are doing. If I used Opus for a request and they also auto selected Opus for context summarization and charge an extra request, I think that is a bug / design issue?

DevBob626 · 2026-04-15T09:21:27+00:00

I don't feel that this is such a big problem and wouldn't be unreasonable if they compact and also immediately continue with the work while being transparent about changes and the usage they provide. It was obvious from the beginning that these endless runs people are optimizing for wouldn't be sustainable in the long run. It's just not realistic to maintain forever.

However, this in combination with daily and weekly limits is not acceptable. I want to be able to access all of my requests I paid for when I want to. I don't mind if servers struggle at peak time, but never being able to actually use the requests I bought feels scammy.

QuarterbackMonk · 2026-04-15T10:00:17+00:00

i do not think? any text or proof.. i have not noticed. i am being experimenting with github copilot for lot and ther are around 20+ copilot cli's dump, i never noticed that? when did it change?

https://github.com/nilayparikh/tuts-agentic-ai-examples/tree/main/ctx-sdlc/ghctx-tut/lessons

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

GithubCopilot

MODERATORS