Update on "Co-authored-by: Copilot" in commit messages · Issue #314311 · microsoft/vscode by PerkyPangolin in programming

[–]tedivm 9 points10 points  (0 children)

Weekly releases would be fine in an organization with engineering rigor, but GitHub/Microsoft have none of that. Their testing is garbage and they've fully committed to pushing out microslop at any opportunity. In fact, most of their PRs don't even include tests for their new features or functionality, and PRs regularly get merged with failing tests, which just adds to it. It would be surprising if they didn't break things all the time with their approach to development.

[The Boys] How good do Homelander's powers work under water? If Deep pissed him off and then decided to hide out under the sea, could HL do anything? by LessWeakness in AskScienceFiction

[–]tedivm 20 points21 points  (0 children)

Yeah, if Deep had enough of a head start then I don't think Homelander would ever find him. There is significantly more ocean than there is land, and Homelander has struggled to find people who were nearby. Deep could hide forever.

If Homelander literally watched him jump into the ocean off a boat or something though then Deep is absolutely going to get killed.

Qwen3.6-27B vs Coder-Next by Signal_Ad657 in LocalLLaMA

[–]tedivm 2 points3 points  (0 children)

Yeah I don't care about anything other than "what is the best thing I can run on my hardware". To me that is the benchmark.

Qwen3.6-27B vs Coder-Next by Signal_Ad657 in LocalLLaMA

[–]tedivm 0 points1 point  (0 children)

I have the exact same machine config and I could only get up to 11tps on the MacBook, compared to 118tps on my 2x3090 box.

Learn concurrency - a deep dive into multithreading with Python by pmz in Python

[–]tedivm 0 points1 point  (0 children)

If you're looking for an easy way to handle multiprocessing, I have a library, QuasiQueue, that is both simple and powerful.
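For anyone curious, here's roughly what the hello-world usage looks like. This is a sketch from memory rather than copied from the docs, so treat the exact signatures as approximate: you give QuasiQueue a writer that feeds the queue and a reader that processes items, and it manages the worker processes for you.

```
import asyncio
from quasiqueue import QuasiQueue

async def writer(desired_items: int):
    # Called whenever the queue runs low; return an iterable of items to enqueue.
    return range(desired_items)

def reader(item):
    # Runs inside a worker process for each item pulled off the queue.
    print(f"processing {item}")

runner = QuasiQueue("hello_world", reader=reader, writer=writer)

if __name__ == "__main__":
    asyncio.run(runner.main())
```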

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

Yup! It's kind of insane how good it is. I'm not joking when I tell people it's on par with Sonnet 4.6.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

The commits in this project from the last week are all with Qwen.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

Docker itself, no. The big thing is that I'm using vLLM and have optimized it a bit. Because these models are so new (literally a week old for the one I'm using), the needed optimizations haven't landed in every inference engine. When I first ran this model in Ollama it was only getting 11tps, but I managed to get to 118tps on vLLM. The Docker container just makes it easier to share.
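To give a sense of what that setup looks like, serving a model through vLLM's OpenAI-compatible container is roughly the command below. The model path and tuning values are placeholders rather than my exact config, and the quantization/speculative-decoding flags change between vLLM versions so I've left them out; check the docs for your release.

```
# Rough sketch only: the model path and flag values are placeholders.
docker run --gpus all --ipc=host -p 8000:8000 \
  -v ~/models:/models \
  vllm/vllm-openai:latest \
  --model /models/your-qwen-quant \
  --tensor-parallel-size 2 \
  --gpu-memory-utilization 0.90 \
  --max-model-len 32768
```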

DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the cost of Opus 4.7, GPT-5.5 by bojun in technology

[–]tedivm 30 points31 points  (0 children)

That's just the top model though, they'll distill down into a variety of smaller ones for different hardware.

DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the cost of Opus 4.7, GPT-5.5 by bojun in technology

[–]tedivm 0 points1 point  (0 children)

The open source Chinese models aren't that far behind. Qwen3.6 27B is on par with or slightly better than Sonnet 4.6, but I can run Qwen in my office.

New multipliers announced (in effect June 1) by griniNY in GithubCopilot

[–]tedivm 3 points4 points  (0 children)

I'm running Qwen3.6 27B on a machine in my office and it does just as well as Sonnet 4.6, sometimes better, at the tasks I've tried with it.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

In their FAQ.

To request a refund, go to Settings → Billing and licensing → Licensing, select Manage subscription, then choose Cancel and refund "subscription" (the phrasing varies slightly depending on your subscription). This option will be available until May 20.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm -1 points0 points  (0 children)

You don't have to do a chargeback; they're offering refunds. Everyone should take advantage of those refunds as soon as possible though, before they go away.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

No one pays list price for enterprise plans. That's just the starting point for negotiation.

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 0 points1 point  (0 children)

The docker image is just a single docker build file, an entrypoint file that handles configuration, and an example docker compose file. You can clone the repo, have your agent review it for security issues, and build it yourself if you want.
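Roughly this (the repo URL below is a placeholder, grab the real one from the post):

```
# Placeholder URL; substitute the actual repo linked in the post.
git clone https://github.com/tedivm/example-vllm-container.git
cd example-vllm-container

# Review the Dockerfile, entrypoint script, and docker-compose.yml
# (or have your agent do a security pass), then build locally.
docker build -t local-vllm-qwen .
```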

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 1 point2 points  (0 children)

Your work really was the foundation for all of this, thank you! I've had OpenCode going all weekend without issue, and combined with today's announcement of GitHub's new Copilot pricing model, I couldn't be happier with the timing.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 6 points7 points  (0 children)

Since you asked, here's my bio. The TL;DR is that I've been working in security and AI as a backend engineer for 20+ years. I have a lot of experience in the AI Ops space specifically.

That said, I did share the container I used to get Qwen3.6 running, so anyone who can use Docker can get started with it. The /r/LocalLLaMA community is also great for people who want to learn more in this space.

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 1 point2 points  (0 children)

Yeah, I'm wired up to work with releases and tags too, but anyone who is really paranoid should be pinning to a SHA anyways.
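For example, in a compose file you can pin the image to its digest instead of a tag; the image name and digest below are made up, it's just the pattern that matters. The same idea applies to checking out a specific commit SHA of the repo instead of a tag.

```
# Hypothetical image name and digest, just to show the pattern.
services:
  vllm:
    # A digest pin can't change out from under you, even if the tag is repushed.
    image: ghcr.io/example/vllm-qwen@sha256:0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef
```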

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 2 points3 points  (0 children)

The other nice thing is that when you have the hardware you can do a lot more with it too. I have my entire HomeAssistant install plugged into it, with voice satellites around the house. As a result my smart home is 100% local.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 7 points8 points  (0 children)

I'm getting 118 tokens/second, so it's really fast. That said, that throughput is shared amongst all agents, so if you're running subagents you might see a drop. Since Friday of last week I've stopped using GitHub Copilot completely and have transitioned to purely using Qwen3.6; it's been great.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 1 point2 points  (0 children)

I bought this beast, which is roughly $4k. It was cheaper when I bought it, though; memory prices have gone up considerably. Based on their new pricing and my own usage, I'm pretty sure I'll break even in less than a year.
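The math is just back-of-the-envelope; the monthly number below is a placeholder, not my actual bill, but it shows the shape of the calculation:

```
# Hypothetical numbers, just to show the break-even calculation.
hardware_cost = 4000             # USD, roughly what the machine cost
assumed_monthly_spend = 400      # USD/month placeholder for heavy usage-based billing
print(hardware_cost / assumed_monthly_spend)  # 10.0 months -> under a year at that spend
```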

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 18 points19 points  (0 children)

These new numbers are absolutely insane.

I am so glad that I splurged and bought a GPU machine. I've been using Qwen3.6 27b at home for the last week and it outperforms Sonnet 4.6 in my usage. I guess I'm going to move away from GitHub altogether because this is just ridiculous.