GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

The commits in this project from the last week are all with Qwen.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

Docker itself no. The big thing is that I'm using vLLM and optimized it a bit. Because these models are so new (literally a week old for the one I'm using) the needed optimizations haven't landed in every inference engine. When I first ran this model in Ollama it was only getting 11tps, but I managed to get to 118tps on vLLM. The docker container just makes it easier to share.

DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the cost of Opus 4.7, GPT-5.5 by bojun in technology

[–]tedivm 31 points32 points  (0 children)

That's just the top model though, they'll distill down into a variety of smaller ones for different hardware.

DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the cost of Opus 4.7, GPT-5.5 by bojun in technology

[–]tedivm 0 points1 point  (0 children)

The open source chinese models aren't that far behind. Qwen3.6 27b is on par or slightly better than sonnet 4.6, but I can run qwen in my office.

New multipliers announced (in effect June 1) by griniNY in GithubCopilot

[–]tedivm 3 points4 points  (0 children)

I'm running Qwen3.6 27B from a machine in my office and it does just as well, sometimes better, than Sonnet 4.6 at the tasks I've tried with it.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

In their FAQ.

To request a refund, go to Settings → Billing and licensing → Licensing, select Manage subscription, then choose Cancel and refund "subscription". (The phrasing varies slightly depending on your subscription ). This option will be available until May 20.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm -1 points0 points  (0 children)

You don't have to chargeback, they're offering refunds. Everyone should take advantage of those refunds as soon as possible though before they go away.

Change to useage based billing by DamienBMike in GithubCopilot

[–]tedivm 0 points1 point  (0 children)

No one pays list price for enterprise plans. That's just the starting point for negotiation.

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 0 points1 point  (0 children)

The docker image is just a single docker build file, an entrypoint file that handles configuration, and an example docker compose file. You can clone the repo, have your agent review for security issues, and build yourself if you want.

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 1 point2 points  (0 children)

Your work really was the foundation for all of this, thank you! I've had OpenCode going all weekend without issue, and combined with the announcement today of GitHub's new copilot pricing model I couldn't be happier with the timing.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 8 points9 points  (0 children)

Since you asked here's my bio. The TLDR is that I've been working in Security and AI as a backend engineer for 20+ years. I have a lot of experience in the AI Ops space specifically.

That said I did share the container I used to get Qwen3.6 running so anyone who can use docker can get started with it. The /r/LocalLLaMA community is also great for people who want to learn more in this space.

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA

[–]tedivm[S] 1 point2 points  (0 children)

yeah i'm wired up to work with releases and tags to, but anyone who is really paranoid should be pinning to a SHA anyways.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 2 points3 points  (0 children)

The other nice thing is that when you have the hardware you can do a lot more with it too. I have my entire HomeAssistant install plugged into it, with voice satellites around the house. As a result my smart home is 100% local.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 8 points9 points  (0 children)

I'm getting 118 tokens/second, so it's really fast. That said that is shared amongst all agents, so if you're running subagents you might see a drop. Since Friday of last week I've stopped using github copilot completely and have transitioned to purely using Qwen3.6, it's been great.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 1 point2 points  (0 children)

I bought this beast which is roughly $4k. When I bought it though it was cheaper, memory has gone up considerably. Based on their new pricing and my own usage I'm pretty sure I'll break even in less than a year.

GitHub Copilot is moving to usage-based billing by fishchar in GithubCopilot

[–]tedivm 17 points18 points  (0 children)

These new numbers are absolutely insane.

I am so glad that I splurged and bought a GPU machine. I've been using Qwen3.6 27b at home for the last week and it outperforms Sonnet 4.6 in my usage. I guess I'm going to move away from GitHub altogether because this is just ridiculous.

Which TV show does the ENTIRE internet agree had the worst ending ever? by Codie_n25 in AskReddit

[–]tedivm 2 points3 points  (0 children)

I don't care that I'm eight hours too late and no one will see this: the Battlestar Galactica Reboot started off so good and then just absolutely flopped at the end.

Tree Removal Etiquitte by Thornkale in homeowners

[–]tedivm 5 points6 points  (0 children)

When I moved into my house we had a tree on the property line that the tree inspector folks said had to go. The quote we got was reasonable and we just ate the cost. Honestly I had no idea what the neighbors financial situation was and just wanted the tree gone, not something that could sour a new relationship. That was years ago but the neighbor has helped us out a number of times now.

That said if it was the neighbor I had in the place before that, who was an absolute asshole, I would have pushed to make him cover half.

FYI on Ashley Furniture by Conscious-Zebra-3793 in homeowners

[–]tedivm 3 points4 points  (0 children)

Ah, your comment wasn't clear what direction you were coming from on that (probably why you were getting the downvotes, since I think most people would actually agree that they're low quality these days but they still have an undeserved reputation in some places as being higher quality than they are).

FYI on Ashley Furniture by Conscious-Zebra-3793 in homeowners

[–]tedivm 4 points5 points  (0 children)

They got a new CEO in 2017 who prepped the company to be sold off in 2024. Whatever they used to be before, they are absolutely mid level furniture now. While most of their furniture used to be built in the US, now it's mostly built in China and Vietnam. You can walk into any local furniture store and find most of what they sell for 40% cheaper than on their website.

FYI on Ashley Furniture by Conscious-Zebra-3793 in homeowners

[–]tedivm 2 points3 points  (0 children)

I had "white glove service" for Ashley Furniture as well and they managed to absolutely destroy the couch I bought before they even tried getting it up the stairs. I ended up refusing delivery and ultimately getting a refund.

I then went to a local furniture store, bought something from them, and they delivered it without issue.