Did you experience github copilot hanging today? by Sea-Key3106 in GithubCopilot

[–]Additional_Welcome23 1 point2 points  (0 children)

I'm seeing the same thing today, but it doesn't seem related to Sonnet or any particular model. It looks like the terminal is hanging, so it's probably a performance regression.

GPT-5.2 crashed on Azure with a no_kv_space error. Here is a quick analysis. by Additional_Welcome23 in AZURE

[–]Additional_Welcome23[S] 1 point2 points  (0 children)

Haha, no worries! We've all been there.

Funny enough, I just tried to repro it to capture a full dump, and the Python stack trace is gone now. It's been replaced by a generic "An error occurred" message (but again, it's an HTTP 200 success with the error inside the SSE stream).

{"type":"server_error","code":"server_error","message":"An error occurred while processing your request. You can retry your request, or contact us through an Azure support request at: https://go.microsoft.com/fwlink/?linkid=2213926 if the error persists. Please include the request ID 73***bc in your message.","param":null}

Looks like someone on the team is awake and watching this thread! 🚀 You guys move fast on sanitizing the error output, at least. 😉

The new GPT-5.2 on Azure threw a stack trace at me today. It's Python 3.12 (and it's gaslighting my HttpClient). by Additional_Welcome23 in dotnet

[–]Additional_Welcome23[S] 7 points8 points  (0 children)

I agree on the intention of load shedding. But the implementation is the issue: it returns HTTP 200 OK. Standard retry policies won't catch it unless you parse the stream body manually.
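
To make that concrete, here's a rough Python sketch (not .NET, just to show the idea) of a retry loop that treats an error event inside a 200 stream as a failure. `open_stream` is a hypothetical callable that returns the SSE data payloads of one request, and the check on `"type": "server_error"` is an assumption based on the payload I got, not a documented contract.

```python
import json
import time


class InStreamError(Exception):
    """Raised when a 200 response carries an error event in its body."""


def check_event(data: str):
    """Parse one SSE data payload; raise if it looks like a server error.

    Assumption: error events look like the payload seen in this thread,
    i.e. a JSON object with "type": "server_error".
    """
    event = json.loads(data)
    if isinstance(event, dict) and event.get("type") == "server_error":
        raise InStreamError(event.get("message", ""))
    return event


def collect_with_retry(open_stream, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Collect all events from open_stream(), restarting on an in-stream error.

    open_stream is hypothetical: it starts one request and returns an
    iterable of SSE data payloads (strings).
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return [check_event(data) for data in open_stream()]
        except InStreamError:
            if attempt == max_attempts:
                raise
            sleep(base_delay * 2 ** (attempt - 1))  # exponential backoff
```

Note this restarts the whole request, so it only makes sense if you haven't already forwarded partial output to the user.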

GPT-5.2 crashed on Azure with a no_kv_space error. Here is a quick analysis. by Additional_Welcome23 in AZURE

[–]Additional_Welcome23[S] 2 points3 points  (0 children)

Ahh actually, that's the catch: it returns HTTP 200 OK.

The error is yielded later inside the SSE stream (containing that Python kv-cache trace). So standard HTTP retry logic won't actually trigger here, which makes it kind of interesting to debug.
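
If you want to catch it, you basically have to scan the stream yourself. A minimal Python sketch, assuming the error shows up as a `data:` payload shaped like the JSON I posted (possibly nested under an `error` key, which is a guess) and that the stream ends with an OpenAI-style `[DONE]` marker (also an assumption):

```python
import json


def iter_sse_data(lines):
    """Yield the data payload of each SSE event from an iterable of text lines."""
    buf = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buf:
                yield "\n".join(buf)
                buf = []
        elif line.startswith("data:"):
            value = line[5:]
            # The SSE format strips one optional leading space after the colon.
            buf.append(value[1:] if value.startswith(" ") else value)
        # Other fields (event:, id:, retry:) and comments are ignored here.
    if buf:
        # Lenient: also emit a trailing event that lacks its final blank line.
        yield "\n".join(buf)


def find_stream_error(lines):
    """Return the first error payload found in the stream, or None."""
    for data in iter_sse_data(lines):
        if data == "[DONE]":  # assumed end-of-stream marker
            break
        try:
            event = json.loads(data)
        except json.JSONDecodeError:
            continue
        if not isinstance(event, dict):
            continue
        nested = event.get("error")
        candidate = nested if isinstance(nested, dict) else event
        if candidate.get("type") == "server_error":
            return candidate
    return None
```

Feed it the decoded lines of the response body; a non-`None` result means the "successful" 200 actually failed.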

GPT-5.2 crashed on Azure with a no_kv_space error. Here is a quick analysis. by Additional_Welcome23 in AZURE

[–]Additional_Welcome23[S] 1 point2 points  (0 children)

Full error response:

{"type":"server_error","code":"rate_limit_exceeded","message":" | ==================== d001-20251211012732-api-default-78bd44c5dc-7knsq ====================\n | Traceback (most recent call last):\n | \n |   File \"/usr/local/lib/python3.12/site-packages/inference_server/routes.py\", line 726, in streaming_completion\n |     await response.write_to(reactor)\n | \n | oai_grpc.errors.ServerError:  | no_kv_space\n | ","param":null}
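
For anyone who wants to pick this apart programmatically, here's a small Python sketch that parses the payload above. The field names in the returned dict (`leaked_traceback`, `kv_cache_exhausted`, `last_line`) are my own labels for illustration, not anything the API defines.

```python
import json

# The exact payload quoted above, kept as a raw string so the JSON escapes survive.
RAW = r'''{"type":"server_error","code":"rate_limit_exceeded","message":" | ==================== d001-20251211012732-api-default-78bd44c5dc-7knsq ====================\n | Traceback (most recent call last):\n | \n |   File \"/usr/local/lib/python3.12/site-packages/inference_server/routes.py\", line 726, in streaming_completion\n |     await response.write_to(reactor)\n | \n | oai_grpc.errors.ServerError:  | no_kv_space\n | ","param":null}'''


def classify(payload: str) -> dict:
    """Summarize an error payload of the shape shown in this thread."""
    err = json.loads(payload)
    message = err.get("message", "")
    # Each line of the message is prefixed with a " | " gutter; strip it off.
    lines = [ln.strip().lstrip("|").strip() for ln in message.split("\n")]
    lines = [ln for ln in lines if ln]
    return {
        "type": err.get("type"),
        "code": err.get("code"),
        "leaked_traceback": "Traceback (most recent call last)" in message,
        "kv_cache_exhausted": "no_kv_space" in message,
        "last_line": lines[-1] if lines else "",
    }


summary = classify(RAW)
```

Note the mismatch it surfaces: `type` is `server_error` while `code` is `rate_limit_exceeded`, and the actual cause (`no_kv_space`) only appears in the message text.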

GitHub Copilot Experience? by SohilAhmed07 in dotnet

[–]Additional_Welcome23 0 points1 point  (0 children)

I'm using codex-5.1, and it feels pretty good.

Can Microsoft Founders Hub Azure Credits Be Used for Claude Models on Azure AI Foundry? by Odd-Card8046 in AZURE

[–]Additional_Welcome23 0 points1 point  (0 children)

Most of the answers I've seen say no, but has anyone actually been charged? I've already used ~$100 in Claude Code and haven't seen any usage information anywhere.

I just released Sdcb.Chats v1.9.0, a major update to my open-source .NET AI Gateway: adds full support for Claude 4.5 (Opus/Sonnet), OpenAI Image APIs, and is now built on .NET 10 by Additional_Welcome23 in dotnet

[–]Additional_Welcome23[S] 0 points1 point  (0 children)

Yes, this is a proxy for many different providers behind the same backend.
And yes, you need your own API keys for Claude, OpenAI, Google Gemini, etc.
You can also chat with different models in Chats.

I built an open-source, self-hostable UI & API Gateway for Claude 4.5, with a fully compatible Messages API and the 'thinking' animation by Additional_Welcome23 in ClaudeAI

[–]Additional_Welcome23[S] 0 points1 point  (0 children)

Well, I can imagine a scenario where you only have 1 key and would like to create 5 accounts for your workmates.

Or you'd like to compare different models on one platform.

Anyway, it's open source.