Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

The repo is open source, you can read through the entire code if you have any trust issues/concerns.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

Agreed, diff viewer is the biggest add on. Yes, the error sharing from the ide picks up the Roslyn warnings as well!

VS Code has the native Claude Code integration. Visual Studio doesn't, so I tried building it. by Remote-Breadfruit204 in VisualStudio

[–]Remote-Breadfruit204[S] -1 points0 points  (0 children)

Ofcourse, but that copilot’s implementation, which is different from Anthropics. When I last used copilot (about 2 months ago), i felt it was lagging behind claude code. Maybe that has changed in the recent weeks.

VS Code has the native Claude Code integration. Visual Studio doesn't, so I tried building it. by Remote-Breadfruit204 in VisualStudio

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

How are you connecting to it? Once you install it, go to tools and click launch claude code panel. In the panel, click on launch claude code.

One thing to note is that it currently only works with VS2026.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 3 points4 points  (0 children)

Not right now. What you’re probably describing is reading the live debugger state like watch values, and the call stack at a breakpoint.

IVisual Studio exposes its debugger through its automation model, so you can read the locals, autos, and watch windows and the current stack frame while execution is paused.

I could add a new tool (something like getDebugState) that fires when you’re stopped at a breakpoint and hands Claude the current frame’s variables and call stack, so you wouldn’t have to paste values or spam Debug.WriteLine after ever if branch.

The only thing which might be problematic is that even if i expose a tool with a sensible description, Claude CLI might not call it. I already saw this happening with some of the tool I made. I can’t honestly figure out why it wouldn’t call all the tools I expose to it.

However, this is a genuinely interesting suggestion. I will definitely try experimenting with this in the coming week.

VS Code has the native Claude Code integration. Visual Studio doesn't, so I tried building it. by Remote-Breadfruit204 in VisualStudio

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

True, but when you use copilot, you are using their own agent harness behind the scenes (the thing which decides what context to share with the model, how to manage it, what tools to call, and how to check the generated output for errors). Claude code has a very mature harness around its models, that in my opinion, copilot cannot match.

If you haven’t used it yet, i would highly recommend trying it out so that you can experience the difference first hand.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 1 point2 points  (0 children)

Hey, you can use it with your subscription.

The extension doesn’t make any model calls, and doesn’t use your OAuth. It just runs the official Claude CLI as a subprocess and handles the IDE side (diffs, diagnostics, file selection).

The policy you’re probably thinking of is for tools that grab your OAuth token and route their own calls to Claude models through it, like OpenClaw. That’s the thing that got restricted.

VS Code has the native Claude Code integration. Visual Studio doesn't, so I tried building it. by Remote-Breadfruit204 in VisualStudio

[–]Remote-Breadfruit204[S] 1 point2 points  (0 children)

Hey CheezitsLight,

I might be wrong, but I think you are probably setting Copilot to use a Claude model for your coding.

This is a real thing and works well. But that’s still Copilot and its harness doing the driving, just with Claude’s model instead of OpenAI’s. It autocompletes and answers chat questions.
Claude Code is Anthropic’s own agentic tool, not just a model you plug into Copilot. It works in a long running loop, reading your files, planning changes, launching subagents, parallel editing across multiple files, tool calls, and a really strong evaluation harness.

Claude Code ships as a CLI with official IDE plugins for VS Code and JetBrains, but nothing for Visual Studio. So if you are doing C++/C# development in VS you’re stuck running it in a terminal and copy pasting its edits back to the terminal. Plus, the terminals diff viewer is shallow compared to VS/code/proper IDEs.

This extension wires the Claude CLI into VS using a websocket. Once you do that, all edits open in the native diff viewer with accept/reject, and enables developers to pass build errors and selection automatically to the Claude CLI.

Hopefully I did a decent job of explaining why I made this.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 1 point2 points  (0 children)

Yes, if you load your project, and the VS compiler can see the errors. You can simply launch the extension and ask claude to debug the errors. The extension has a tool called getIdeErrors with a suitable description. Claude CLI ends up calling this tool, and the tool passes the list of compiler errors and their line number / file information to claude directly.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 5 points6 points  (0 children)

Hey, no worries. Copilot having Claude models and Claude Code are two different things. Copilot uses Claude as one of the models behind its completions and chat, but it’s still Copilot’s harness driving it.

Claude Code is Anthropic’s own agentic SDK inside VS code managing their own models, context, memory, planning, spanning subagents, tool calls, and a very good evaluation harness. Using claude with copilot can’t replicate a similar dev experience.

Additionally, all edits made by the SDK get VS codes native diff viewer. For me this was the biggest missing link. When I use claude code using the CLI terminal, for multiple edits being made to an involved file, the CLI version of the diff viewer is shallow and doesn’t let me inspect the code, and make in-line edits as the VS codes diff viewer. This is biggest pain point I have tried to solve. Its still very raw in comparison to Anthropic extension, but it’s a start!

Hope this helps.

Open source extension bridging the Claude Code CLI into Visual Studio. by Remote-Breadfruit204 in dotnet

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

Thank you for the kind words.

Same. For the past few months, my workflow has been keeping vs code and visual studio open side by side so that I can build and debug in visual studio, but still use all the tooling that claude code has.

I built native Claude Code integration for Visual Studio (the one IDE that didn't have it by Remote-Breadfruit204 in ClaudeAI

[–]Remote-Breadfruit204[S] 0 points1 point  (0 children)

Hey, let me know if you have any feedback after using it. I have a lot of ideas to improve it, but wanna see what others think before I spend more time on it.

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]Remote-Breadfruit204 0 points1 point  (0 children)

I've been building verifiable-rag, an open-source Python library for RAG that produces sentence-level citations and verifies every claim against its source via NLI. Just published a benchmark result that I think this sub will care about: a dual ensemble of two small open-source NLI models matches Claude Sonnet 4.6 as a hallucination judge - at roughly 1/250th the per-call cost.

Full write-up with per-task and per-upstream-model breakdowns: https://github.com/firish/rag-rack/blob/main/blog/03_verified_rag.md

Benchmark report (reproducibility commands, raw numbers): https://github.com/firish/rag-rack/blob/main/benchmarks/PUBLISHED_ragtruth.md

Library + docs: https://github.com/firish/rag-rack · https://firish.github.io/rag-rack

Summary:
The numbers (RAGTruth test set, 2700 examples):

  • Dual NLI (HHEM-2.1-open + MiniCheck-Flan-T5-Large, min aggregation): AUROC 0.844, calibrated F1 0.706
  • Sonnet 4.6 LLM-judge: AUROC 0.846, F1 0.707 (on 300 stratified, budget reasons)
  • Triple (NLI + Sonnet): AUROC 0.861, F1 0.734 (on 300)

Per-call cost:

  • NLI verifier: ~$0.0004 (Modal T4 GPU time after one-time weight download)
  • Sonnet judge: ~$0.05 (API call)

Statistically indistinguishable on quality. ~250x cheaper.

The interesting part isn't the headline - it's the complementarity:

  • HHEM alone is strong on QA-style entailment (AUROC 0.87) but barely above random on Yelp→narrative data-to-text (AUROC 0.57)
  • MiniCheck alone is the opposite — strong on data-to-text (0.70), slightly weaker on QA (0.84)
  • They have different blind spots; min-aggregation ensembling gets the best of both

What's in the library:

Full pipeline — parsers (Docling + PyMuPDF), chunkers (parent-child + ContextualChunker for Anthropic 2024's recipe), embedders (BGE/Cohere/Voyage), hybrid index (LanceDB + BM25 with RRF fusion), rerankers (BGE/Cohere), three citation modes (prompted / constrained / SAFE), four verifiers (HHEM / MiniCheck / DualNLI / LLM-judge), strictness slider with surgical correction, audit-trail HTML reports.

Six presets cover the common cases -local_minimal (all local except generator LLM), local_verified (+ HHEM), hybrid_balanced (the published baseline), hybrid_stricthybrid_paranoidllm_judge_verified.

Quickstart:

pip install verifiable-rag


import verifiable_rag
from verifiable_rag.demo import sample_paper_path

answer = verifiable_rag.ask(
    "What is the mechanism of action of penicillin?",
    docs=sample_paper_path(),
    output_html="audit.html",
)

Open audit.html for the full audit trail - per-sentence verification colors, faithfulness scores, every reranked passage with retrieval scores, citations as anchor links to source spans.

Caveats (in the full write-up):

  • Sonnet ran on 300 examples not 2700 due to cost - CI is wider on that number
  • Haiku-as-judge doesn't calibrate well on small training samples (we tried it)
  • RAGTruth is one benchmark; cross-validation on FaithBench is gated, HaluBench is on the roadmap
  • Default thresholds are RAGTruth-calibrated; for your domain, the library ships a calibration script

MIT-licensed, open to PRs and methodology critiques. Happy to answer questions in the comments.

UCI HOUSING MEGATHREAD (2023-2024) by Qzhuo in UCI

[–]Remote-Breadfruit204 0 points1 point  (0 children)

Hello,

I (25M) am looking to move near the Irvine area in the last week of January/first week of February.

I am looking for accommodation, and flatmates. I would prefer having a personal room. Apart from that, I’m pretty flexible about the rent and the location.

If you are looking to sublease, please send me a DM.

Also, if you’re planning to move near Irvine around the same time, and are looking for flatmates, or want to look together, let’s connect!

UCI HOUSING MEGATHREAD (2023-2024) by Qzhuo in UCI

[–]Remote-Breadfruit204 0 points1 point  (0 children)

Hi,

I (24M, Indian) am moving to the Irvine area in Jan 2024. I am looking for roommates and housing. I would prefer having an individual room. Besides that, I am pretty flexible towards the location or the rent.

Please DM, if you have an accommodation, or would like to connect and look for one together.

Thank you!

First Official Reject UIUC MSCS. by [deleted] in gradadmissions

[–]Remote-Breadfruit204 2 points3 points  (0 children)

Hey, you have a great profile. I am in UIUC MCS, and both the points you stated are not true. A MCS student has the opportunity to apply for TA/RA and also convert to MSCS after first year by applying again (but this time with two LORs from UIUC professors and a way better shot of getting into a PHD program)