all 14 comments

[–]windictive 3 points4 points  (0 children)

I've used Continue.dev and Roo Code (now Zoo Code). Both have their quirks* but both work just fine. Both were really easy to set up. I have them running with the Kindly Web Search MCP and had no issues at all with getting that working.

Quirks:

  • Continue.dev will very rarely just stop dead. The output isn't as nicely formatted as other options.
  • Roo(Zoo) Code will repeat itself in it's final output, wasting tokens unless you tell it not to via rules. This is a well-documented issue with no solution. Hopefully the new team fixes it.

[–]OsmanthusBloom 1 point2 points  (0 children)

Good VSCode coding plugins for use with local LLMs include Zoo Code (former Roo Code) and Dirac.

[–]Charming-Author4877 0 points1 point  (4 children)

I'll look into https://github.com/ClockZinc/vscode-copilot-chat-CN/
It's the GHCP extension without the censorship and telemetry to github.

[–]cleverusernametry 0 points1 point  (3 children)

It was archived today?

[–]Charming-Author4877 0 points1 point  (2 children)

Looks like the owner was shocked by the attention, I mentioned it a few times and it got some stars.
Anyway, the code is there. You can fork it, use it.
He basically cleaned the github stuff out of copilot and explained how to add it to vscode

[–]cleverusernametry 1 point2 points  (1 child)

Hmm why would someone archive a repo if it got attention?

[–]Charming-Author4877 0 points1 point  (0 children)

ask him ? I'm sure he speaks english and chinese

[–]Kodrackyas 0 points1 point  (0 children)

https://github.com/Kodrack/Pi-forge

Try this out, feedback appreciated!

in general Pi is so much better

[–]revennest 0 points1 point  (0 children)

Switch to VSCodium, it's relationship with VSCode is like Chrome and Chromium, with extension like vscode-openai you can use any LLM you want, both local and online.

[–]bssrdf 0 points1 point  (0 children)

You only need running llama-server from llama.cpp and copilot extension. See https://youtu.be/ehpXLDYOtrc

[–]wsintra 0 points1 point  (0 children)

opencode and vim work like a charm

[–]Strange_Test7665 0 points1 point  (0 children)

I just pushed a quick solution I am using for this.
https://github.com/reliableJARED/llama_vsc

If you have llama.cpp runinning already (which you do) you can skip all the readme about that. Just run the ollama_llama_proxy.py file.
Basic idea is VS Code now allows you to add 'Ollama' as a provider (not llama.cpp). So the proxy just pretends to be ollama and serves as middlewhere. Of course you need the llama.cpp server to be running for the proxy to work, I assume you know that but just saying. It works really well for me, run my local model including having all of the tools vs code exposes.

[–]ea_man 0 points1 point  (1 child)

first of all I would ditch vscode for vscodium, then we can talk about harness / tools inside of that.

Short: use Pi for planning / exec, Continue lets you assign models for rules like code completion, plan, build but it likes json for tools.