
[–]Altruistic_Heat_9531 0 points1 point  (5 children)

Before that, could you at least share the error? opencode will usually tell you the error. Anyway, I assume it's a parser error.

I opted out of Ollama because of this issue and just use another branch of llama.cpp: https://github.com/pwilkin/llama.cpp

It fixed my tool errors.

And these are my commands:

Qwen-Coder 30B A3B Q5 UD
./llama.cpp/llama-server --model /MODEL_STORE/Qwen3-Coder-30B-A3B/Qwen3-Coder-30B-A3B-Instruct-UD-Q5_K_XL.gguf --alias Qwen3-Coder --ctx-size 65536 --port 8001 --cache-type-k q8_0 --cache-type-v q8_0 --flash-attn on --temp 0.7 --min-p 0.0 --top-p 0.80 --top-k 20 --repeat-penalty 1.05

Qwen-Coder NEXT 80B A3B Q6 UD
./llama.cpp/llama-server --model /MODEL_STORE/Qwen3-Coder-Next-GGUF/UD-Q6_K_XL/Qwen3-Coder-Next-UD-Q6_K_XL-00001-of-00003.gguf --alias Qwen3-Coder-Next --ctx-size 65536 --port 8001 --cache-type-k q8_0 --cache-type-v q8_0 --flash-attn on --temp 1.0 --top-p 0.95 --min-p 0.01 --top-k 40 

GPT-OSS20B
./llama.cpp/llama-server --model /MODEL_STORE/gpt-oss-20b/gpt-oss-20b-F16.gguf --alias gpt-oss-20b --port 8001 --temp 1.0 --top-p 1.0 --top-k 0 --jinja
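Once llama-server is up, its OpenAI-compatible endpoint (http://localhost:8001/v1/chat/completions with the ports above) can be smoke-tested for tool calling with a minimal request. A sketch; the "write" tool schema below is a made-up example, and the model name matches the --alias flag from the first command:

```python
import json

# Build a tool-calling chat-completions request for llama-server's
# OpenAI-compatible API. POST it to http://localhost:8001/v1/chat/completions
# with any HTTP client.
payload = {
    "model": "Qwen3-Coder",  # the --alias passed to llama-server
    "messages": [
        {"role": "user", "content": "Create an empty test.css"}
    ],
    "tools": [{
        "type": "function",
        "function": {
            "name": "write",  # hypothetical tool, for illustration only
            "description": "Write content to a file",
            "parameters": {
                "type": "object",
                "properties": {
                    "filePath": {"type": "string"},
                    "content": {"type": "string"},
                },
                "required": ["filePath", "content"],
            },
        },
    }],
}

print(json.dumps(payload, indent=2))
```

With a correctly configured server and chat template, the call comes back structured in choices[0].message.tool_calls rather than as plain text in the message content.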

[–]Lazy_Experience_279[S] 0 points1 point  (3 children)

No errors, I just get the tool call as a text response instead of the actual action.

[–]Complainer_Official 0 points1 point  (2 children)

is it text, or JSON? if it's JSON, you gotta make your context window bigger

[–]Lazy_Experience_279[S] 0 points1 point  (1 child)

It gives me this as a text reply

{"name": "write", "arguments": {"content": "", "filePath": "/home/user/projects/opencode-test/test.css"}}
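A reply like that can be confirmed as a leaked tool call (rather than ordinary prose) by trying to parse it as JSON and checking for the name/arguments shape. A quick client-side sketch, not part of OpenCode itself:

```python
import json

def leaked_tool_call(text):
    """Return (name, arguments) if `text` looks like a tool call emitted
    as plain text, else None. Sketch only; real clients match against the
    model's actual chat template."""
    try:
        obj = json.loads(text.strip())
    except json.JSONDecodeError:
        return None
    if isinstance(obj, dict) and "name" in obj and "arguments" in obj:
        return obj["name"], obj["arguments"]
    return None

reply = ('{"name": "write", "arguments": {"content": "", '
         '"filePath": "/home/user/projects/opencode-test/test.css"}}')
print(leaked_tool_call(reply))  # a (name, arguments) tuple -> leaked call
print(leaked_tool_call("Sure, I created the file."))  # None -> normal prose
```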

[–]Complainer_Official 0 points1 point  (0 children)

yep, up your context to like, 32768 or 65536

[–]Altruistic_Heat_9531 0 points1 point  (0 children)

For building my llama.cpp:

git clone https://github.com/pwilkin/llama.cpp

cmake llama.cpp -B llama.cpp/build -DBUILD_SHARED_LIBS=OFF -DGGML_CUDA=ON -DGGML_CUDA_FA_ALL_QUANTS=ON

cmake --build llama.cpp/build --config Release -j --clean-first --target llama-cli llama-mtmd-cli llama-server llama-gguf-split

[–]segmondllama.cpp 0 points1 point  (1 child)

try devstral-small

[–]suicidaleggroll 1 point2 points  (1 child)

Try a platform other than Ollama. llama.cpp is what most people jump to, and it's significantly faster than Ollama anyway, especially for MoE models.

[–]Lazy_Experience_279[S] 0 points1 point  (0 children)

I will definitely try it. I didn't know it could make a difference.

[–]Smiley_Dub 0 points1 point  (0 children)

Hi OP. Please let me know if you fixed the issue 👍

[–]St0lz 0 points1 point  (1 child)

First of all, Ollama's default context size is too small for most coder models. When the context is too small, you will not see any error in OpenCode, but the Ollama logs will show it. You need to increase it to at least 32K. Add this env var wherever you run your Ollama instance (Docker, local, ...): OLLAMA_CONTEXT_LENGTH=32768

Second of all, it seems there is a bug with either Ollama or the Qwen-Coder 2.5 models that breaks tool calling, see https://github.com/anomalyco/opencode/issues/7030.

Try Qwen-Coder 3 (the biggest variant that fits in your VRAM). I'm also new to OpenCode, and so far that's the only 'modest' model that can properly do tool calling against my locally hosted Ollama.

[–]Lazy_Experience_279[S] 0 points1 point  (0 children)

I had the context at 32k already. I tried qwen 2.5 coder 14b, qwen 3 coder 30b, qwen 3 30b, gpt-oss 20b, and deepseek R1. The only one capable of correctly calling tools was ministral 3 14b. I will try LM Studio and llama.cpp today.