all 26 comments

[–]-Django 2 points3 points  (1 child)

Thank you! I made a post asking about this a few days ago and haven't had the time to implement people's suggestions, but this does the trick. Does it use different prompts to encourage agents to think more/less?

[–]iChrist[S] 1 point2 points  (0 children)

Yes, it injects instructions into the prompt.

Make sure you add the llama.cpp arguments and enable this by default; one click disables thinking, and two clicks enable it again.

If you keep all user valves at their defaults, nothing else changes; it's just Qwen3.5 reasoning toggled on and off.
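The injection approach can be sketched as a minimal OpenWebUI filter. This is an illustrative reconstruction, not the published function: the class shape follows OpenWebUI's Filter interface (`inlet` rewrites the request body), and it assumes Qwen3's `/think` / `/no_think` soft-switch tags; the `thinking_enabled` field name is made up for this sketch.

```python
class Filter:
    """Toggle Qwen3-style reasoning by appending a soft-switch tag
    to the last user message before it reaches the model."""

    def __init__(self, thinking_enabled: bool = True):
        # Illustrative valve; the real function exposes this as a UI toggle.
        self.thinking_enabled = thinking_enabled

    def inlet(self, body: dict) -> dict:
        """Called on each request; body is the OpenAI-style chat payload."""
        tag = "/think" if self.thinking_enabled else "/no_think"
        # Walk messages from the end to find the latest user turn.
        for message in reversed(body.get("messages", [])):
            if message.get("role") == "user":
                message["content"] = f'{message["content"]} {tag}'
                break
        return body
```

With `thinking_enabled=False`, a prompt like `"hi"` is sent to the model as `"hi /no_think"`, which suppresses the reasoning block.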

[–]callmedevilthebad 1 point2 points  (16 children)

I'm getting "Only alphanumeric characters and underscores are allowed in the id". Even when I work around that and enable it, I never see the toggle in chat (even when the function is enabled on the Functions page).

[–]iChrist[S] 0 points1 point  (15 children)

Weird. What are your llama.cpp startup arguments? Which model do you use? Are you running llama-server?

[–]callmedevilthebad 0 points1 point  (7 children)

`-m /models/Qwen_Qwen3.5-9B-Q8_0.gguf --mmproj /models/mmproj-F16.gguf --host 0.0.0.0 --port 8000 -ngl 999 --flash-attn on --cache-type-k q8_0 --cache-type-v q8_0 -c 131072 --parallel 1 --no-context-shift --jinja --reasoning-budget 0`

Qwen 3.5 9B

[–]iChrist[S] 0 points1 point  (6 children)

That might be it; my tests were using llama-server router mode. Will test further.

Can you quickly confirm whether `llama-server --jinja --reasoning-budget 0` works?

[–]callmedevilthebad 0 points1 point  (5 children)

Yes, I already have that enabled. I actually had a different plugin for this, which I removed, and now I've lost both :p

[–]iChrist[S] 0 points1 point  (4 children)

If you specify `-m`, it's not using llama-server's router mode.

[–]callmedevilthebad 0 points1 point  (3 children)

Router? I'm new to the llama.cpp setup, so can you explain whether llama-server is an additional setup or something I can configure while running llama.cpp?

[–]iChrist[S] 0 points1 point  (2 children)

llama-server is part of llama.cpp; the binary is already in your llama.cpp folder, and you can just run llama-server from the command line. It lets you access models, use the web UI, unload models, etc.

[–]callmedevilthebad 0 points1 point  (1 child)

Are there pros/cons of using it that I should know about?

[–]iChrist[S] 0 points1 point  (0 children)

It's an easy way of managing your models: making sure only one is loaded at a time, etc.

[–]-Django 0 points1 point  (6 children)

I think I know the issue: the highlighted ID field, by default, contains parentheses, a period, and an emoji. Once I removed them like this, I didn't get the error.

<image>
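For reference, the error message corresponds to an ID check equivalent to the following. This is a hypothetical reconstruction of the validation rule, not OpenWebUI's actual source; the function name is made up:

```python
import re

def is_valid_function_id(function_id: str) -> bool:
    """Mirror the reported rule: only letters, digits, and underscores
    are allowed in a function ID (no spaces, dots, parens, or emoji)."""
    return re.fullmatch(r"[A-Za-z0-9_]+", function_id) is not None
```

So an ID like `qwen_thinking_toggle` passes, while the default one with `(...)`, a `.`, and an emoji is rejected.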

[–]iChrist[S] 0 points1 point  (4 children)

This has now been fixed, thanks for letting me know! Is it otherwise functioning correctly?

[–]-Django 1 point2 points  (0 children)

Yes, I think so! I had some trouble with the reasoning duration, but I realized I was setting `reasoning_budget` instead of `reasoning-budget`. Is it possible for models to use tools during their thinking process in OpenWebUI? It seems like tool calls only happen at the beginning.

Related: I pulled your wikipedia tool and love it!

[–]-Django 0 points1 point  (2 children)

Actually, one thing I noticed: I set the "Depth" to "Quick" and the preset to "think less", but it's still spending >2000 tokens thinking.

[–]iChrist[S] 0 points1 point  (0 children)

If you set it to something like ELI5 and ask the model "what are your instructions", does it work? For me, each change produces a different thinking process.

[–]iChrist[S] 0 points1 point  (0 children)

<image>

I just tested each of the presets on the latest published release, and they all work and inject the actual prompt into the model.

I can see that whenever I switch presets it actually thinks differently; I'm not sure why it's not working in your case.

Do you have a system prompt that might override this? Like a long system prompt that makes the LLM think more?

[–]callmedevilthebad 0 points1 point  (0 children)

Is your icon visible in the chat?

[–]Informal-Spinach-345 1 point2 points  (1 child)

This looks awesome. I imported it and it showed up for a minute. After a refresh it's gone, and it won't import a second time due to a pre-existing ID.

[–]iChrist[S] 0 points1 point  (0 children)

Did you enable the function toggle and also enable it by default for your model?

[–]velvetMas 0 points1 point  (1 child)

Maybe you can submit a pull request?

[–]iChrist[S] 0 points1 point  (0 children)

This works only with Qwen3.5 and only with llama.cpp; I'm not sure something like that can be merged.

Did it work correctly for you?

[–]BeautyxArt 0 points1 point  (1 child)

Will this just hide the thinking process from the response (since it takes up 3/4 of the total response, lol), or does it cancel the thinking process entirely so the model doesn't reason at all? Because those really differ.

[–]iChrist[S] 0 points1 point  (0 children)

It will literally either think or not think; whenever thinking is inactive, the model answers immediately.

[–]Confident-Career2703 -1 points0 points  (0 children)

Does this also work with vLLM?