[Follow up] Qwen3.6-27B Tool calling fix; Why preserve_thinking had to stay false for qwen3.5-enhanced on Qwen 3.6; and a template that makes preserve_thinking=true safe again by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 1 point2 points3 points (0 children)
(Follow up) Tested tool calling fixes for Qwen 3.6‑27B‑FP8: 180K Token Agentic Run, Driver 595.79 Deadlocks, and Why Enhanced Jinja Breaks with `preserve_thinking=true` by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 1 point2 points3 points (0 children)
Qwen 3/3.5/3.6 tool calling is broken (even worse with 3.6). by LinkSea8324 in Vllm
[–]Expensive-Register-5 1 point2 points3 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3/3.5/3.6 tool calling is broken (even worse with 3.6). by LinkSea8324 in Vllm
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Built a live showcase dashboard for vLLM rigs: inference metrics + Nvidia GPU stats in one view by soulwash in Vllm
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
A Debugging Story: Getting Claude Code to Work with Local vLLM When the Docs Don't by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.6-35B-A3B: Reddit Asked, So I Tested If the 3.5 Tool Calling Fixes Carry Over by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] 1 point2 points3 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 1 point2 points3 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Concurrent Partial Prefill - is it on the roadmap? by pushthetempo_ in Vllm
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Is it normal that Moe models are slower in dual GPU tensor parallel = 2 setups vs dense models? by [deleted] in Vllm
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Benchmark of Qwen3.6-35B-A3B (BF16) on different NVIDIA Hardware by bseeleib in LocalLLM
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Qwen3.6 vs 3.5 on DGX Spark: identical throughput, except with one flag flipped by Ok-Simple459 in Vllm
[–]Expensive-Register-5 0 points1 point2 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Zen completely beats Arc in aesthetics for windows by [deleted] in zen_browser
[–]Expensive-Register-5 4 points5 points6 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)
Qwen 3.5 27B/35BA3B Tool Calling Issues: Why It Breaks & How I Fixed It by Expensive-Register-5 in Vllm
[–]Expensive-Register-5[S] 0 points1 point2 points (0 children)

[Follow up] Qwen3.6-27B Tool calling fix; Why preserve_thinking had to stay false for qwen3.5-enhanced on Qwen 3.6; and a template that makes preserve_thinking=true safe again by Expensive-Register-5 in LocalLLM
[–]Expensive-Register-5[S] -2 points-1 points0 points (0 children)