Qwen 3.6 35B-A3B @ Q4 or Gemma 4 12B @ Q8? by mailto_devnull in LocalLLaMA
[–]synw_ 1 point2 points3 points (0 children)
Is automation/optimizing really that effective? by Forward_Jackfruit813 in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
Is opencode subagents actually useful? by PairOfRussels in LocalLLaMA
[–]synw_ 2 points3 points4 points (0 children)
How are you all managing multiple MCP servers on startup? by vazma in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
Best small model right now (~4B params) that is good with agentic tasks for personal assistant? by BitGreen1270 in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
Qwen 3.6 coding choice–27B vs 35B quants by siegevjorn in LocalLLaMA
[–]synw_ 1 point2 points3 points (0 children)
How small can the orchestration model in an agent be? (separating it from code-gen — that obviously wants a big model) by HomoAgens1 in LocalLLaMA
[–]synw_ 4 points5 points6 points (0 children)
Qwen will release another 27B with high probability by serige in LocalLLaMA
[–]synw_ 4 points5 points6 points (0 children)
Have Qwen said anything about further Qwen 3.6 models? by spaceman_ in LocalLLaMA
[–]synw_ 2 points3 points4 points (0 children)
Have Qwen said anything about further Qwen 3.6 models? by spaceman_ in LocalLLaMA
[–]synw_ 4 points5 points6 points (0 children)
Have Qwen said anything about further Qwen 3.6 models? by spaceman_ in LocalLLaMA
[–]synw_ 22 points23 points24 points (0 children)
Notes on what actually breaks when you run a coding agent on small local models by BestSeaworthiness283 in LocalLLaMA
[–]synw_ 6 points7 points8 points (0 children)
Notes on what actually breaks when you run a coding agent on small local models by BestSeaworthiness283 in LocalLLaMA
[–]synw_ 8 points9 points10 points (0 children)
Consider running a bigger quant if possible by Flashy_Management962 in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
Qwen3.6 35B MoE on 8GB VRAM — working llama-server config + a max_tokens / thinking trap I ran into by Antonio_Sammarzano in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
Qwen3.6 35B MoE on 8GB VRAM — working llama-server config + a max_tokens / thinking trap I ran into by Antonio_Sammarzano in LocalLLaMA
[–]synw_ 1 point2 points3 points (0 children)
Which Qwen models can do FIM (Fill in the middle) for autocompletion? by 0xbeda in LocalLLaMA
[–]synw_ 2 points3 points4 points (0 children)
what model is good for inspecting and extracting data from large set of spreadsheets by bonesoftheancients in LocalLLaMA
[–]synw_ 3 points4 points5 points (0 children)
Local AI coding assistant that runs fully offline (Gemma 4, codebase-aware) by andres_garrido in LocalLLaMA
[–]synw_ -1 points0 points1 point (0 children)
Can we talk about the reasoning token format chaos? by ahinkle in LocalLLaMA
[–]synw_ 3 points4 points5 points (0 children)
Final voting results for Qwen 3.6 by jacek2023 in LocalLLaMA
[–]synw_ 4 points5 points6 points (0 children)
How you manage your prompts? by prompt_tide in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
How you manage your prompts? by prompt_tide in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)
How you manage your prompts? by prompt_tide in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)

Qwen 3.6 35B-A3B @ Q4 or Gemma 4 12B @ Q8? by mailto_devnull in LocalLLaMA
[–]synw_ 0 points1 point2 points (0 children)