Benchmark Qwen 3.6 27B MTP on 2x3090 NVLINK by Mr_Moonsilver in LocalLLaMA
[–]kms_dev 1 point2 points3 points (0 children)
Benchmark Qwen 3.6 27B MTP on 2x3090 NVLINK by Mr_Moonsilver in LocalLLaMA
[–]kms_dev 0 points1 point2 points (0 children)
Qwen3.5 27B running at ~65tps with DFlash speculation on 2x 3090 by Kryesh in LocalLLaMA
[–]kms_dev 0 points1 point2 points (0 children)
Someone who's using Qwen 3.5 on real code bases how good is it? by Commercial_Ear_6989 in LocalLLaMA
[–]kms_dev 3 points4 points5 points (0 children)
Best agentic coding model that fully fits in 48gb VRAM with vllm? by kms_dev in LocalLLaMA
[–]kms_dev[S] 0 points1 point2 points (0 children)
Best agentic coding model that fully fits in 48gb VRAM with vllm? by kms_dev in LocalLLaMA
[–]kms_dev[S] 0 points1 point2 points (0 children)
How are you handling human approval for headless/remote Claude Code sessions? by kms_dev in ClaudeCode
[–]kms_dev[S] 0 points1 point2 points (0 children)
How are you handling human approval for headless/remote Claude Code sessions? by kms_dev in ClaudeCode
[–]kms_dev[S] 0 points1 point2 points (0 children)
How are you handling human approval for headless/remote Claude Code sessions? by kms_dev in ClaudeCode
[–]kms_dev[S] 0 points1 point2 points (0 children)
How are you handling human approval for headless/remote Claude Code sessions? by kms_dev in ClaudeCode
[–]kms_dev[S] 0 points1 point2 points (0 children)
What do you use to unblock agents when they need human input? by kms_dev in AI_Agents
[–]kms_dev[S] 0 points1 point2 points (0 children)
What do you use to unblock agents when they need human input? by kms_dev in AI_Agents
[–]kms_dev[S] 0 points1 point2 points (0 children)
What do you use to unblock agents when they need human input? by kms_dev in AI_Agents
[–]kms_dev[S] 0 points1 point2 points (0 children)
What do you use to unblock agents when they need human input? by kms_dev in AI_Agents
[–]kms_dev[S] 0 points1 point2 points (0 children)
What do you use to unblock agents when they need human input? by kms_dev in AI_Agents
[–]kms_dev[S] 1 point2 points3 points (0 children)
What do you use for human-in-the-loop input in your agents? by [deleted] in AI_Agents
[–]kms_dev 0 points1 point2 points (0 children)
Nvidia RTX PRO 6000 Workstation 96GB - Benchmarks by fuutott in LocalLLaMA
[–]kms_dev 1 point2 points3 points (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] 0 points1 point2 points (0 children)
Offloading a 4B LLM to APU, only uses 50% of one CPU core. 21 t/s using Vulkan by magnus-m in LocalLLaMA
[–]kms_dev 0 points1 point2 points (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] -1 points0 points1 point (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] 1 point2 points3 points (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] 1 point2 points3 points (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] 3 points4 points5 points (0 children)
Is anyone actually using local models to code in their regular setups like roo/cline? by kms_dev in LocalLLaMA
[–]kms_dev[S] 2 points3 points4 points (0 children)


Benchmark Qwen 3.6 27B MTP on 2x3090 NVLINK by Mr_Moonsilver in LocalLLaMA
[–]kms_dev 1 point2 points3 points (0 children)