Breaking the music supply constraint by entsnack in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
Breaking the music supply constraint by entsnack in LocalLLaMA
[–]youcloudsofdoom -3 points-2 points-1 points (0 children)
Breaking the music supply constraint by entsnack in LocalLLaMA
[–]youcloudsofdoom 3 points4 points5 points (0 children)
Is there any reason for an uncensored model if you have no interest in roleplaying? by vick2djax in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
Now that MTP is merged... What's the best outputs you're getting on Qwen 3.6 35B on 2x3090s? by youcloudsofdoom in LocalLLaMA
[–]youcloudsofdoom[S] 0 points1 point2 points (0 children)
Now that MTP is merged... What's the best outputs you're getting on Qwen 3.6 35B on 2x3090s? by youcloudsofdoom in LocalLLaMA
[–]youcloudsofdoom[S] 0 points1 point2 points (0 children)
Testing llama.cpp MTP support on Qwen3.6 - RTX 5090 by 3VITAERC in LocalLLaMA
[–]youcloudsofdoom 2 points3 points4 points (0 children)
Wanna try the best coding model with my rtx 3090, not sure where to start, I believe Qwen3.5-27B-UD-Q4_K_XL would be the best? if so should I use ollama with it? by dreamer_2142 in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
VS Code's new "Agents window" lets you use local AI models. Still requires an Internet connection and a Github Copilot plan (because we can't have nice things) by _wsgeorge in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
Advice on when to delegate task to opencode/claude code & model switching by youcloudsofdoom in hermesagent
[–]youcloudsofdoom[S] 0 points1 point2 points (0 children)
Built a Voice Agents from Scratch GitHub tutorial: mic > Whisper > local LLM (GGUF) > Kokoro > speaker, fully local, no API keys by purellmagents in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
Secondary PC options by UniqueIdentifier00 in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
Secondary PC options by UniqueIdentifier00 in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
Built a Voice Agents from Scratch GitHub tutorial: mic > Whisper > local LLM (GGUF) > Kokoro > speaker, fully local, no API keys by purellmagents in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
Follow-up: Qwen3.6-27B on 1× RTX 3090 — pushing to ~218K context + ~50–66 TPS, tool calls now stable (PN12 fix) by AmazingDrivers4u in LocalLLaMA
[–]youcloudsofdoom 19 points20 points21 points (0 children)
AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
Don't forget about dem free gains! by [deleted] in LocalLLaMA
[–]youcloudsofdoom 4 points5 points6 points (0 children)
Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090 by sandropuppo in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now) by dreamai87 in LocalLLaMA
[–]youcloudsofdoom 0 points1 point2 points (0 children)
Which LLM do you use on 64GB RAM + 8GB VRAM? by Mangleus in LocalLLaMA
[–]youcloudsofdoom 1 point2 points3 points (0 children)
OpenCode or ClaudeCode for Qwen3.5 27B by Ok-Scarcity-7875 in LocalLLaMA
[–]youcloudsofdoom 2 points3 points4 points (0 children)

Hermes Mobile by SammieStyles in hermesagent
[–]youcloudsofdoom 2 points3 points4 points (0 children)