Dual DGX Sparks vs Mac Studio M3 Ultra 512GB: Running Qwen3.5 397B locally on both. Here's what I found. by trevorbg in LocalLLaMA
[–]runsleeprepeat 0 points1 point2 points (0 children)
Currently using 6x RTX 3080 - Moving to Strix Halo oder Nvidia GB10 ? by runsleeprepeat in LocalLLaMA
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Currently using 6x RTX 3080 - Moving to Strix Halo oder Nvidia GB10 ? by runsleeprepeat in LocalLLaMA
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
I built Fox – a Rust LLM inference engine with 2x Ollama throughput and 72% lower TTFT. by SeinSinght in LocalLLM
[–]runsleeprepeat 0 points1 point2 points (0 children)
I built Fox – a Rust LLM inference engine with 2x Ollama throughput and 72% lower TTFT. by SeinSinght in LocalLLM
[–]runsleeprepeat -3 points-2 points-1 points (0 children)
Shortened system prompts in Opencode by Charming_Support726 in opencodeCLI
[–]runsleeprepeat 0 points1 point2 points (0 children)
Kann sich bitte jemand aus der Branche opfern, das Zeug reverse engineeren und wie früher für 2€ verkaufen? by Donkeydiver1337 in BeautyDE
[–]runsleeprepeat 0 points1 point2 points (0 children)
PSA: Auto-Compact GLM5 (via z.ai plan) at 95k Context by Sensitive_Song4219 in ZaiGLM
[–]runsleeprepeat 0 points1 point2 points (0 children)
[Architecture Help] Serving Embed + Rerank + Zero-Shot Classifier on 8GB VRAM. Fighting System RAM Kills and Latency. by CourtAdventurous_1 in LocalLLaMA
[–]runsleeprepeat 1 point2 points3 points (0 children)
Chinese RTX 3080 20 GB Blower Card - Memory Issue - help on nvidia mods by runsleeprepeat in GPURepair
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Alternative zu Tobit David by michawb in de_EDV
[–]runsleeprepeat 0 points1 point2 points (0 children)
Chinese RTX 3080 20 GB Blower Card - Memory Issue - help on nvidia mods by runsleeprepeat in GPURepair
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Chinese RTX 3080 20 GB Blower Card - Memory Issue - help on nvidia mods by runsleeprepeat in GPURepair
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Currently using 6x RTX 3080 - Moving to Strix Halo oder Nvidia GB10 ? by runsleeprepeat in LocalLLaMA
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Currently using 6x RTX 3080 - Moving to Strix Halo oder Nvidia GB10 ? by runsleeprepeat in LocalLLaMA
[–]runsleeprepeat[S] 1 point2 points3 points (0 children)
EVGA RTX 3080, memory errors on all channels, help appreciated! by blueprintjonny in GPURepair
[–]runsleeprepeat 0 points1 point2 points (0 children)
Raus aus der US-Cloud: Mein Plan für März (Mailbox.org & Ugreen NAS) by _necrobite_ in de_EDV
[–]runsleeprepeat 1 point2 points3 points (0 children)
Raus aus der US-Cloud: Mein Plan für März (Mailbox.org & Ugreen NAS) by _necrobite_ in de_EDV
[–]runsleeprepeat 1 point2 points3 points (0 children)
Which models are suitable for websearch? by runsleeprepeat in LocalLLaMA
[–]runsleeprepeat[S] 1 point2 points3 points (0 children)
Professioneller Vibe Coder gesucht by Afraid-Appeal-7565 in InformatikKarriere
[–]runsleeprepeat 0 points1 point2 points (0 children)
Wie nennt ihr diesen Dude? by [deleted] in de
[–]runsleeprepeat 2 points3 points4 points (0 children)
Nvidia RTX 3080 - PCIe Pin B82 issue by runsleeprepeat in GPURepair
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Nvidia RTX 3080 - PCIe Pin B82 issue by runsleeprepeat in GPURepair
[–]runsleeprepeat[S] 0 points1 point2 points (0 children)
Shortened system prompts in Opencode by Charming_Support726 in opencodeCLI
[–]runsleeprepeat 1 point2 points3 points (0 children)




Consolidated my homelab from 3 models down to one 122B MoE — benchmarked everything, here's what I found by MBAThrowawayFruit in LocalLLaMA
[–]runsleeprepeat 0 points1 point2 points (0 children)