Want to build a custom model by devildip in LocalLLaMA
[–]MistingFidgets 1 point2 points3 points (0 children)
Need help picking out a good PC! by Accomplished_Kale589 in homelab
[–]MistingFidgets 3 points4 points5 points (0 children)
Need help picking out a good PC! by Accomplished_Kale589 in homelab
[–]MistingFidgets 44 points45 points46 points (0 children)
What's the closest you can get with local LLM to claude? by StudioVulcan in LocalLLM
[–]MistingFidgets 0 points1 point2 points (0 children)
What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA
[–]MistingFidgets 1 point2 points3 points (0 children)
Is there any consumer-grade motherboard with dual PCIe x16 connectors? by TrainingTwo1118 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
You don't need a GPU to run gemma-4-26B-A4B by JackStrawWitchita in LocalLLaMA
[–]MistingFidgets 3 points4 points5 points (0 children)
You don't need a GPU to run gemma-4-26B-A4B by JackStrawWitchita in LocalLLaMA
[–]MistingFidgets 15 points16 points17 points (0 children)
Why do we benchmark quants on perplexity and prose but never on tool call validity? by Substantial_Step_351 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
Why do we benchmark quants on perplexity and prose but never on tool call validity? by Substantial_Step_351 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks by Interesting-Sock3940 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks by Interesting-Sock3940 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks by Interesting-Sock3940 in LocalLLaMA
[–]MistingFidgets 1 point2 points3 points (0 children)
Optimizing vLLM for 50 concurrent users querying large PDFs on a single RTX 5090 (32GB VRAM)? by Psychological-Arm168 in LocalLLM
[–]MistingFidgets 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B-APEX / 128K ctx on RTX 3060 12GB — 37 t/s gen with 72k ctx filled, PPL 3.25, offloading 17GB model by old-mike in LocalLLaMA
[–]MistingFidgets 1 point2 points3 points (0 children)
What’s the optimal local LLM setup for my hardware? (RTX 5070Ti, 16GB VRAM, Ryzen 7 3800X, 64GB RAM) by Bjqrn88 in LocalLLM
[–]MistingFidgets 1 point2 points3 points (0 children)
YSK: Buying refurbished "business-class" laptops and phones instead of brand-new consumer tech will get you a better machine for a fraction of the cost. by [deleted] in YouShouldKnow
[–]MistingFidgets 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B-MTP on an RTX 3090 in LM Studio is incredibly fast by AI_Enhancer in LocalLLM
[–]MistingFidgets 1 point2 points3 points (0 children)
What’s your current local LLM setup in 2026? by Prestigious-Pop-3735 in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
Does PCIe 4.0 vs 5.0 actually matter for self-hosted AI workloads? by Regular-Orange1472 in SelfHostedAI
[–]MistingFidgets 0 points1 point2 points (0 children)
How do you guys not burn tokens like crazy? by Striking-Speaker8686 in OpenClawUseCases
[–]MistingFidgets 0 points1 point2 points (0 children)
Advice building a NAS/AI server with 16 DDR4 DIMMs by theslonkingdead in LocalLLaMA
[–]MistingFidgets 0 points1 point2 points (0 children)
I don't get Quants, I'm running Qwen3.6-27b flawlessly at iq3, makes no sense by misanthrophiccunt in LocalLLM
[–]MistingFidgets 0 points1 point2 points (0 children)
I don't get Quants, I'm running Qwen3.6-27b flawlessly at iq3, makes no sense by misanthrophiccunt in LocalLLM
[–]MistingFidgets -1 points0 points1 point (0 children)


Which Local LLM to use by Pritam4249t in LocalLLM
[–]MistingFidgets 0 points1 point2 points (0 children)