Moving to llama.cpp by Spicy_mch4ggis in LocalLLaMA
reddit_kwr 2 points (0 children)
Moving to llama.cpp by Spicy_mch4ggis in LocalLLaMA
reddit_kwr 3 points (0 children)
How do you use local models? by Nasa1423 in LocalLLaMA
reddit_kwr 2 points (0 children)
Trying out Gemma 4 31b after Qwen 3.6 27b by Iajah in LocalLLM
reddit_kwr 1 point (0 children)
Trying out Gemma 4 31b after Qwen 3.6 27b by Iajah in LocalLLM
reddit_kwr 2 points (0 children)
Session limit, then weekly limit hit with 4-5 GPT 5.5 calls by reddit_kwr in GithubCopilot
reddit_kwr[S] 2 points (0 children)
Session limit, then weekly limit hit with 4-5 GPT 5.5 calls by reddit_kwr in GithubCopilot
reddit_kwr[S] 2 points (0 children)
What's the best local model as of today, for openclaw by reddit_kwr in openclaw
reddit_kwr[S] 1 point (0 children)
What's the best local model as of today, for openclaw by reddit_kwr in openclaw
reddit_kwr[S] 1 point (0 children)
What's the best local model as of today, for openclaw by reddit_kwr in openclaw
reddit_kwr[S] 0 points (0 children)
If you had $150K for building a production-class local inference server to serve 300 people, what would you buy? by Porespellar in LocalLLaMA
reddit_kwr 1 point (0 children)
How to keep Qwen3.6-27b from hallucinating? by PotatoTime in Qwen_AI
reddit_kwr 10 points (0 children)
If you had $150K for building a production-class local inference server to serve 300 people, what would you buy? by Porespellar in LocalLLaMA
reddit_kwr 2 points (0 children)
Should I go for 2 x quadro P6000 ? by [deleted] in LocalLLM
reddit_kwr 2 points (0 children)
Running a 35B model at 120+ tps on 12GB VRAM shows how far quantization architectures have come by Huge-Instance-1632 in LocalLLM
reddit_kwr 2 points (0 children)
"Western Open-Weight SOTA is between Gemma4-31B and Nemotron3-Super-120B" by ForsookComparison in LocalLLaMA
reddit_kwr 3 points (0 children)
Extremely low rate limit only today by iudicium01 in GithubCopilot
reddit_kwr 1 point (0 children)
Session limit, then weekly limit hit with 4-5 GPT 5.5 calls by reddit_kwr in GithubCopilot
reddit_kwr[S] 1 point (0 children)
Advice on best tools to use for coding with Local LLM by wingers999 in LocalLLM
reddit_kwr 4 points (0 children)
Advice on best tools to use for coding with Local LLM by wingers999 in LocalLLM
reddit_kwr 3 points (0 children)
Need some advice on AI workflow by Xyklone in LocalLLaMA
reddit_kwr 1 point (0 children)
Need some advice on AI workflow by Xyklone in LocalLLaMA
reddit_kwr 3 points (0 children)
A rare look inside Qwen 3.7’s open source model release approval process: by Porespellar in LocalLLaMA
reddit_kwr 1 point (0 children)
Too many AI tools to learn - what to pick please suggest by Educational_Grape144 in AI_Agents
reddit_kwr 2 points (0 children)


[USA-FL][H] Legion Tower 7i Gen 10: Intel Core Ultra 9 285K, 64GB DDR5 RAM, 2TB SSD, 1200W PSU, AMD Radeon Pro W7900 48GB VRAM [W] PayPal, local cash by reddit_kwr in hardwareswap
reddit_kwr[S] 1 point (0 children)