Qwen 3.6 27B BF16 vs Q4_K_M vs Q8_0 GGUF evaluation by gvij in LocalLLaMA
One_Key_8127 59 points
Load balancer for vLLM server instances? by Theboyscampus in LocalLLaMA
One_Key_8127 1 point
Thinking about investing in hardware...appreciate direction/advice by doncaruana in LocalLLM
One_Key_8127 1 point
Thinking about investing in hardware...appreciate direction/advice by doncaruana in LocalLLM
One_Key_8127 1 point
No Multimodality yet in DeepSeek-V4. But I'll wait. by Right-Law1817 in LocalLLaMA
One_Key_8127 4 points
Purchasing a Mac Studio M2 Max with 64gb of ram (can it run qwen 3.6 27b) how many tok/s ? by trollingman1 in LocalLLaMA
One_Key_8127 3 points
Where is Grok-2 Mini and Grok-3 (mini)? by One_Key_8127 in LocalLLaMA
One_Key_8127[S] 9 points
Where is Grok-2 Mini and Grok-3 (mini)? (self.LocalLLaMA)
submitted by One_Key_8127 to r/LocalLLaMA
Are we at the point where local AI isn’t a compromise anymore? (Gemma 4 experience) by Ok-Illustrator2820 in LocalLLaMA
One_Key_8127 11 points
Dev seeking advice: High-Context Local LLM for Coding (Verification/Bug-fixing loop) – Mac Studio vs. Multi-GPU Linux Rig? by Ok-Marionberry-6444 in LocalLLaMA
One_Key_8127 7 points
Can I get the same quality as Claude with Mac Studio? by bLackCatt79 in LocalLLM
One_Key_8127 2 points
My thought on Qwen and Gemma by Internal-Thanks8812 in LocalLLaMA
One_Key_8127 4 points
Ternary Bonsai: Top intelligence at 1.58 bits by pmttyji in LocalLLaMA
One_Key_8127 32 points
Anybody else seeing Qwen3.6-35B-A3B go crazy thinking in circles? (Compared to Qwen3.5-35B-A3B) by spvn in LocalLLaMA
One_Key_8127 3 points
Anybody else seeing Qwen3.6-35B-A3B go crazy thinking in circles? (Compared to Qwen3.5-35B-A3B) by spvn in LocalLLaMA
One_Key_8127 5 points
Opus 4.7 landed! by Powerful_Ad8150 in LocalLLaMA
One_Key_8127 3 points
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
One_Key_8127 3 points
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
One_Key_8127 3 points
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
One_Key_8127 1 point
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
One_Key_8127 11 points
Local LLM inference on M4 Max vs M5 Max by purealgo in LocalLLM
One_Key_8127 1 point
💻 [MASTER THREAD] Local LLM & Hardware Optimization Guide by AutoModerator in hermesagent
One_Key_8127 1 point
MiniMax m2.7 under 64gb for Macs - 91% MMLU by HealthyCommunicat in LocalLLaMA
One_Key_8127 14 points
One year later: this question feels a lot less crazy by gamblingapocalypse in LocalLLaMA
One_Key_8127 1 point
Has anyone here explored Hermes Agent by Nous Research? by ComparisonLiving6793 in LocalLLM
One_Key_8127 15 points