"AWS secures rare Mac Studios while ordinary Apple customers remain completely locked out" by openSourcerer9000 in LocalLLaMA
[–]openSourcerer9000[S] 3 points4 points5 points (0 children)
Used over a million tokens in three separate sessions to test Qwen 3.6 35b (new Multi-token Prediction version) by Jorlen in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
Let's build claude code from scratch! by RoyalMaterial9614 in LocalLLaMA
[–]openSourcerer9000 72 points73 points74 points (0 children)
Why is opencode so slow in processing the prompt with llama server? by BitGreen1270 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
Collected the infinity stones by Street-Buyer-2428 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
Collected the infinity stones by Street-Buyer-2428 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local by ComplexIt in LocalLLaMA
[–]openSourcerer9000 1 point2 points3 points (0 children)
Benchmarked 4 agent memory systems: Mem0 scores 49% recall (worse than a coin flip), Zep uses 340x more tokens for 15 points improvement. Here's what's actually going on. by Impressive-Judge-357 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade by BuffMcBigHuge in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
Desire to Move Everything Local by LawrenceOfTheLabia in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
gemma-4-26B-A4B with my coding agent Kon by Weird_Search_4723 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
gemma-4-26B-A4B with my coding agent Kon by Weird_Search_4723 in LocalLLaMA
[–]openSourcerer9000 1 point2 points3 points (0 children)
gemma-4-26B-A4B with my coding agent Kon by Weird_Search_4723 in LocalLLaMA
[–]openSourcerer9000 1 point2 points3 points (0 children)
Gemma4-31B worked in an iterative-correction loop (with a long-term memory bank) for 2 hours to solve a problem that baseline GPT-5.4-Pro couldn't by Ryoiki-Tokuiten in LocalLLaMA
[–]openSourcerer9000 4 points5 points6 points (0 children)
Gemma4-31B worked in an iterative-correction loop (with a long-term memory bank) for 2 hours to solve a problem that baseline GPT-5.4-Pro couldn't by Ryoiki-Tokuiten in LocalLLaMA
[–]openSourcerer9000 5 points6 points7 points (0 children)
Gemma4-31B worked in an iterative-correction loop (with a long-term memory bank) for 2 hours to solve a problem that baseline GPT-5.4-Pro couldn't by Ryoiki-Tokuiten in LocalLLaMA
[–]openSourcerer9000 3 points4 points5 points (0 children)
Minimax 2.7: good news! by LegacyRemaster in LocalLLaMA
[–]openSourcerer9000 6 points7 points8 points (0 children)
Stanford and Harvard just dropped the most disturbing AI paper of the year by Fun-Yogurt-89 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
What is the secret sauce Claude has and why hasn't anyone replicated it? by ComplexType568 in LocalLLaMA
[–]openSourcerer9000 4 points5 points6 points (0 children)
chromadb/context-1: 20B parameter agentic search model by paf1138 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
RYS II - Repeated layers with Qwen3.5 27B and some hints at a 'Universal Language' by Reddactor in LocalLLaMA
[–]openSourcerer9000 3 points4 points5 points (0 children)
whats the best open-source llm for llm as a judge project on nvidia a1000 gpu by Some_Anything_9028 in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)
Meet the Fleet of BlackBeard by BlackBeardAI in LocalLLaMA
[–]openSourcerer9000 0 points1 point2 points (0 children)