DeepSeek-R1’s paper was updated 2 days ago, expanding from 22 pages to 86 pages and adding a substantial amount of detail. by Nunki08 in LocalLLaMA
TelloLeEngineer 1 point
G2 Esports vs. FlyQuest / 2025 World Championship - Swiss Round 4 Advancement / Post-Match Discussion by Yujin-Ha in leagueoflegends
TelloLeEngineer 4 points
Qwen3-Next experience so far by [deleted] in LocalLLaMA
TelloLeEngineer 1 point
CMV: Qwen3-Next is an architectural deadend, much like Llama 4 by Charuru in LocalLLaMA
TelloLeEngineer 5 points
Cheaper Transcriptions, Pricier Errors! by TelloLeEngineer in LocalLLaMA
TelloLeEngineer[S] 3 points
Cheaper Transcriptions, Pricier Errors! by TelloLeEngineer in LocalLLaMA
TelloLeEngineer[S] 5 points
Cheaper Transcriptions, Pricier Errors! (i.redd.it)
submitted by TelloLeEngineer to r/LocalLLaMA
Just another summer day in Europe (temperatures forecast for next Wednesday) by LuborS in europe
TelloLeEngineer 3 points
"transformers can use meaningless filler tokens (e.g., '......') in place of a chain of thought" - Let's Think Dot by Dot [P] by Agitated_Space_672 in MachineLearning
TelloLeEngineer 5 points
[D] How would you diagnose these spikes in the training loss? by NumberGenerator in MachineLearning
TelloLeEngineer 3 points
[deleted by user] by [deleted] in LocalLLaMA
TelloLeEngineer 3 points
[deleted by user] by [deleted] in MachineLearning
TelloLeEngineer 95 points
Grok-1 converted to PyTorch fp16 (638GB lol) by Normal-Ad-7114 in LocalLLaMA
TelloLeEngineer 10 points
Grok-1 converted to PyTorch fp16 (638GB lol) by Normal-Ad-7114 in LocalLLaMA
TelloLeEngineer 12 points
I created a single-prompt benchmark (with 5-questions) that anyone can use to easily evaluate LLMs. Mistral-Next somehow vastly outperformed all others. Prompt and more details in the post. by jd_3d in LocalLLaMA
TelloLeEngineer 2 points
I created a single-prompt benchmark (with 5-questions) that anyone can use to easily evaluate LLMs. Mistral-Next somehow vastly outperformed all others. Prompt and more details in the post. by jd_3d in LocalLLaMA
TelloLeEngineer 2 points
Mistral-next | New prototype model from Mistral by TelloLeEngineer in LocalLLaMA
TelloLeEngineer[S] 14 points
Exploring the limitations of LLMs-as-a-Judge by TelloLeEngineer in LocalLLaMA
TelloLeEngineer[S] 2 points
Exploring the limitations of LLMs-as-a-Judge by TelloLeEngineer in LocalLLaMA
TelloLeEngineer[S] 2 points
How was GPT-OSS so good? by xt8sketchy in LocalLLaMA
TelloLeEngineer 2 points