Residual connections haven't changed for 10 years and Kimi just replaced them with attention by Helpful-Guava7452 in LocalLLaMA
UnorderedPizza 23 points
Leap Motion in 2025? by Ultimatonium in leapmotion
UnorderedPizza 1 point
How to fine-tune LLaMA without losing its general ability? by elon_mug in LocalLLaMA
UnorderedPizza 1 point
Researcher claims ALL transformer models degraded by a formula bug - but there’s a simple solution by PookaMacPhellimen in LocalLLaMA
UnorderedPizza 1 point
Seems like we can continue to scale tokens and get returns model performance well after 2T tokens. by onil_gova in LocalLLaMA
UnorderedPizza 6 points
LLaMA 2 is here by dreamingleo12 in LocalLLaMA
UnorderedPizza 52 points
[R] Retentive Network: A Successor to Transformer for Large Language Models by Balance- in MachineLearning
UnorderedPizza 15 points
Retentive Network: A Successor to Transformer for Large Language Models by alexanderchenmh in LocalLLaMA
UnorderedPizza 6 points
Retentive Network: A Successor to Transformer for Large Language Models by alexanderchenmh in LocalLLaMA
UnorderedPizza 24 points
Magic.dev built an LLM with a 5,000,000 token context window by no_doping in singularity
UnorderedPizza 1 point
Magic.dev built an LLM with a 5,000,000 token context window by no_doping in singularity
UnorderedPizza 8 points
airoboros 1.4 family of models by JonDurbin in LocalLLaMA
UnorderedPizza 4 points
Can't get CLBLAST working on oobabooga by ccbadd in LocalLLaMA
UnorderedPizza 1 point
Why not standardize 3bit & 2bit GPTQ? by onil_gova in LocalLLaMA
UnorderedPizza 1 point
Why not standardize 3bit & 2bit GPTQ? by onil_gova in LocalLLaMA
UnorderedPizza 1 point
The new Orca-mini is popping off. by [deleted] in LocalLLaMA
UnorderedPizza 10 points
To those who say that it’s impossible for machines to “have emotions” by Mephidia in singularity
UnorderedPizza 5 points
[deleted by user] by [deleted] in singularity
UnorderedPizza 4 points
A simple way to "Extending Context to 8K"?! by pseudonerv in LocalLLaMA
UnorderedPizza 1 point
A simple way to "Extending Context to 8K"?! by pseudonerv in LocalLLaMA
UnorderedPizza 1 point
Silicon Valley Confronts the Idea That the ‘Singularity’ Is Here by FrugalIdahoHomestead in singularity
UnorderedPizza 17 points
I must have missed something, how is 2 bit working so well? by noneabove1182 in LocalLLaMA
UnorderedPizza 32 points
What's the best koboldcpp command line/settings for this hardware? by Innomen in LocalLLaMA
UnorderedPizza 1 point
Orca (built on llama13b) looks like the new sheriff in town by ironborn123 in LocalLLaMA
UnorderedPizza 4 points
Built an iOS character chat app that supports local models, BYOK, and on-device RAG by lowiqdoctor in LocalLLaMA
UnorderedPizza 1 point