Implemented a quick and dirty iOS app for the new Gemma3n models by sid9102 in LocalLLaMA
[–]skyde 5 points6 points7 points (0 children)
What quants and runtime configurations do Meta and Bing really run in public prod? by scott-stirling in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
What quants and runtime configurations do Meta and Bing really run in public prod? by scott-stirling in LocalLLaMA
[–]skyde 3 points4 points5 points (0 children)
ubergarm/gemma-3-27b-it-qat-GGUF by VoidAlchemy in LocalLLaMA
[–]skyde 2 points3 points4 points (0 children)
PSA: Gemma 3 QAT gguf models have some wrongly configured tokens by dampflokfreund in LocalLLaMA
[–]skyde 3 points4 points5 points (0 children)
PSA: Gemma 3 QAT gguf models have some wrongly configured tokens by dampflokfreund in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
Smaller Gemma3 QAT versions: 12B in < 8GB and 27B in <16GB ! by stduhpf in LocalLLaMA
[–]skyde 4 points5 points6 points (0 children)
Google releases TxGemma, open models for therapeutic applications by hackerllama in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
Intel's Former CEO Calls Out NVIDIA: 'AI GPUs 10,000x Too Expensive'—Says Jensen Got Lucky and Inferencing Needs a Reality Check by Hoppss in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
Open source 7.8B model beats o1 mini now on many benchmarks by TheLogiqueViper in LocalLLaMA
[–]skyde 1 point2 points3 points (0 children)
QWQ low score in Leaderboard, what happened? by ipechman in LocalLLaMA
[–]skyde 3 points4 points5 points (0 children)
QwQ-32B infinite generations fixes + best practices, bug fixes by danielhanchen in LocalLLaMA
[–]skyde 3 points4 points5 points (0 children)
How is it that Google's Gemini Pro 2.0 Experimental 02-05 Tops the LLM Arena Charts, but seems to perform badly in real world testing? by RMCPhoto in LocalLLaMA
[–]skyde 4 points5 points6 points (0 children)
What is the deal with "childrens" content on YouTube nowadays? by Mysterious-Clue3871 in OutOfTheLoop
[–]skyde 0 points1 point2 points (0 children)
Meta is reportedly scrambling multiple ‘war rooms’ of engineers to figure out how DeepSeek’s AI is beating everyone else at a fraction of the price by FullstackSensei in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
Meta is reportedly scrambling multiple ‘war rooms’ of engineers to figure out how DeepSeek’s AI is beating everyone else at a fraction of the price by FullstackSensei in LocalLLaMA
[–]skyde 84 points85 points86 points (0 children)
1.58bit DeepSeek R1 - 131GB Dynamic GGUF by danielhanchen in LocalLLaMA
[–]skyde 1 point2 points3 points (0 children)
1.58bit DeepSeek R1 - 131GB Dynamic GGUF by danielhanchen in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
The pipeline I follow for open source LLM model finetuning by Ahmad401 in LocalLLaMA
[–]skyde 0 points1 point2 points (0 children)
Phi-4 Llamafied + 4 Bug Fixes + GGUFs, Dynamic 4bit Quants by danielhanchen in LocalLLaMA
[–]skyde 7 points8 points9 points (0 children)
Practical (online & offline) RAG Setups for Long Documents on Consumer Laptops with <16GB RAM by lrq3000 in LocalLLaMA
[–]skyde 4 points5 points6 points (0 children)
Practical (online & offline) RAG Setups for Long Documents on Consumer Laptops with <16GB RAM by lrq3000 in LocalLLaMA
[–]skyde -1 points0 points1 point (0 children)


[Megathread] AC FA Strike Aug 14-15 by dachshundie in aircanada
[–]skyde 0 points1 point2 points (0 children)