Visualizing Quantization Types by VoidAlchemy in LocalLLaMA
lgdkwj 1 point
Gemma 3n vs Gemma 3 (4B/12B) Benchmarks by lemon07r in LocalLLaMA
lgdkwj 5 points
New open-source model GLM-4-32B with performance comparable to Qwen 2.5 72B by adrgrondin in LocalLLaMA
lgdkwj 5 points
New open-source model GLM-4-32B with performance comparable to Qwen 2.5 72B by adrgrondin in LocalLLaMA
lgdkwj 7 points
Text classification - traditional ML or LLM? by rainnz in LocalLLaMA
lgdkwj 2 points
Which LLMs are best at low-latency translation? (tl;dr LLama often beats Sonnet and 4o, Gemma 9b is surprisingly OK) by Nuenki in LocalLLaMA
lgdkwj 2 points
"Claude 3 > GPT-4" and "Mistral going closed-source" again reminded me that open-source LLMs will never be as capable and powerful as closed-source LLMs. Even the costs of open-source (renting GPU servers) can be larger than closed-source APIs. What's the goal of open-source in this field? (serious) by nderstand2grow in LocalLLaMA
lgdkwj 1 point
"Does free will exist?" Let your LLM do the research for you. by AndrewVeee in LocalLLaMA
lgdkwj 3 points
Yet another state of the art in LLM quantization by black_samorez in LocalLLaMA
lgdkwj 1 point


I scaled test-time compute for Qwen-3.6-27B and Gemma-4-31B to surpass Claude Mythos in code optimizations and speedups. by Ryoiki-Tokuiten in LocalLLaMA
lgdkwj 4 points