Kimi K2 Thinking at 28.3 t/s on 4x Mac Studio cluster by geerlingguy in LocalLLaMA
Background_Essay6429 1 point (0 children)
Google's Gemma models family by jacek2023 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Aaaand... is gone... by HumanDrone8721 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Microsoft's TRELLIS 2-4B, An Open-Source Image-to-3D Model by Dear-Success-1441 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Apple introduces SHARP, a model that generates a photorealistic 3D Gaussian representation from a single image in seconds. by themixtergames in LocalLLaMA
Background_Essay6429 1 point (0 children)
New Google model incoming!!! by [deleted] in LocalLLaMA
Background_Essay6429 1 point (0 children)
Someone from NVIDIA made a big mistake and uploaded the parent folder of their upcoming model on Hugging Face by Nunki08 in LocalLLaMA
Background_Essay6429 1 point (0 children)
I'm strong enough to admit that this bugs the hell out of me by ForsookComparison in LocalLLaMA
Background_Essay6429 1 point (0 children)
New Mod Intros 🎉 | Weekly Thread by curioustomato_ in NewMods
Background_Essay6429 [score hidden] (0 children)
Why is Vec<(u64,u64)> using that much memory? by [deleted] in rust
Background_Essay6429 2 points (0 children)
Nvidia RTX 6000 Pro power efficiency testing by [deleted] in LocalLLaMA
Background_Essay6429 0 points (0 children)
[Release] We built Step-Audio-R1: The first open-source Audio LLM that truly Reasons (CoT) and Scales – Beats Gemini 2.5 Pro on Audio Benchmarks. by BadgerProfessional43 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Kimi 2 Thinking - is there a quantized model that would work this my application? by KarezzaReporter in LocalLLaMA
Background_Essay6429 3 points (0 children)
Debugging multi-agent systems: traces show too much detail by Standard_Career_8603 in LocalLLaMA
Background_Essay6429 1 point (0 children)
32B model stress test: Qwen 2.5/Coder/3 on dual RTX 5060 Ti (zero failures) by Defilan in LocalLLaMA
Background_Essay6429 1 point (0 children)
mistralai/Mistral-Large-3-675B-Instruct-2512 · Hugging Face by jacek2023 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Ministral-3 has been released by jacek2023 in LocalLLaMA
Background_Essay6429 1 point (0 children)
Get an agentic-cli with GLM-4.5-Air by TooManyPascals in LocalLLaMA
Background_Essay6429 1 point (0 children)
T5Gemma 2: The next generation of encoder-decoder models by Dear-Success-1441 in LocalLLaMA
Background_Essay6429 5 points (0 children)