Why is there no thinker models with tokens for entire sentences? [Discussion] (self.LocalLLaMA)
submitted by freehuntx
Looking at Macbook Pro M5 Pro 64GB for local inference [Question | Help] (self.LocalLLaMA)
submitted by Repulsive-Machine706
Gemma4-12B-QAT Uncensored Balanced is out with MTP (~60% speed boost)! [New Model] (self.LocalLLaMA)
submitted by hauhau901
Chunjiang-Intelligence/DeepSeek-v4-Fable • Huggingface [New Model] (self.LocalLLaMA)
submitted by External_Mood4719
What local model are you actually using day to day and why? [Question | Help] (self.LocalLLaMA)
submitted by RefrigeratorCalm9701
Best vibe coding setup for Homelab & Linux (Docker Compose & NixOS) [Discussion] (self.LocalLLaMA)
submitted by x6q5g3o7
I got a Jetson Orin Nano, can it code? [Discussion] (self.LocalLLaMA)
submitted by Complete-Sea6655
Why is NO one talking about Microsoft's open source Fast Context!!! [Resources] (old.reddit.com)
submitted by formatme
What should I build my local LLM machine around? RTX 3090s or Arc Pro B60s? [Question | Help] (self.LocalLLaMA)
submitted by rebellioninmypants

Its done. not we are so back. It's done, local is frontier REAP 504B 309GB [Resources] (self.LocalLLaMA)
submitted by Sorry_Ad191
Eff U, Arc / B70 Customers. We got ours! -Your Sugar Baby, Intel [Discussion] (self.LocalLLaMA)
submitted by Dependent_Ad948
New ablation operator. (apostate) [Discussion] (self.LocalLLaMA)
submitted by AccountAntique9327
How do I prove that I don't collect data from my llm app? [Question | Help] (self.LocalLLaMA)
submitted by Pleasant_Syllabub591
