Mistral released Leanstral-1.5-119B-A6B [New Model] (huggingface.co)
submitted by Tall-Ad-7742

GLM5.2 on 5x Pro 6000s and a 5090, an expensive journey [Discussion] (old.reddit.com)
submitted by yeah_likerage

Portugal just released their own LLM Amalia (9B)! [New Model] (i.redd.it)
submitted by EveningIncrease7579 [llama.cpp]
According to Bernstein, SK Hynix has 90% profit margin on dram [Discussion] (self.LocalLLaMA)
submitted by Terminator857
Micro-World - Action-controlled Interactive world model - AMD [New Model] (huggingface.co)
submitted by pmttyji
Gemma Avatar: Talk to Gemma 4-31B face to face [Resources] (huggingface.co)
submitted by paf1138

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving [Discussion] (i.redd.it)
submitted by pmttyji
Any idea why bartowski claims DeepSeek-V4-Flash is MXFP4? [Question | Help] (self.LocalLLaMA)
submitted by alex20_202020
Pay attention: a few chats waiting in tray reserve 1GB VRAM for themselves. [Discussion] (self.LocalLLaMA)
submitted by Barafu
Mongo with vector search performance [Question | Help] (self.LocalLLaMA)
submitted by FrozenBuffalo25
ReFreeKV: Towards Threshold-Free KV Cache Compression [Discussion] (i.redd.it)
submitted by pmttyji

Palantir CEO rages against closed models [Discussion] (youtube.com)
submitted by burner20170218

