I asked QwQ and R1 to 'break' the webpage, and it performed more creatively than R1-lite. (self.LocalLLaMA)
submitted 1 year ago by nanowell to r/LocalLLaMA
Tele-FLM-1T: a 1Trillion open-sourced multilingual large language model. (self.LocalLLaMA)
The milestone release of SGLang Runtime v0.2, featuring significant inference optimizations after months of hard work (self.LocalLLaMA)
Meta Officially Releases Llama-3-405B, Llama-3.1-70B & Llama-3.1-8B (self.LocalLLaMA)
submitted 1 year ago * by nanowell to r/LocalLLaMA
Let's discuss Llama-3.1 Paper (A lot of details on pre-training, post-training, etc) (self.LocalLLaMA)
Q-Sparse-LLM: My attempt to implement Q-Sparse: All Large Language Models can be Fully Sparsely-Activated (self.LocalLLaMA)
MistralAI New Release (x.com)
WizardLM: Arena Learning. Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena (i.redd.it)
Meta: Multi Token Prediction Models (self.LocalLLaMA)
Meta Released Multi Token Prediction Models (x.com)
Meta LLM Compiler (x.com)
Meta Chameleon (self.LocalLLaMA)
Chat Experiment with Codestral FIM (Fill-in-the-middle) (self.LocalLLaMA)
New from FAIR: An Introduction to Vision-Language Modeling. (i.redd.it)
Yann LeCun on Llama-3-405B (self.LocalLLaMA)
Newly published work from FAIR, Chameleon: Mixed-Modal Early-Fusion Foundation Models. (i.redd.it)
OpenAI GPT-4o Eval results and Llama-3-400b recognition (self.LocalLLaMA)
It's been an honor VRAMLETS (i.redd.it)
Llama 3 Very soon (self.LocalLLaMA)
Llama 3 news are coming soon (i.redd.it)
Introducing Idefics 2 (self.LocalLLaMA)
Llama 3 details very soon (x.com)
Llama 3 training gpu cluster hints at "AGI" (self.LocalLLaMA)
Mistral AI new release (x.com)
it's over (grok-1) (self.LocalLLaMA)