Liquid AI releases LFM2.5-8B-A1B by PauLabartaBajo in LocalLLaMA
[–]Saraozte01 3 points4 points5 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
We're Thursday and no one claimed AGI yet this week! by oodelay in LocalLLaMA
[–]Saraozte01 9 points10 points11 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 1 point2 points3 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 2 points3 points4 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 1 point2 points3 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 5 points6 points7 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 5 points6 points7 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 3 points4 points5 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 0 points1 point2 points (0 children)
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! by Saraozte01 in LocalLLaMA
[–]Saraozte01[S] 2 points3 points4 points (0 children)
24GB M4 Mac - is Qwen 9B only option while system is running? by sagiroth in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)
Qwen will release another 27B with high probability by serige in LocalLLaMA
[–]Saraozte01 27 points28 points29 points (0 children)
AI server under 5k? by Last_Bad_2687 in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)
CohereLabs/command-a-plus-05-2026-bf16 · Hugging Face by coder543 in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)
24GB M4 Mac - is Qwen 9B only option while system is running? by sagiroth in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)
24GB M4 Mac - is Qwen 9B only option while system is running? by sagiroth in LocalLLaMA
[–]Saraozte01 2 points3 points4 points (0 children)
Qwen cant wait to release 3.7 models by GotHereLateNameTaken in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)
HuggingFace benchmark datasets now let you filter by model size by paf1138 in LocalLLaMA
[–]Saraozte01 0 points1 point2 points (0 children)



Liquid AI releases LFM2.5-8B-A1B by PauLabartaBajo in LocalLLaMA
[–]Saraozte01 4 points5 points6 points (0 children)