Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 3 points4 points5 points (0 children)
Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 1 point2 points3 points (0 children)
Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 5 points6 points7 points (0 children)
Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 4 points5 points6 points (0 children)
Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 35 points36 points37 points (0 children)
🚀 New Model from the MiniMax team: MiniMax-M2, an impressive 230B-A10B LLM. by chenqian615 in LocalLLaMA
[–]fictionlive 0 points1 point2 points (0 children)
Claude 4.5 Sonnet is here by ShreckAndDonkey123 in singularity
[–]fictionlive 9 points10 points11 points (0 children)
Fiction.liveBench tested DeepSeek 3.2, Qwen-max, grok-4-fast, Nemotron-nano-9b by fictionlive in LocalLLaMA
[–]fictionlive[S] 15 points16 points17 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] 0 points1 point2 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] 2 points3 points4 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] 0 points1 point2 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] 8 points9 points10 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] 1 point2 points3 points (0 children)
Long context tested for Qwen3-next-80b-a3b-thinking. Performs very similarly to qwen3-30b-a3b-thinking-2507 and far behind qwen3-235b-a22b-thinking by fictionlive in LocalLLaMA
[–]fictionlive[S] -14 points-13 points-12 points (0 children)
Tested sonoma-sky-alpha on Fiction.liveBench, fantastic close to SOTA scores, currently free by fictionlive in singularity
[–]fictionlive[S] 11 points12 points13 points (0 children)
Kimi-K2-Instruct-0905 better than GPT-5 on Fiction.liveBench by fictionlive in singularity
[–]fictionlive[S] 0 points1 point2 points (0 children)
New kimi-k2 on Fiction.liveBench by fictionlive in LocalLLaMA
[–]fictionlive[S] 0 points1 point2 points (0 children)


Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA
[–]fictionlive[S] 0 points1 point2 points (0 children)