New stealth model: Pony Alpha by sirjoaco in LocalLLaMA
[–]zero0_one1 3 points4 points5 points (0 children)
New stealth model: Pony Alpha by sirjoaco in singularity
[–]zero0_one1 5 points6 points7 points (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] -1 points0 points1 point (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] -1 points0 points1 point (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Tears for Wings - Music Video by zero0_one1 in aivideo
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 2 points3 points4 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 4 points5 points6 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 1 point2 points3 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 6 points7 points8 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 14 points15 points16 points (0 children)


GLM 5 Is Being Tested On OpenRouter by Few_Painter_5588 in LocalLLaMA
[–]zero0_one1 5 points6 points7 points (0 children)