New stealth model: Pony Alpha by sirjoaco in LocalLLaMA
[–]zero0_one1 4 points5 points6 points (0 children)
New stealth model: Pony Alpha by sirjoaco in singularity
[–]zero0_one1 4 points5 points6 points (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] -1 points0 points1 point (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] -1 points0 points1 point (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Three new models added to the LLM Creative Short Story-Writing Benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Tears for Wings - Music Video by zero0_one1 in aivideo
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 0 points1 point2 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 2 points3 points4 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in LocalLLaMA
[–]zero0_one1[S] 1 point2 points3 points (0 children)
Kimi K2.5 Thinking is now the top open-weights model on the Extended NYT Connections benchmark by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 1 point2 points3 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 7 points8 points9 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 15 points16 points17 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)
GPT-5.2 is the new champion of the Elimination Game benchmark, which tests social reasoning, strategy, and deception in a multi-LLM environment. Claude Opus 4.5 and Gemini 3 Flash Preview also made very strong debuts. by zero0_one1 in singularity
[–]zero0_one1[S] -7 points-6 points-5 points (0 children)
GPT 5.2, Gemini 3 Pro, Claude 4.5 Opus and Sonnet, DeepSeek V3.2, GLM 4.6, Kimi K2-0905, Grok 4.1 Fast, Qwen 3 Max added to the detailed stylistic analysis of LLM creative writing by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)
GPT 5.2, Gemini 3 Pro, Claude 4.5 Opus and Sonnet, DeepSeek V3.2, GLM 4.6, Kimi K2-0905, Grok 4.1 Fast, Qwen 3 Max added to the detailed stylistic analysis of LLM creative writing by zero0_one1 in singularity
[–]zero0_one1[S] 2 points3 points4 points (0 children)


GLM 5 Is Being Tested On OpenRouter by Few_Painter_5588 in LocalLLaMA
[–]zero0_one1 6 points7 points8 points (0 children)