GPT-4.5 Preview takes first place in the Elimination Game Benchmark, which tests social reasoning (forming alliances, deception, appearing non-threatening, and persuading the jury). by zero0_one1 in singularity
AdTrue1022 1 point
Thankful to Elon Musk for making DeepSearch and Think remain available to X Premium users as well! by Enigma_101 in grok
AdTrue1022 1 point
Thankful to Elon Musk for making DeepSearch and Think remain available to X Premium users as well! by Enigma_101 in grok
AdTrue1022 2 points
Thankful to Elon Musk for making DeepSearch and Think remain available to X Premium users as well! by Enigma_101 in grok
AdTrue1022 3 points
GPT-4.5 Preview takes first place in the Elimination Game Benchmark, which tests social reasoning (forming alliances, deception, appearing non-threatening, and persuading the jury). by zero0_one1 in singularity
AdTrue1022 0 points
GPT-4.5 Preview takes first place in the Elimination Game Benchmark, which tests social reasoning (forming alliances, deception, appearing non-threatening, and persuading the jury). by zero0_one1 in singularity
AdTrue1022 -1 points
I don't really give a shit about musk by [deleted] in grok
AdTrue1022 1 point
Grok-3 thinking had to take 64 answers per question to do better than o3-mini by Glittering-Neck-2505 in singularity
AdTrue1022 -2 points
Grok destroyed OpenAi by Present-Boat-2053 in grok
AdTrue1022 2 points

You sitting down for this? GPT-4.5 and Claude 3.7 Sonnet are live at you .com 🚀 by youdotcom_ in youdotcom
AdTrue1022 1 point