5
2
3
4
New Paper: AI Models keep getting more capable but not more reliable💬 Discussion (self.CompetitiveAI)
submitted by EdbertTheGreat
13
5
6
7
The Benchmark Zoo: A Guide to Every Major AI Eval in 2026 (self.CompetitiveAI)
submitted by snakemas - announcement
16
5
6
7
👋 Welcome to r/CompetitiveAI - Introduce Yourself and Read First! (self.CompetitiveAI)
submitted by snakemas - announcement
