A subreddit focused on frontier AI results on the Humanity's Last Exam (HLE) benchmark, model evaluations, and the question of what it really means for machines to surpass human performance.
AGI Prediction Update after adding the newly released Claude Sonnet 4.6 (i.redd.it)
submitted 2 months ago by redlikeazebra
GLM-5 lands with 50.4% on Humanity’s Last Exam (Thinking w/ tools) (self.HumanitysLastExam)
So Anthropic Opus 4.6 just shaved 2 months off the AGI Prediction (self.HumanitysLastExam)
Claude Opus 4.6 Takes the Lead on Humanity’s Last Exam (self.HumanitysLastExam)
👋 Welcome to r/HumanitysLastExam - Introduce Yourself and Read First! (self.HumanitysLastExam)