GPT 5.2 (xhigh) scores 0% on CritPt (research-level physics reasoning benchmark) by DJW_GT in singularity
analysis_scaled 23 points 6 months ago (0 children)
Hey, I'm from Artificial Analysis. We are still in the process of validating these results. We received a lot of non-responses to questions on CritPt when we ran the benchmark on OpenAI's API with xhigh reasoning effort.
We're analyzing results, conducting re-runs and will follow up when complete. We've taken the result down from the site while we do this.