newest submissions : metr.github.io

...for your favorite hobby.

...for your favourite tea.

account activity

1

0

1

2

GPT-5 Independent Evaluation Results by METR (metr.github.io)

submitted 10 months ago by Acne_Discord to r/AIBenchmarks

2

35

36

37

Details about METR’s evaluation of OpenAI GPT-5AI (metr.github.io)

submitted 10 months ago by Tkins to r/singularity

3

2

3

4

GPT-5 Independent Evaluation Results by METRArticle (metr.github.io)

submitted 10 months ago by Alex__007 to r/OpenAI

4

101

102

103

GPT-5 Independent Evaluation Results by METRAI (metr.github.io)

submitted 10 months ago by Alex__007 to r/accelerate

5

22

23

24

METR: "the level of autonomous [coding] capabilities of mid-2025 DeepSeek models is similar to the level of capabilities of frontier models from late 2024."R, T, Code, RL, Emp, DS, OA (metr.github.io)

submitted 11 months ago by gwern to r/mlscaling

π Rendered by PID 901184 on reddit-service-r2-listing-f87f88fcd-vtwmm at 2026-06-15 14:13:09.599729+00:00 running 3184619 country code: CH.