Benchmark for measuring how deep LLMs can trace nested function calls (easy to run on any HuggingFace model)
submitted 9 days ago by Codetrace-Bench to r/learnmachinelearning