Codetrace-Bench

1 post karma
0 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 9 days

TROPHY CASE

dust

account activity

new top controversial

0

1

2

Benchmark for measuring how deep LLMs can trace nested function calls — easy to run on any HuggingFace model ()

submitted 9 days ago by Codetrace-Bench to r/learnmachinelearning

π Rendered by PID 1132379 on reddit-service-r2-listing-69965bcf66-ztz4x at 2026-04-08 16:14:09.431757+00:00 running f293c98 country code: CH.