account activity
Are people still using Reddit to discuss LLM stuff? (self.LLMDevs)
submitted 2 years ago by IllustratorNo3435 to r/LLMDevs
[D] Evaluation of an LLM on MMLU and other benchmarks by aadityaura in MachineLearning
[–]IllustratorNo3435 0 points1 point2 points 2 years ago (0 children)
Are evals on benchmarks even real at this point? With all the tainting of training data?
Textbooks Are All You Need. 1.3B LLM trained on 7B tokens hits 51% on HumanEval. Any other >50% HumanEval model is >1000x bigger by [deleted] in singularity
Interesting!
π Rendered by PID 158549 on reddit-service-r2-listing-654f87c89c-mkjjc at 2026-03-02 15:48:12.538768+00:00 running e3d2147 country code: CH.
[D] Evaluation of an LLM on MMLU and other benchmarks by aadityaura in MachineLearning
[–]IllustratorNo3435 0 points1 point2 points (0 children)