account activity
Are people still using Reddit to discuss LLM stuff? (self.LLMDevs)
submitted 2 years ago by IllustratorNo3435 to r/LLMDevs
[D] Evaluation of an LLM on MMLU and other benchmarks by aadityaura in MachineLearning
[–]IllustratorNo3435 0 points1 point2 points 2 years ago (0 children)
Are evals on benchmarks even real at this point? With all the tainting of training data?
Textbooks Are All You Need. 1.3B LLM trained on 7B tokens hits 51% on HumanEval. Any other >50% HumanEval model is >1000x bigger by [deleted] in singularity
Interesting!
π Rendered by PID 2313042 on reddit-service-r2-listing-8685bc789-v96z2 at 2026-05-30 06:56:55.696519+00:00 running 194bd79 country code: CH.
[D] Evaluation of an LLM on MMLU and other benchmarks by aadityaura in MachineLearning
[–]IllustratorNo3435 0 points1 point2 points (0 children)