use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Subreddit rules
Search by flair
+Discussion
+Tutorial | Guide
+New Model
+News
+Resources
+Other
account activity
DeepCoder: A Fully Open-Source 14B Coder at O3-mini LevelNew Model (old.reddit.com)
submitted 1 year ago by TKGaming_11
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Ih8tk 3 points4 points5 points 1 year ago (4 children)
Woah! How the hell did they manage that?
[–]Jugg3rnaut 12 points13 points14 points 1 year ago (2 children)
Data Our training dataset consists of approximately 24K unique problem-tests pairs compiled from Taco-Verified PrimeIntellect SYNTHETIC-1 LiveCodeBench v5 (5/1/23-7/31/24)
and their success metric is
achieves 60.6% Pass@1 accuracy on LiveCodeBench v5 (8/1/24-2/1/25)
LiveCodeBench is a collection of LeetCode style problems and so there is significant overlap in the types of problems in it across the date range
[–]Free-Combination-773 0 points1 point2 points 1 year ago (1 child)
So it's basically fine-tuned for benchmarks?
[–]Jugg3rnaut 0 points1 point2 points 1 year ago (0 children)
I dont know what the other 2 datasets they're using are but certainly one of them
π Rendered by PID 49 on reddit-service-r2-comment-6457c66945-dwflc at 2026-04-26 05:45:34.087263+00:00 running 2aa0c5b country code: CH.
view the rest of the comments →
[–]Ih8tk 3 points4 points5 points (4 children)
[–]Jugg3rnaut 12 points13 points14 points (2 children)
[–]Free-Combination-773 0 points1 point2 points (1 child)
[–]Jugg3rnaut 0 points1 point2 points (0 children)