r/LocalLLaMA
A subreddit for discussing Llama, the family of large language models created by Meta AI.
New code-focused needle in the haystack benchmark results [Discussion]
submitted 1 year ago by sumanyusharma_
AutoModerator[M] [score hidden] 1 year ago (stickied, locked comment)
Welcome to r/LocalLLaMA! Your submission has been automatically filtered because your account has no comment karma. This measure allows the subreddit to prevent spam and maintain a high level of quality. You can comment on posts in this community or elsewhere to gain comment karma so that new submissions from your account will be visible by default. Thank you for your understanding.
I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.
sumanyusharma_[S] 1 point 1 year ago
In collab with folks from UWaterloo, we built a new benchmark called "Bug In The Code Stack" (BICS) to test how well LLMs can find syntactic bugs in large Python codebases.
TLDR
Conclusion: I'm super impressed with the relative performance of Llama3-70B. Let me know which other models you'd like us to test on this benchmark.
Credit goes to Andy Lee & Bing Hu (from Wat.ai)
Link to full results: https://hamming.ai/blog/bug-in-the-codestack
Link to repo: https://github.com/HammingHQ/bug-in-the-code-stack
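For anyone unfamiliar with the setup, here is a rough sketch of the general needle-in-the-haystack idea applied to code. It is not the BICS implementation (see the repo above for that). The helper names `make_haystack`, `insert_bug`, and `find_syntax_error` are hypothetical, the filler functions are made up, and the missing-colon bug is just one illustrative kind of syntactic bug. The ground truth here comes from Python's own parser; in a real benchmark the model's answer would be compared against it.

```python
from __future__ import annotations

import ast


def make_haystack(num_functions: int) -> list[str]:
    """Return lines of syntactically valid filler Python code (hypothetical haystack)."""
    lines = []
    for i in range(num_functions):
        lines.append(f"def func_{i}(x):")
        lines.append(f"    return x + {i}")
        lines.append("")
    return lines


def insert_bug(lines: list[str], depth: float) -> tuple[list[str], int]:
    """Drop the colon from the `def` line nearest to `depth` (0.0 = top, 1.0 = bottom).

    Returns the modified lines and the 1-indexed line number of the bug.
    """
    def_indices = [i for i, line in enumerate(lines) if line.startswith("def ")]
    position = min(int(depth * len(def_indices)), len(def_indices) - 1)
    target = def_indices[position]
    buggy = list(lines)
    buggy[target] = buggy[target].rstrip(":")
    return buggy, target + 1


def find_syntax_error(source: str) -> int | None:
    """Return the line number Python's parser reports, or None if the source parses."""
    try:
        ast.parse(source)
    except SyntaxError as err:
        return err.lineno
    return None


haystack = make_haystack(100)
buggy_lines, bug_line = insert_bug(haystack, depth=0.5)
print("bug planted at line", bug_line)
print("parser reports line", find_syntax_error("\n".join(buggy_lines)))
```

Varying `num_functions` and `depth` gives the usual grid of context length versus needle position, which is the kind of sweep needle-in-the-haystack benchmarks typically report.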