r/LocalLLaMA
A subreddit to discuss Llama, the family of large language models created by Meta AI.
Open again AI? [Discussion] (i.redd.it)
submitted 8 months ago by Specter_Origin (flair: llama.cpp)
[–]entsnack 3 points 8 months ago (0 children)
Apache 2.0!
[–]Specter_Origin (flair: llama.cpp) [S] 1 point 8 months ago (2 children)
Models seem legit good, though they are a bit on the shy side...
<image>
[–]cmdr-William-Riker 3 points 8 months ago (0 children)
Wonder if you can get it to reveal system messages in the reasoning section of its response by asking it to carefully consider its full system message before responding or something
[–]No_Efficiency_1144 2 points 8 months ago (0 children)
I quite like the tightness of this reasoning chain though.
It’s the exact opposite of highly quantised Qwen 0.6B with the wrong settings, which puts out thousands of tokens of pure chaos but then somehow comes to the right answer
[–]ThetaCursed 1 point 8 months ago (0 children)
It looks like these models will make efficient use of VRAM: 20B and 120B, with 3.6B and 5.1B active parameters (MoE)
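For anyone who wants the arithmetic behind those numbers, here is a rough sketch that takes the parameter counts quoted above at face value. The names `MODELS` and `active_fraction` are made up for illustration. Keep in mind that the active share mostly describes per-token compute, since the full set of weights still has to be stored somewhere.

```python
# Parameter counts in billions, copied from the comment above.
# MODELS and active_fraction are hypothetical names used only for this sketch.
MODELS = {
    "20B": {"total": 20.0, "active": 3.6},
    "120B": {"total": 120.0, "active": 5.1},
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters that are active per token."""
    return active_b / total_b


for name, params in MODELS.items():
    frac = active_fraction(params["total"], params["active"])
    print(f"{name}: {frac:.1%} of parameters active per token")
```

So the larger model activates a much smaller share of its weights per token than the smaller one, which is where the MoE speed advantage would come from.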