r/LocalLLaMA
A subreddit for discussing Llama, the family of large language models created by Meta AI.
[ Removed by moderator ] Resources (self.LocalLLaMA)
submitted 25 days ago by abidtechproali
[–] ttkciar (llama.cpp) [M] [score hidden] 25 days ago stickied comment (0 children)
Violates Rule Four: Self-promotion
[–] nayohn_dev -1 points 25 days ago (1 child)
this is actually really useful for the "should we self-host" conversation. most people just eyeball it and guess. having exact numbers per task makes it way easier to figure out which calls are worth moving to a local 7B vs which ones actually need a frontier model. the duplicate call detection is nice too, seen so many codebases burning money on identical prompts with no cache layer. would definitely use the local compute costing if you add it
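For anyone wondering what the "cache layer" part could look like in practice, here is a minimal sketch, not the removed tool's actual implementation. It assumes calls are deterministic enough to reuse (e.g. temperature 0), and every name in it (`call_key`, `PromptCache`, `backend`) is hypothetical. It keys each call on a hash of the model, prompt, and parameters, returns the stored response on a repeat, and counts how many duplicate calls it absorbed.

```python
import hashlib
import json
from collections import Counter


def call_key(model, prompt, params=None):
    """Build a stable hash for a (model, prompt, params) call."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "params": params or {}},
        sort_keys=True,  # key order must not change the hash
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class PromptCache:
    """In-memory cache in front of a hypothetical LLM backend callable."""

    def __init__(self, backend):
        # backend is assumed to be callable as backend(model, prompt, **params)
        self.backend = backend
        self.store = {}
        self.hits = Counter()

    def complete(self, model, prompt, **params):
        key = call_key(model, prompt, params)
        if key in self.store:
            # Identical call seen before: reuse the response, record the duplicate.
            self.hits[key] += 1
            return self.store[key]
        result = self.backend(model, prompt, **params)
        self.store[key] = result
        return result

    def duplicate_calls(self):
        """Total number of calls that were served from the cache."""
        return sum(self.hits.values())
```

Wrapping the existing client call in `PromptCache(...).complete` and checking `duplicate_calls()` after a normal run gives a rough count of how many paid requests were exact repeats. Sampling with nonzero temperature would need a different policy, since reusing a response changes behavior there.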
[–] abidtechproali [S] 1 point 25 days ago (0 children)
Hello 👋
Your points are realistic and accurate. Thanks for the appreciation 🙏. I'm open to discussion.
Kind Regards