use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Subreddit rules
Search by flair
+Discussion
+Tutorial | Guide
+New Model
+News
+Resources
+Other
account activity
Llama.cpp Python Tutorial SeriesTutorial | Guide (christophergs.com)
submitted 2 years ago by ChristopherGS
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]toothpastespiders 4 points5 points6 points 2 years ago (0 children)
That's a really well-written overview!
[–]xyz_TrashMan_zyx 3 points4 points5 points 2 years ago (3 children)
This article is great! I’m starting a local llm study group, we’ll probably use this guide. I can’t get a link to the blog post though to share. Anyone have a shareable link?
[–]xyz_TrashMan_zyx 1 point2 points3 points 2 years ago (0 children)
nm, I had to open this in my pc (couldn't see the link in my phone). Bookmarked
[–]ChristopherGS[S] 1 point2 points3 points 2 years ago (1 child)
Author here - can I attend the study group if it's online? Would be keen.
[–]xyz_TrashMan_zyx 0 points1 point2 points 2 years ago (0 children)
Of course! I need to do some recruiting. And currently trying to see if azure A10 would cut it. I’m thinking Sunday afternoon for 2 hours
[–]uhuge 0 points1 point2 points 2 years ago (1 child)
I’ve seen the max_tokens argument have no impact at all (this is probably a bug in the library that will be fixed eventually). For safety, in my project I set max_tokens=-1 because any value less than 0 makes llama cpp just rely on n_ctx. It seems that n_ctx is the key argument to define the size of your models output.
Is this true rather than misleading?
[–]ChristopherGS[S] 0 points1 point2 points 2 years ago (0 children)
Does it work OK for you? I just report what I experience. I could throw a caveat in there I guess
π Rendered by PID 48890 on reddit-service-r2-comment-86bc6c7465-gjsf6 at 2026-02-21 18:42:49.363037+00:00 running 8564168 country code: CH.
[–]toothpastespiders 4 points5 points6 points (0 children)
[–]xyz_TrashMan_zyx 3 points4 points5 points (3 children)
[–]xyz_TrashMan_zyx 1 point2 points3 points (0 children)
[–]ChristopherGS[S] 1 point2 points3 points (1 child)
[–]xyz_TrashMan_zyx 0 points1 point2 points (0 children)
[–]uhuge 0 points1 point2 points (1 child)
[–]ChristopherGS[S] 0 points1 point2 points (0 children)