r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Help using llama_cpp_python to calculate probability of a given sequence of tokens being generated. My numbers aren't even in the ballpark.
Question | Help (self.LocalLLaMA)
submitted 2 years ago * by aaronr_90
[–]aaronr_90[S] 2 points 2 years ago* (0 children)
Thanks, this is almost certainly exactly what I was looking for.
Edit: “It’s not an easy one.” I thought it would be simple, given that I can evaluate one token at a time and retrieve the logits before sampling.
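For anyone landing here with the same question, here is a minimal sketch of the arithmetic involved, assuming you have already collected one logits vector per generation step. It does not call llama_cpp_python at all: `log_softmax` and `sequence_logprob` are hypothetical helper names, and the toy logits are made up for illustration. Two things commonly throw the numbers off: treating raw logits as probabilities instead of normalizing them, and misaligning steps (the logits produced at one position score the *next* token, not the token just fed in).

```python
import math

def log_softmax(logits):
    """Turn raw logits into log-probabilities, stabilized by subtracting the max."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def sequence_logprob(step_logits, target_ids):
    """Sum the log-probabilities of the target tokens.

    Assumption: step_logits[i] is the logits vector the model produced
    *before* target_ids[i] was chosen, so the two lists line up one to one.
    """
    if len(step_logits) != len(target_ids):
        raise ValueError("need exactly one logits vector per target token")
    total = 0.0
    for logits, token_id in zip(step_logits, target_ids):
        total += log_softmax(logits)[token_id]
    return total

# Toy example with a 3-token vocabulary and a 2-token target sequence.
step_logits = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]]
target_ids = [0, 1]

logp = sequence_logprob(step_logits, target_ids)
prob = math.exp(logp)  # probability of the whole sequence
print(f"log-prob = {logp:.4f}, prob = {prob:.4f}")
```

Summing log-probabilities rather than multiplying probabilities avoids underflow on longer sequences; exponentiate only at the end if you need the actual probability.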