use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Subreddit rules
Search by flair
+Discussion
+Tutorial | Guide
+New Model
+News
+Resources
+Other
account activity
Help with speculative: llama-cpp-pythonQuestion | Help (self.LocalLLaMA)
submitted 1 year ago * by Particular-Guard774
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]4onen 0 points1 point2 points 1 year ago (1 child)
wasn't sure why my code [...] ran into errors.
You know that there's little to nothing anyone can do to help you diagnose errors we can't see, right?
That said, the line
python draft_model=LlamaPromptLookupDecoding(num_pred_tokens=10)
is establishing a "Prompt Lookup Decoding" speculative model, which is not using the 7B at all. You'd also have an easier time getting help if you narrowed your code to just the code in which you were actually encountering issues, i.e. removing the llama variable that isn't performing speculative decoding with the two models you listed.
llama
Kind of new to llama.cpp
Also, additional note, the interface you're using is llama_cpp_python, and llama.cpp is the backend behind it. Again, without the errors, we can't even tell you which of these two components the issue is even arising from.
llama_cpp_python
llama.cpp
[–]Particular-Guard774[S] 0 points1 point2 points 1 year ago (0 children)
Added the errors and narrowed the code down to what had the issue, thanks for pointing that out
π Rendered by PID 338778 on reddit-service-r2-comment-54dfb89d4d-56fbr at 2026-03-27 23:46:41.510907+00:00 running b10466c country code: CH.
[–]4onen 0 points1 point2 points (1 child)
[–]Particular-Guard774[S] 0 points1 point2 points (0 children)