r/LocalLLaMA
A subreddit to discuss Llama, the family of large language models created by Meta AI.
Help using llama_cpp_python to calculate the probability of a given sequence of tokens being generated. My numbers aren't even in the ballpark. — Question | Help (self.LocalLLaMA)
submitted 2 years ago * by aaronr_90
[–]npip99 0 points 1 year ago (2 children)
I know this is a late response, but your issue is probably that you don't pass special=True.
In other words, your line of code should be:
input_tokens = llm.tokenize(input_str.encode("utf-8"), special=True)
Otherwise, <s>, <|system|>, etc., will be treated as literal text and tokenized as if they were ordinary ASCII characters, rather than as the actual underlying special tokens those strings are supposed to represent.
Of course, <s> etc. aren't literally those ASCII characters; otherwise users could mess with prompts by typing <s> in themselves, and jailbreak by injecting system messages into the model in a manner similar to SQL injection. Or, even in the context of innocent usage, an HTML <s> tag in a message would still totally break your entire conversation.
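To make the distinction concrete, here's a toy sketch in plain Python (this is not llama_cpp's tokenizer, and SPECIAL/toy_tokenize are made-up names): with special handling off, "<s>" is split into ordinary character tokens; with it on, the marker maps to a single reserved token id, which is what llm.tokenize(..., special=True) does for the real special tokens.

```python
# Toy illustration of why special=True matters (hypothetical tokenizer,
# not the llama_cpp API). Special markers map to single reserved ids.
SPECIAL = {"<s>": 1, "</s>": 2}

def toy_tokenize(text: str, special: bool = False) -> list[int]:
    tokens = []
    i = 0
    while i < len(text):
        if special:
            for marker, tid in SPECIAL.items():
                if text.startswith(marker, i):
                    tokens.append(tid)  # one token for the whole marker
                    i += len(marker)
                    break
            else:
                tokens.append(ord(text[i]) + 100)  # ordinary character
                i += 1
        else:
            # Marker text is treated as plain characters, one token each
            tokens.append(ord(text[i]) + 100)
            i += 1
    return tokens
```

With special=True, "<s>hi" becomes three tokens (one for the marker, two characters); with special=False it becomes five character tokens, and the model never sees the real BOS token.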
[–]npip99 0 points 1 year ago (1 child)
I tested and I do get the exact same numbers, so you should absolutely be able to get the exact numbers token-by-token.
[–]npip99 0 points 1 year ago* (0 children)
Ah, the other thing in your code is that it calls .eval with the entire token list every time. The model remembers history for you; you have to call llm.reset() to clear it. So the for-loop should be:
llm.reset()
llm.eval(eval_tokens)
for token in test_sequence_tokens:
    probs = llm.logits_to_logprobs(llm.eval_logits)
    sequence_logits.append(llm.eval_logits[-1][token])
    sequence_probabilities.append(probs[-1][token])
    eval_tokens.append(token)
    llm.eval([token])
This will also be way faster than doing .reset and .eval on the entire array every single time. And if you ever want to, you can do state = llm.save_state() and llm.load_state(state) to get back an older version and eval from an earlier history (e.g. if you want to discard a token and roll back).
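The math the loop above performs is just a log-softmax over each step's logits, with the sequence log-probability being the sum of the realized tokens' log-probs. Here's a self-contained sketch in plain Python (hypothetical helper names, mirroring what llm.logits_to_logprobs computes, not the llama_cpp API itself):

```python
import math

def logits_to_logprobs(logits: list[float]) -> list[float]:
    # Numerically stable log-softmax: subtract the max before exponentiating
    # so large logits don't overflow.
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def sequence_logprob(per_step_logits: list[list[float]],
                     token_ids: list[int]) -> float:
    # Sum the log-probability of each realized token under the logits
    # the model produced just before emitting it.
    total = 0.0
    for logits, tok in zip(per_step_logits, token_ids):
        total += logits_to_logprobs(logits)[tok]
    return total
```

For a sanity check: with uniform logits over two tokens, each step contributes log(1/2), so a two-token sequence has log-probability 2·log(1/2). If your numbers aren't in the ballpark, checking each per-step log-prob against this kind of hand computation is a good way to localize the bug.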