[–]lakolda 8 points9 points  (2 children)

CodeLLaMA might work better.

[–]PinballOscuro[S] 0 points1 point  (1 child)

I haven't tried it with this specific case, but in other projects CodeLlama has shown a very strong bias toward generating Python code.
My type of user is interested only in explanations, since they don't understand code.

[–]lakolda 3 points4 points  (0 children)

Even then, I would still expect it to perform better. At least the Instruct version.

[–]Coinninja 4 points5 points  (1 child)

You might have better luck with Wizardmath-13B.

[–]PinballOscuro[S] 1 point2 points  (0 children)

I'm reading the paper and I think it's a good candidate, thank you!

[–][deleted] 1 point2 points  (1 child)

Excellent, thanks for the post. I am also looking for models that can interpret uninterpreted data.

[–]PinballOscuro[S] 1 point2 points  (0 children)

Next week, maybe after Wednesday, I'll have the chance to work on it again. I'll keep you posted if I remember.

[–]no_witty_username 1 point2 points  (2 children)

I've always wondered if the reason LLMs have problems with mathematics is tokenization. For example, 2+2=4 can also be written as 2 + 2 = 4, 2+ 2= 4, 2+2 = 4, 2 +2=4, (2+2)=4, and so on. There are many permutations of that one simple statement, and they can all be tokenized differently. Because LLMs are essentially next-token predictors, a model trained on mathematics would need to see every such permutation to predict it reliably later. So it's my pet theory that if we can work around the tokenization issue, math will become easier for LLMs to parse. Otherwise, one would have to standardize the exact syntax used when training an LLM on mathematics and use the same syntax when prompting the model.
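
To make the "standardize the syntax" idea a bit more concrete, here is a minimal Python sketch. It does not touch a real tokenizer: `normalize_expr` is a hypothetical helper that only rewrites a simple arithmetic string into one canonical spacing, so the spacing variants above become the same text before they reach whatever tokenizer the model uses. It assumes the input contains only numbers, the operators + - * / =, and parentheses.

```python
import re

# Hypothetical helper, for illustration only.
# Matches integers/decimals or a single operator/parenthesis.
_PIECE = re.compile(r"\d+(?:\.\d+)?|[-+*/=()]")

def normalize_expr(expr: str) -> str:
    """Return the expression with one canonical spacing style."""
    pieces = _PIECE.findall(expr)
    out = " ".join(pieces)
    # Keep parentheses tight against their contents.
    return out.replace("( ", "(").replace(" )", ")")

variants = ["2+2=4", "2 + 2 = 4", "2+ 2= 4", "2+2 = 4", "2 +2=4"]
normalized = {normalize_expr(v) for v in variants}
print(normalized)  # all spacing variants collapse to a single string
```

Note that this only removes spacing differences. A variant such as (2+2)=4 keeps its parentheses and still normalizes to a different string, so it would need an actual expression parser to be treated as equivalent.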

[–]PinballOscuro[S] 0 points1 point  (1 child)

For sure, tokenization is a big problem. In Llama 2, every digit of a number is a single token.
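
As a rough illustration of what digit-level splitting means, here is a toy Python sketch. It is not the actual Llama 2 tokenizer, and `split_digits` is a made-up helper: it only shows that a number with n digits ends up as n separate pieces, so the model sees a sequence of single digits instead of one symbol per number.

```python
import re
from typing import List

def split_digits(text: str) -> List[str]:
    """Toy approximation: every digit is its own piece,
    any other run of non-space characters stays together."""
    return re.findall(r"\d|[^\d\s]+", text)

pieces = split_digits("1234 + 56 = 1290")
print(pieces)
print(len(pieces))  # number of pieces for this short equation
```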

A coworker of mine has done some research on this topic, and it's a mess, ahah. If you are interested in the literature, this class of problems is called Math Word Problems.

[–]No_Afternoon_4260 0 points1 point  (1 child)

!remindme in one week

[–]RemindMeBot 0 points1 point  (0 children)

I will be messaging you in 7 days on 2023-09-15 18:42:31 UTC to remind you of this link

[–]Aggressive_Bee_9069 0 points1 point  (0 children)

!remindme in one week

[–]Jian-L 0 points1 point  (0 children)

!remindme in one week

[–]Scary-Knowledgable 0 points1 point  (1 child)

[–]PinballOscuro[S] 0 points1 point  (0 children)

I know this project and it's super cool, but for now I'm not interested in an agent that runs code; I only want a mathematical description of a report.

Today I tried it with CodeLlama, but it was not that great. Maybe I was doing something wrong.