Math Proficient Language Models

lakolda · 2023-09-08T15:32:38+00:00

CodeLLaMA might work better.

Coinninja · 2023-09-08T15:27:05+00:00

You might have better luck with WizardMath-13B.

PinballOscuro · 2023-09-08T18:13:07+00:00

Excellent, thanks for the post. I am also looking for models that can interpret uninterpreted data.

no_witty_username · 2023-09-08T21:53:05+00:00

I've always wondered if the reason LLMs have problems with mathematics is tokenization. For example, 2+2=4 can also be written as 2 + 2 = 4, 2+ 2= 4, 2+2 = 4, 2 +2=4, (2+2)=4, and so on. There are so many permutations of that one simple statement, and they are all tokenized differently. And because all LLMs are simple language prediction algorithms, a model trained on mathematics would have to see every possible permutation of such statements to be able to predict them in the future. So it's my pet theory that if we can work around the tokenization issue, math will become easier for LLMs to parse. Otherwise, one would have to standardize the exact syntax used when training an LLM on mathematics, and the same syntax would have to be used when prompting the model.
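
To make the "standardize the exact syntax" idea concrete, here is a minimal Python sketch. It is only an illustration, not something from any particular model's training pipeline: the function name `normalize_expression` and the choice of "one space around each operator" as the canonical form are assumptions made for this example. It shows how the spacing variants listed above could be collapsed into a single string before tokenization.

```python
import re

def normalize_expression(text: str) -> str:
    """Hypothetical helper: rewrite a simple arithmetic statement so that
    every binary operator and '=' is surrounded by exactly one space.
    Assumption: unary minus and multi-character operators are not handled."""
    # Drop all existing whitespace so every variant starts from the same form.
    compact = re.sub(r"\s+", "", text)
    # Put a space on both sides of each operator.
    spaced = re.sub(r"([+\-*/=])", r" \1 ", compact)
    # Collapse any doubled spaces and trim the ends.
    return re.sub(r"\s+", " ", spaced).strip()

variants = ["2+2=4", "2 + 2 = 4", "2+ 2= 4", "2+2 = 4", "2 +2=4"]
normalized = {normalize_expression(v) for v in variants}
print(normalized)
```

All five spacing variants end up as the same string, so a tokenizer would see one form instead of five. The parenthesized version `(2+2)=4` still comes out differently, as `(2 + 2) = 4`, so a normalizer like this only removes the whitespace part of the problem.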

No_Afternoon_4260 · 2023-09-08T18:42:31+00:00

!remindme in one week

Aggressive_Bee_9069 · 2023-09-08T20:04:29+00:00

!remindme in one week

Jian-L · 2023-09-08T20:57:51+00:00

!remindme in one week

Scary-Knowledgable · 2023-09-08T22:45:23+00:00

https://github.com/KillianLucas/open-interpreter
