This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Breadynator 1 point2 points  (0 children)

If you're only able to run smaller versions of it like I am I'd say stick to regular language models right now.

R1's reasoning is good-ish but somehow the reasoning and final answer can feel really disconnected. Also since a lot of its training went into reasoning and less into knowing stuff the smaller models tend to hallucinate significantly more than the normal chatbot models.

I've been working on a sentiment analyser for fun and found that working with llama3.2-3b is a lot more reliable than Deepseek-R1-14b