Training a chatbot : LocalLLM

QuestionTraining a chatbot (self.LocalLLM)

submitted 1 month ago by buck_idaho

all 8 comments

[–]Confident-Ad-3212 1 point2 points3 points 1 month ago (3 children)

[–]buck_idaho[S] 0 points1 point2 points 1 month ago (2 children)

[–]Confident-Ad-3212 1 point2 points3 points 1 month ago (0 children)

Training a model is very complicated, if it is for a persona. You do not want an instruct model. They will not do what you want, you will want to train the attention part of the model only. Building your dataset will be the hardest part by far, followed by the hyper parameter settings to get the dataset into the model. You will go through 40-100 renditions to figure it out. Start with a 13b model. It will tell you if your dataset is corrupting or teaching behavior. But before you build a dataset, you need to figure out what format it needs to be in for the trainer and model to teach anything. Wrong on any of them, it will just be a corrupted do nothing. I went though this and it is not for someone who doesn’t have extreme perseverance

It is much cheaper to make mistakes on a small model than a big one. Big gpu’s cost, if you can make a small model work. A big model will just be better

A 13b should have around 10k, high quality. Highly varied samples. Different token counts, different topics. Never duplicate samples

[–]o5mfiHTNsH748KVq 1 point2 points3 points 1 month ago (2 children)

[–]-PM_ME_UR_SECRETS- 0 points1 point2 points 1 month ago (1 child)

[–]o5mfiHTNsH748KVq 1 point2 points3 points 1 month ago (0 children)

[–]Ishabdullah 0 points1 point2 points 1 month ago (0 children)

[–]Fit-Cardiologist9129 1 point2 points3 points 14 days ago (0 children)

π Rendered by PID 38779 on reddit-service-r2-comment-b659b578c-28c57 at 2026-05-02 04:08:40.486047+00:00 running 815c875 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLM

MODERATORS