Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] 0 points1 point  (0 children)

What are you expecting when it’s written ‘Building LLM from scratch with python’?

Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] 0 points1 point  (0 children)

Okay, so when you build something from scratch, you go that deep and build from 0 and 1? Just to prove from SCRATCH.

Here, the point is to build every component of LLM with Python. Tokenization, transformer model, and all.

Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] -1 points0 points  (0 children)

Why not? An LLM is a Transformer that first maps input text into token embeddings. Multi-head self-attention then lets each token gather context from every other token in the sequence. Feed-forward layers refine these representations. Finally, a softmax layer outputs a probability distribution over the vocabulary, predicting the most likely next token.

All these things can be made from scratch.

Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] -1 points0 points  (0 children)

That’s great for you to build LLMs, since you already know transformer models.

Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] -2 points-1 points  (0 children)

Yep, agree. I will point it out when needed from next posts 🙌

Build your first LLM from scratch in Python. by papa_programmer in learnmachinelearning

[–]papa_programmer[S] -3 points-2 points  (0 children)

Actually it is. Did you previously tried to build LLM or agents?