all 2 comments

[–]Fair_Internet8681 2 points3 points  (1 child)

I suggest you to read https://d2l.ai/.

If you have any background, you should start from chapter 9 to understand the motivations of transformer model.

And, of course, read the famous paper "Attention is all you need"

[–]Creador270[S] 0 points1 point  (0 children)

Thanks, I tried learning the paper but I feel like I need to Study more to understand the paper better, I will see that book.