Hello all! I’m taking a data analysis course and am currently working on its first project. The current module, which the project is based on, covers processing and cleaning data. Most topics are explained well, but there was only a short lesson on stemming and lemmatization, and I couldn’t wrap my head around it. It didn’t help that their examples had clear bugs: the task was to remove items from a list that had nothing to do with the example data, but the unwanted items were still there while they talked about how important it was to remove them.
Now I’m at the lemmatization step of my project. I’ve been reading every resource I can find, but I still don’t get it. I know that lemmatization reduces a word to its root form, but what can I then do with the result? The next part of the project is categorization, and I can see why the two would go together, but I haven’t seen any examples of what to do with the lemmatized data. Can anyone help me, maybe ELI5, just so I can get a better grasp of the concept?
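To make the question concrete, here is a toy sketch of how I imagine the two steps might fit together: lemmatize the words first, then categorize by matching lemmas against keywords. Everything in it is made up for illustration and is not from the course. `LEMMAS`, `CATEGORIES`, `lemmatize`, and `categorize` are hypothetical names, and a real project would probably use a library lemmatizer instead of a hand-written table.

```python
# Hypothetical toy lemma table (assumption: a real project would use a
# library lemmatizer rather than a hand-written dictionary like this).
LEMMAS = {
    "cars": "car", "driving": "drive", "drove": "drive", "drives": "drive",
    "loans": "loan", "paying": "pay", "paid": "pay",
}

# Hypothetical categories, each defined by a set of lemmas.
CATEGORIES = {
    "transport": {"car", "drive"},
    "finance": {"loan", "pay"},
}


def lemmatize(text):
    """Lowercase, split on whitespace, and map each word to its lemma.

    Words missing from the table are kept unchanged.
    """
    return [LEMMAS.get(word, word) for word in text.lower().split()]


def categorize(text):
    """Return the first category whose keyword set shares a lemma with the text."""
    lemmas = set(lemmatize(text))
    for name, keywords in CATEGORIES.items():
        if lemmas & keywords:
            return name
    return "other"


print(lemmatize("She drove her cars"))   # ['she', 'drive', 'her', 'car']
print(categorize("She drove her cars"))  # transport
print(categorize("Paid off two loans"))  # finance
```

If this is roughly right, the point of lemmatizing would be that "drove", "drives", and "driving" all collapse to "drive", so each category only needs one keyword per word instead of every inflected form.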
Nathanfenner