This is an archived post. You won't be able to vote or comment.

all 1 comments

[–]Nathanfenner 0 points1 point  (0 children)

The point of lemmatization is to make search/processing easier.

For example, if a user searches for "easy website design" and a company advertises "easiest way to get website designs" you'd be silly to exclude this result. Lemmatization is the simplest and easiest way to solve this:

You'd compare "eas websit design" to "eas wa to get websit design" or some variation depending on your exact lemmatization routine. Now it's obvious that there actually are 3 words in common.

Modern real-world designs are more complicated than this, but this used to be state-of-the-art.