Lemmatization

Lemmatization is a text normalization technique used in natural language processing. Unlike stemming, which strips word endings by rule and can produce strings that are not real words, lemmatization maps each word to its dictionary form, called the lemma. To pick the correct lemma, it draws on a vocabulary and on the word's context, such as its part of speech.
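The contrast with stemming can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the LEXICON table, the SUFFIXES list, and the functions naive_stem and lemmatize are hypothetical and do not come from any real library. A production lemmatizer would use a full dictionary and a part-of-speech tagger rather than a hand-written table.

```python
# Toy lexicon invented for this example: (word, part of speech) -> lemma.
# A real lemmatizer would consult a full dictionary instead.
LEXICON = {
    ("studies", "NOUN"): "study",
    ("studies", "VERB"): "study",
    ("better", "ADJ"): "good",
    ("meeting", "VERB"): "meet",
    ("meeting", "NOUN"): "meeting",
    ("saw", "VERB"): "see",
    ("saw", "NOUN"): "saw",
}

# Suffixes tried in order by the naive stemmer (also an assumption).
SUFFIXES = ("ing", "es", "ed", "s")


def naive_stem(word):
    """Strip the first matching suffix, with no dictionary and no context."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def lemmatize(word, pos):
    """Look up the lemma for a word given its part of speech.

    Falls back to the lowercased word when the pair is not in the lexicon.
    """
    word = word.lower()
    return LEXICON.get((word, pos), word)


if __name__ == "__main__":
    print(naive_stem("studies"))         # studi (not a real word)
    print(lemmatize("studies", "NOUN"))  # study
    print(lemmatize("meeting", "VERB"))  # meet
    print(lemmatize("meeting", "NOUN"))  # meeting
    print(lemmatize("better", "ADJ"))    # good
```

In this sketch the stemmer turns "studies" into "studi", while the lookup returns "study". The part-of-speech argument is what lets "meeting" resolve to "meet" as a verb but stay "meeting" as a noun, and an irregular form such as "better" is handled only because the lexicon lists it.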

Here are the key points about lemmatization:

In summary, lemmatization uses context, dictionary entries, and morphological analysis to map words to their lemmas accurately, which can improve results in search, information retrieval (IR), and text analytics. It is more complex than stemming, but it resolves ambiguities that stemming cannot, such as a word form whose lemma depends on its part of speech.

See also: