lemmatisation
Lemmatization is the process of reducing a word to its base or dictionary form, the lemma. Unlike simple stemming, which may produce truncated or non-dictionary forms, lemmatization uses morphological analysis and, often, part-of-speech information to assign the most appropriate lemma for a given token in context.
In practice, a lemmatizer may rely on lexicons and inflection tables, rule-based morphological analyzers, or statistical
Lemmatization is used in information retrieval to improve matching across word forms, in natural language processing
Challenges include ambiguity, handling of proper nouns and multiword expressions, languages with rich morphology, and resource