Lemmatizáció
Lemmatizáció is a process in natural language processing and computational linguistics that involves reducing a word to its base or dictionary form, known as its lemma. This is distinct from stemming, which often chops off word endings without regard for the word's actual meaning or grammatical correctness. Lemmatization, on the other hand, uses a vocabulary and morphological analysis to return the canonical form of a word, ensuring it is a valid dictionary word.
For example, the lemma of "running", "ran", and "runs" is "run". Similarly, the lemma of "better" and
The effectiveness of lemmatization depends heavily on the quality of the lexicon and the morphological analyzer