UPOS
UPOS, short for Universal Part-of-Speech tags, is a coarse-grained tagset used in the Universal Dependencies (UD) framework. It provides a language-agnostic set of POS categories to annotate tokens across languages, enabling cross-linguistic comparisons, alignment of parallel texts, and standardized evaluation in natural language processing and linguistic annotation.
The UPOS tagset comprises 17 tags: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON,
In practice, UPOS is applied to annotated corpora known as UD treebanks, which are used to train