UDPipe
UDPipe is an open-source natural language processing toolkit that annotates text according to the Universal Dependencies (UD) framework. It provides a trainable pipeline for tokenization, sentence segmentation, part-of-speech tagging, lemmatization, morphological analysis, and dependency parsing. The resulting annotations are written in the CoNLL-U format used by UD treebanks, which allows standardized linguistic annotation across languages.
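To make the CoNLL-U output concrete, the following is a minimal Python sketch of how such annotations could be read back into memory. It relies only on the published CoNLL-U layout (comment lines starting with `#`, one token per line with ten tab-separated fields, and a blank line between sentences). The `Token` class, the `parse_conllu` function, and the `SAMPLE` text are hypothetical names and hand-written data for illustration; they are not part of UDPipe and the sample is not actual UDPipe output.

```python
from dataclasses import dataclass
from typing import List

# The CoNLL-U format defines exactly ten tab-separated fields per token line.
CONLLU_FIELD_COUNT = 10


@dataclass
class Token:
    """One CoNLL-U token line; field names follow the CoNLL-U column order."""
    id: str
    form: str
    lemma: str
    upos: str
    xpos: str
    feats: str
    head: str
    deprel: str
    deps: str
    misc: str


def parse_conllu(text: str) -> List[List[Token]]:
    """Split CoNLL-U text into sentences, each a list of Token objects."""
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Sentence-level comments (e.g. "# text = ...") are skipped here.
            continue
        columns = line.split("\t")
        if len(columns) != CONLLU_FIELD_COUNT:
            raise ValueError(f"expected {CONLLU_FIELD_COUNT} fields, got {len(columns)}")
        current.append(Token(*columns))
    if current:
        # Handle input that does not end with a blank line.
        sentences.append(current)
    return sentences


# Hand-written illustrative sentence, not produced by UDPipe.
SAMPLE = "\n".join([
    "# text = Dogs bark.",
    "\t".join(["1", "Dogs", "dog", "NOUN", "NNS", "Number=Plur", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "bark", "bark", "VERB", "VBP", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", ".", "_", "2", "punct", "_", "_"]),
    "",
])

if __name__ == "__main__":
    for sentence in parse_conllu(SAMPLE):
        for token in sentence:
            print(token.id, token.form, token.lemma, token.upos, token.head, token.deprel)
```

Keeping every field as a string is a deliberate simplification: the ID column may hold ranges for multiword tokens or decimal IDs for empty nodes, so converting it to an integer would need extra handling.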
The tool combines language-specific models with a general architecture that supports training on user-provided data. Users
UDPipe provides models for many languages covered by the Universal Dependencies project, allowing researchers and developers
Availability and licensing are governed by its open-source status, with distribution through the official UDPipe resources