TreetaggerUD
TreetaggerUD is a computational linguistics tool that performs part-of-speech tagging and lemmatization on text. It is an adaptation of TreeTagger, a widely used tagger developed by Helmut Schmid, designed to work with Universal Dependencies (UD) data. Universal Dependencies is a project that aims to provide a consistent annotation framework for part-of-speech tags, morphological features, and syntactic dependencies across languages. TreetaggerUD follows the UD annotation scheme, so its annotations use the same standardized categories for every language.
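To make the idea of standardized annotation concrete, the sketch below checks tagged tokens against the 17 universal part-of-speech tags (UPOS) defined by Universal Dependencies. The three-column, tab-separated layout (form, tag, lemma) is an assumption modeled on the output style of the original TreeTagger, not a confirmed description of TreetaggerUD's output. The names `TaggedToken` and `parse_tagged_line` are hypothetical and used only for illustration.

```python
from dataclasses import dataclass

# The 17 universal part-of-speech tags (UPOS) defined by Universal Dependencies.
UPOS_TAGS = frozenset({
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
})


@dataclass(frozen=True)
class TaggedToken:
    """One token with its UPOS tag and lemma (hypothetical container)."""
    form: str
    upos: str
    lemma: str


def parse_tagged_line(line: str) -> TaggedToken:
    """Parse one 'form<TAB>UPOS<TAB>lemma' line.

    The column layout is an assumption for this sketch; it is not taken
    from TreetaggerUD's documentation.
    """
    parts = line.rstrip("\n").split("\t")
    if len(parts) != 3:
        raise ValueError(f"expected 3 tab-separated fields, got {len(parts)}")
    form, upos, lemma = parts
    if upos not in UPOS_TAGS:
        raise ValueError(f"not a UPOS tag: {upos!r}")
    return TaggedToken(form, upos, lemma)


# Hand-written sample data, not real tagger output.
sample = "The\tDET\tthe\ncats\tNOUN\tcat\nsleep\tVERB\tsleep\n.\tPUNCT\t."
tokens = [parse_tagged_line(line) for line in sample.splitlines()]

for token in tokens:
    print(f"{token.form}\t{token.upos}\t{token.lemma}")
```

Because the tag inventory is the same for every language, a check like this could be applied unchanged to tagged text in any UD language; only the forms and lemmas would differ.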
The tool takes plain text as input and outputs a sequence of tokens, each accompanied by its