TeximusHT
TeximusHT is a high-throughput text processing framework designed to support scalable natural language processing workflows. It provides a modular dataflow pipeline engine, an extensible plugin system, and bindings for multiple programming languages, with support for both streaming and batch processing.
The project began in 2022 as a community effort, with the first public release in 2023. It
The core is a dataflow runtime that constructs processing graphs from components such as readers, transformers,
Typical applications include large-scale text normalization, tokenization, named-entity recognition, sentiment analysis pipelines, log analytics, and data
TeximusHT is maintained by a community governance model with a core team overseeing releases. It has drawn