Jword
Jword is a cross-language natural language processing toolkit that provides word-level analysis and indexing for textual corpora. It supports tokenization, morphological analysis, lemmatization and stemming, and part-of-speech tagging, and it builds an inverted index for efficient search. The design prioritizes modularity and language-agnostic interfaces.
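To illustrate how tokenization feeds an inverted index, the following is a minimal sketch in Python. It is not Jword's API: the function names `tokenize`, `build_inverted_index`, and `search` are hypothetical and chosen for illustration only, and the tokenizer is a simple regular-expression split, not a language-aware one.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into runs of word characters."""
    return re.findall(r"\w+", text.lower())


def build_inverted_index(docs):
    """Map each token to the sorted list of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            postings[token].add(doc_id)
    return {token: sorted(ids) for token, ids in postings.items()}


def search(index, query):
    """Return ids of documents that contain every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return []
    result = set(index.get(tokens[0], []))
    for token in tokens[1:]:
        result &= set(index.get(token, []))
    return sorted(result)


# Hypothetical three-document corpus, for illustration only.
docs = {
    1: "The cat sat on the mat.",
    2: "A cat chased the dog.",
    3: "Dogs bark loudly.",
}
index = build_inverted_index(docs)
```

In this sketch a query for "dog" matches only document 2, because "Dogs" in document 3 is indexed as the distinct token "dogs". Applying a stemming or lemmatization step before indexing is what would let such word forms match each other.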
Origin and development: The project was initiated by the Jword Foundation in 2012, with its first public
Architecture and features: Jword consists of a core engine written for the Java Virtual Machine, with adapters
Usage and reception: Jword is used in academic projects, NLP demonstrations, and small-scale industry applications. Users
See also: Natural language processing, Tokenization, Morphology, Lemmatization, Stemming, Part-of-speech tagging, Inverted index.