Hazm

Hazm is a Python library for Persian natural language processing. It provides a set of tools and resources designed to process Persian text, supporting tasks common in NLP workflows such as text normalization, tokenization, stemming and lemmatization, part-of-speech tagging, and basic parsing capabilities. The library is widely used in academic research and practical applications dealing with Persian language data.

Normalization in Hazm standardizes Persian text by handling character forms, spacing, and diacritics, helping to reduce variability in input data. Tokenization, or word segmentation, splits Persian text into tokens while accounting for Persian orthography and affixes. Stemming and lemmatization offer morphological processing to reduce words to their base forms, aiding downstream analysis. POS tagging assigns grammatical categories to tokens, enabling syntactic and semantic processing. Hazm also provides utilities for sentence splitting and other preprocessing steps that commonly appear in Persian NLP pipelines.

Hazm is open-source and maintained on platforms like GitHub. It can be installed in Python environments via package managers, typically using commands such as `pip install hazm`. The library is designed to be accessible for researchers and developers working with Persian text and is compatible with Python versions widely used in data science workflows.

Limitations and scope reflect the state of Persian NLP tooling. Hazm performs well on standard Persian texts but may face challenges with dialects, highly informal writing, or domain-specific language. Like many open-source NLP tools, its accuracy depends on the data it was designed and tested with, and users may need to complement it with domain-specific resources for best results.

Hazm sits within the broader field of Persian NLP, alongside other libraries and resources that support language technologies for Persian.