ctexts
ctexts is an open-source software project and data ecosystem designed to support the creation, curation, and analysis of Chinese texts, with a focus on classical and historical material. It combines a digitized repository, text-processing tools, and an application programming interface intended for researchers and educators. The project is community-driven, welcoming contributions from linguists, philologists, and software developers.
Core components include a text repository with metadata, an annotation framework, and tooling for tasks such as normalization and segmentation.
The project emphasizes openness and reproducibility, with documentation, example datasets, and clear licensing for contributed materials.
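As a rough illustration of the kind of normalization and segmentation step mentioned among the core components, the following Python sketch uses only the standard library. It is not taken from ctexts: the function names normalize and segment are hypothetical, the choice of NFC normalization is an assumption, and treating each character as one token is a common simplification for classical Chinese rather than the project's documented method.

```python
import unicodedata


def normalize(text: str) -> str:
    """Hypothetical helper: apply Unicode NFC and remove all whitespace."""
    composed = unicodedata.normalize("NFC", text)
    # str.split() with no arguments splits on any Unicode whitespace,
    # including the ideographic space U+3000.
    return "".join(composed.split())


def segment(text: str) -> list[str]:
    """Hypothetical helper: one token per character, punctuation dropped.

    Assumption: character-level tokens are an acceptable approximation
    for classical Chinese; real tooling may use a lexicon or a model.
    """
    return [
        ch
        for ch in normalize(text)
        # Unicode general categories starting with "P" are punctuation.
        if not unicodedata.category(ch).startswith("P")
    ]


if __name__ == "__main__":
    print(segment("天地玄黃， 宇宙洪荒。"))
```

NFC is used here instead of NFKC because NFKC would also rewrite full-width punctuation to ASCII forms, which may not be wanted when the original text should be preserved for citation.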
See also: digital humanities, Text Encoding Initiative, corpora, Chinese Text Project.