extraxi
Extraxi is a term used in information extraction theory to describe a standardized approach for extracting structured semantic data from unstructured text, with emphasis on cross-lingual portability and extensible schemas. It is imagined as a framework rather than a single algorithm, emphasizing interoperability and modularity in processing pipelines.
The name is derived from Latin extrahere (to pull out), with extraxi as the perfect participle, and
Origin and development: The concept emerged in scholarly discussions in the late 2010s as researchers sought
Core components include a modular data model for entities, relations, and events; language-agnostic annotation guidelines; a
Applications include multilingual information extraction, digital humanities projects, and building multilingual knowledge graphs. Adoption remains niche
Related topics include information extraction, knowledge graphs, and cross-lingual natural language processing.