Korporaation
Korporaation is an international research and development consortium focused on corpus linguistics, natural language processing, and data governance. It gathers universities, technology firms, and public institutions to develop multilingual linguistic resources, evaluation standards, and software tools for language technologies. Its mission emphasizes high‑quality language data, open science, and responsible AI through transparent governance.
Its origins trace back to a mid‑to‑late 1990s collaboration among European and North American research labs,
Core activities include acquiring, annotating, and curating large multilingual corpora; developing annotation schemes and benchmarks; providing
Its resources span data repositories, compute platforms, and standards documentation. The consortium offers open‑access datasets alongside
Ethical guidelines cover privacy, consent, data minimization, bias mitigation, and transparency. An ethics charter is published