LabelWhat
LabelWhat is an open-source, modular platform for data annotation designed to support labeling workflows across multiple data modalities used in machine learning, including images, text, and audio. It provides a centralized workspace for defining label taxonomies, coordinating annotator tasks, validating annotations, and exporting labeled datasets into common formats. The project aims to promote reproducibility and collaboration by preserving provenance, versioning label schemas, and recording adjudication decisions.
LabelWhat supports hierarchical and multi-label taxonomies, per-task assignment, review queues, and quality-control mechanisms such as adjudication.
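As an illustration of how a hierarchical, multi-label taxonomy can be represented, the following is a minimal sketch in Python. It is not taken from the LabelWhat codebase: the `Taxonomy` class, its methods, and the example labels are hypothetical names chosen for this example. The sketch assumes each label has at most one parent and that an annotation is a set of labels.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Taxonomy:
    """Hypothetical label schema: maps each label to its parent (None for a root)."""

    parents: dict[str, Optional[str]] = field(default_factory=dict)

    def add(self, label: str, parent: Optional[str] = None) -> None:
        """Register a label; the parent, if given, must already exist."""
        if parent is not None and parent not in self.parents:
            raise KeyError(f"unknown parent label: {parent}")
        self.parents[label] = parent

    def ancestors(self, label: str) -> list[str]:
        """Return ancestors ordered from the immediate parent up to the root."""
        chain = []
        current = self.parents[label]
        while current is not None:
            chain.append(current)
            current = self.parents[current]
        return chain

    def expand(self, labels: set[str]) -> set[str]:
        """Validate a multi-label annotation and add every implied ancestor."""
        unknown = labels - set(self.parents)
        if unknown:
            raise ValueError(f"labels not in taxonomy: {sorted(unknown)}")
        expanded = set(labels)
        for label in labels:
            expanded.update(self.ancestors(label))
        return expanded


# Example schema (assumed for illustration only).
taxonomy = Taxonomy()
taxonomy.add("animal")
taxonomy.add("dog", parent="animal")
taxonomy.add("puppy", parent="dog")
taxonomy.add("outdoor")

# One item carrying two labels; "puppy" implies "dog" and "animal".
expanded = taxonomy.expand({"puppy", "outdoor"})
```

Storing only the parent of each label keeps the schema compact, and expanding an annotation to include its ancestors lets a consumer query at any level of the hierarchy. Whether LabelWhat stores or expands labels this way is not stated in the source.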
Architecture and workflow: LabelWhat follows a client-server model. A central server stores label schemas, task definitions,
History and reception: The project originated from a collaboration among researchers and practitioners in 2020 and
See also: data annotation tools, active learning, annotation schema.