labelt
Labelt is a domain-specific language designed for annotating datasets with semantic labels and metadata to support machine learning, data governance, and reproducible experiments. The language emphasizes portability, human readability, and machine parsability, promoting consistent labeling across projects and teams. Labelt aims to bridge simple tag systems and more formal data-description languages by providing a lightweight, schema-driven approach.
Origins of Labelt trace to open-source data-labeling communities in the 2020s, seeking a common format to replace
Key features include a schema system for labels and attributes, support for hierarchical and multi-label structures,
An annotation in Labelt consists of an entry with an identifier, a primary label, and optional attributes.
Applications include labeling for computer vision, natural language processing, and multimodal datasets, as well as data
See also: Data labeling, Data governance, Taxonomy, Metadata.