Datamakes
Datamakes refers to the set of practices and artifacts involved in creating, curating, and distributing data assets for the research, development, and deployment of data-driven systems.
Core activities include data collection and ingestion, cleaning and normalization, labeling and annotation, data augmentation, and
Governance and ethics: Datamakes addresses privacy and consent, bias assessment, fairness, and compliance with legal frameworks.
Lifecycle and tools: Workflows typically use data pipelines, version control for datasets, data catalogs, and tooling
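As an illustration of version control for datasets, the following minimal sketch derives a content-based fingerprint for a small in-memory dataset and records it in a manifest. It is not tied to any particular Datamakes tool; the function names `dataset_fingerprint` and `make_manifest`, the manifest fields, and the choice of SHA-256 over canonical JSON are all assumptions made for the example.

```python
import hashlib
import json


def dataset_fingerprint(records):
    """Return a SHA-256 hex digest identifying the contents of a dataset.

    `records` is a list of JSON-serializable dicts (a hypothetical layout).
    """
    # Serialize each record canonically (sorted keys, fixed separators) so
    # that key order inside a record does not affect the result.
    lines = sorted(
        json.dumps(r, sort_keys=True, separators=(",", ":")) for r in records
    )
    digest = hashlib.sha256()
    # Sorting the serialized records makes the fingerprint independent of
    # record order as well.
    for line in lines:
        digest.update(line.encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


def make_manifest(name, records):
    """Build a small manifest entry (hypothetical fields) for a catalog."""
    return {
        "name": name,
        "num_records": len(records),
        "sha256": dataset_fingerprint(records),
    }


if __name__ == "__main__":
    v1 = [{"id": 1, "label": "cat"}, {"id": 2, "label": "dog"}]
    v2 = [{"id": 1, "label": "cat"}, {"id": 2, "label": "bird"}]
    print(make_manifest("pets", v1))
    print(make_manifest("pets", v2))
```

Two manifests with the same `sha256` value describe the same records regardless of ordering, while any change to a record yields a different fingerprint, which is the property a dataset version identifier needs. Real systems usually hash files or chunks rather than in-memory records, so this should be read as a sketch of the idea only.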
Applications: In machine learning and analytics, Datamakes underpins model training, evaluation, and research reproducibility. In journalism
Critics and outlook: Critics caution that Datamakes can lag behind rapid development cycles or emphasize data