expandem
Expandem is a term used in data science to describe a modular framework for controlled data expansion to augment machine learning datasets. It denotes both a conceptual approach and, in some ecosystems, a software-suite that aims to increase dataset size and diversity without disproportionately increasing labeling effort. The goal is to improve model generalization while monitoring data quality and bias.
Core concepts include an expansion engine that applies transformations and synthetic data generation, adapters to connect
Techniques used under expandem range from generative modeling and domain-specific augmentation to domain randomization and synthetic-to-real
History and status: The term emerged in online ML discourse in the early 2020s and has since
Related topics include data augmentation, synthetic data, generative models, domain randomization, and dataset versioning.