Datenwahl
Datenwahl (literally data choice) is a concept in data governance and information ethics referring to the process by which individuals or organizations select data sources, data sets, or features to be used in analysis, modeling, policy-making, or reporting. It encompasses decisions about what data to collect, which variables to include, and which data sources to rely on. The term is used especially in discussions of bias, transparency, and reproducibility in data-driven work, where different selections can lead to different conclusions.
In practice, Datenwahl occurs at multiple levels: data sourcing and collection strategies, feature or variable selection
Good governance of Datenwahl includes documenting selection criteria, maintaining data provenance, and conducting sensitivity analyses. Techniques
See also data governance, data provenance, data quality, and selection bias.