recordvary
Recordvary is a data-analytic concept used to describe the degree of variability among the records in a dataset. It focuses on how much attribute values differ across records and is commonly used in data profiling, quality assessment, and feature engineering. The term can refer to two related notions: the variability of individual attributes across records and the overall diversity of records when considering multiple attributes.
For numeric attributes, recordvary is often quantified by dispersion measures such as variance or standard deviation.
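As an illustration of the numeric case, the following minimal sketch computes the population variance and standard deviation of each numeric attribute across a small set of records. The function name `numeric_recordvary`, the toy records, and the choice to skip missing or non-numeric values are assumptions made for this example only; they are not part of any standard definition of the term.

```python
from statistics import pstdev, pvariance

# Hypothetical toy dataset: each dict is one record.
records = [
    {"age": 30, "income": 50000.0, "flag": 1, "city": "A"},
    {"age": 40, "income": 50000.0, "flag": 1, "city": "B"},
    {"age": 50, "income": 80000.0, "flag": 1, "city": None},
]


def numeric_recordvary(records):
    """Return {attribute: (population variance, population std dev)}."""
    columns = {}
    for record in records:
        for name, value in record.items():
            # Assumption: missing (None) and non-numeric values are skipped.
            # Other missing-value policies would give different results.
            if isinstance(value, (int, float)) and not isinstance(value, bool):
                columns.setdefault(name, []).append(value)
    return {
        name: (pvariance(values), pstdev(values))
        for name, values in columns.items()
    }


result = numeric_recordvary(records)
for name, (variance, std_dev) in result.items():
    print(f"{name}: variance={variance:.2f}, std_dev={std_dev:.2f}")
```

In this sketch the `flag` attribute has zero variance because every record holds the same value, which is the kind of column a profiling step might mark as having low discriminative power. Variance depends on the scale of each attribute, so values for `age` and `income` are not directly comparable without normalization.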
Applications include identifying columns with high or low discriminative power, guiding data cleaning, and detecting anomalies.
Limitations include dependence on the chosen attributes, the handling of missing values, and sensitivity to data encoding.
See also: data profiling, data quality, variance, entropy, distance metrics.