NULLheavy
NULLheavy is a term used in data management to describe datasets or variables characterized by a high incidence of NULL or missing values. The term is descriptive rather than normative, signaling that missing data dominate the data structure and influence analysis, storage, and interpretation.
Coined informally in database discussions, NULLheavy combines NULL (a placeholder for missing values in SQL and
Typical indicators include missing value proportion above a threshold (e.g., >50%), patterns of missingness, and reliance
Strategies include data profiling, exploring missingness mechanisms, choosing appropriate imputation (mean/median, regression, multiple imputation), using models
Common domains include survey research with nonresponse, sensor networks with intermittent reporting, or legacy databases with