tcloseness
T-closeness is a data anonymization privacy model designed to prevent attribute disclosure in published data. It builds on k-anonymity and l-diversity by requiring that the distribution of a sensitive attribute within every equivalence class of quasi-identifiers is "close" to the distribution of that attribute in the overall data. The closeness is measured with a distance function and bounded by a threshold t, which is chosen by the data publisher.
Formally, let S be a sensitive attribute with possible values and let P be the global distribution
Purpose and rationale: t-closeness aims to reduce the risk that an attacker, who knows an individual's quasi-identifiers,
Calculation and use: To achieve t-closeness, data custodians apply anonymization techniques that generalize or suppress quasi-identifiers
Limitations: t-closeness does not provide formal guarantees like differential privacy and can still allow re-identification risks