t-closeness

T-closeness is a data anonymization privacy model designed to prevent attribute disclosure in published data. It builds on k-anonymity and l-diversity by requiring that the distribution of a sensitive attribute within every equivalence class of quasi-identifiers be "close" to the distribution of that attribute in the overall dataset. Closeness is measured with a distance function and bounded by a threshold t, chosen by the data publisher.

Formally, let S be a sensitive attribute with a finite set of possible values, and let P be the global distribution of S in the dataset. For each equivalence class G created by generalizing or suppressing quasi-identifiers, let P_G be the distribution of S within G. The dataset satisfies t-closeness if the distance between P_G and P does not exceed t for every class G. Common distance metrics include Earth Mover's Distance (Wasserstein distance) and other distributional distances such as L1 or chi-squared measures.

Purpose and rationale: t-closeness aims to reduce the risk that an attacker who knows an individual's quasi-identifiers can infer sensitive information beyond what is implied by the overall data distribution. It addresses disclosure risks that k-anonymity and l-diversity do not fully mitigate, especially when the sensitive attribute has a skewed or narrow distribution.

Calculation and use: To achieve t-closeness, data custodians apply anonymization techniques that generalize or suppress quasi-identifiers until all equivalence classes meet the t-closeness criterion. If a class cannot be made to satisfy the threshold without excessive data distortion, the publisher may adjust t or the data release strategy.

Limitations: t-closeness does not provide formal guarantees like differential privacy and can still allow re-identification risks under adversaries with auxiliary information. It can also degrade data utility if the threshold is too strict or the data domain is highly imbalanced.
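
The per-class check in the formal definition can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a library API: the function names are hypothetical, and it uses the total variation (L1-based) distance for simplicity, whereas Earth Mover's Distance is the metric commonly used for ordered sensitive attributes.

```python
from collections import Counter

def distribution(values):
    # Empirical distribution of a sensitive attribute as {value: probability}.
    counts = Counter(values)
    total = len(values)
    return {v: c / total for v, c in counts.items()}

def variational_distance(p, q):
    # Total variation distance between two discrete distributions:
    # half the L1 distance over the union of their supports.
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(v, 0.0) - q.get(v, 0.0)) for v in support)

def satisfies_t_closeness(classes, t):
    # `classes` is a list of lists: the sensitive-attribute values falling
    # in each equivalence class after generalization/suppression.
    # The dataset satisfies t-closeness if every class's distribution P_G
    # is within distance t of the global distribution P.
    global_dist = distribution([v for cls in classes for v in cls])
    return all(
        variational_distance(distribution(cls), global_dist) <= t
        for cls in classes
    )
```

For example, with `classes = [["a", "a"], ["b", "b"]]` the global distribution is uniform over {a, b}, each class concentrates on a single value, and the total variation distance for each class is 0.5, so this partition satisfies t-closeness only for t ≥ 0.5.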