kanonymity

Kanonymity, commonly written as k-anonymity, is a property of a data release that aims to protect privacy by ensuring that each individual is indistinguishable from at least k−1 others with respect to a defined set of quasi-identifiers. Quasi-identifiers are attributes that may not identify a person on their own but could do so when combined with external information (for example, ZIP code, birth year, and gender).

To achieve k-anonymity, data publishers generalize or suppress quasi-identifier values until every combination of quasi-identifiers appears

Originating in the data privacy literature, the concept was introduced by Latanya Sweeney in 2002 as a

Limitations of k-anonymity include vulnerability to attribute disclosure if all records within an equivalence class share

k

a

a

a

reidentification

a

a

k

a

quasi-identifiers

a