Pseudonymisation

Pseudonymisation is a data processing technique that replaces identifying fields in a dataset with artificial identifiers, or pseudonyms, so individuals cannot be readily identified without additional information. It reduces privacy risk while preserving data usefulness for analysis. It differs from anonymisation in that the mapping between pseudonyms and real identities is kept separately and is typically recoverable, whereas anonymisation aims to make re-identification impossible.

Implementation methods include tokenization, deterministic or non-deterministic hashing with salt, encryption, and the use of stable

Benefits include safer data sharing, easier linkage across datasets for research and analytics, and stronger protection

Legal context: under the GDPR, pseudonymisation is recognised as a security measure and a component of data

Common use cases are health research datasets, customer analytics, and operational logs where direct identifiers are

pseudonymisation

access-controlled,

re-identification

responsibilities