Fellegi-Sunter model
The Fellegi-Sunter model, also known as the Fellegi-Sunter method or the record linkage model, is a statistical framework for identifying records in two or more files that refer to the same entity. Developed by Ivan Fellegi and Alan Sunter in 1969, it is widely used in data management, statistics, and other fields for tasks such as deduplication, data integration, and privacy protection.
The core idea of the model is to compare pairs of records, one from each file, and classify each pair as a match, a non-match, or a possible match that requires manual review.
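The model's strength lies in its probabilistic approach: rather than requiring exact agreement on every field, it weighs the evidence for and against a match, which allows for uncertainty in the matching process.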
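The probabilistic approach can be illustrated with a minimal Python sketch. It is not taken from the original 1969 formulation verbatim. It assumes exact-agreement comparison on each field, conditional independence between fields, and hand-picked probabilities and thresholds. The names `M_PROBS`, `U_PROBS`, `comparison_vector`, `match_weight`, and `classify`, along with all numeric values, are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical per-field probabilities (illustrative values, not estimated from data):
#   m = P(field agrees | the two records refer to the same entity)
#   u = P(field agrees | the two records refer to different entities)
M_PROBS = {"surname": 0.95, "birth_year": 0.90, "postcode": 0.85}
U_PROBS = {"surname": 0.01, "birth_year": 0.05, "postcode": 0.02}
FIELDS = ["surname", "birth_year", "postcode"]


def comparison_vector(rec_a, rec_b, fields=FIELDS):
    """Return 1 for each field on which the two records agree exactly, else 0."""
    return [1 if rec_a[f] == rec_b[f] else 0 for f in fields]


def match_weight(gamma, fields=FIELDS):
    """Sum the log2 likelihood ratios of the observed agreement pattern.

    Assumes the fields are conditionally independent given match status.
    """
    total = 0.0
    for agrees, f in zip(gamma, fields):
        m, u = M_PROBS[f], U_PROBS[f]
        if agrees:
            total += math.log2(m / u)              # agreement adds evidence for a match
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement adds evidence against
    return total


def classify(weight, lower=0.0, upper=8.0):
    """Three-way decision using two assumed thresholds on the total weight."""
    if weight >= upper:
        return "match"
    if weight <= lower:
        return "non-match"
    return "possible match"


if __name__ == "__main__":
    a = {"surname": "Berg", "birth_year": 1970, "postcode": "11122"}
    b = {"surname": "Berg", "birth_year": 1971, "postcode": "41301"}
    gamma = comparison_vector(a, b)
    print(gamma, classify(match_weight(gamma)))
```

In practice the m- and u-probabilities are estimated from the data rather than set by hand, and the two thresholds are chosen to keep the rates of false matches and false non-matches within acceptable bounds.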
Key components of the Fellegi-Sunter model include the comparison vector, which represents the agreement/disagreement patterns for