CounterfactualFairness - Infinite Lexicon - Infinite Lexicon

CounterfactualFairness

Counterfactual Fairness is a concept in algorithmic fairness that aims to ensure a decision made by a machine learning model would remain the same if a sensitive attribute of an individual had been different, all other relevant factors being equal. It's a causal inference-based approach to fairness. The core idea is to assess whether a model's outcome is dependent on a protected characteristic, such as race, gender, or age, by imagining a counterfactual scenario. For example, if a loan application is denied, counterfactual fairness asks if it would have been approved if the applicant belonged to a different demographic group but had identical qualifications and other non-sensitive attributes.

To achieve counterfactual fairness, one typically uses causal models to represent the relationships between attributes, decisions,

a

a

correlation-based