targetencoding - Infinite Lexicon - Infinite Lexicon

targetencoding

Target encoding, also known as mean encoding or likelihood encoding, is a feature engineering technique used in machine learning. It replaces categorical features with numerical values that represent the average of the target variable for each category. This process aims to capture the relationship between a categorical feature and the target variable in a numerical format that machine learning models can readily process.

The core idea is to calculate the mean of the target variable for each unique category within

While effective, target encoding can lead to overfitting, especially when dealing with rare categories or small

a

a

a

a

Cross-validation

high-cardinality