COUNTkolumn
COUNTkolumn is a data transformation operation that enriches a dataset by counting the occurrences of values in a specified column. The result is a new feature that represents, for each row, how many rows share the same value in the target column. This makes COUNTkolumn a simple form of frequency encoding focused on a single column.
In practice, applying COUNTkolumn to a column called, for example, Status, distributes a count per value across
Variants and usage. COUNTkolumn can be used for feature engineering in machine learning, data profiling, and
Limitations and considerations. COUNTkolumn requires scanning the target column and can be expensive on large datasets.
See also: frequency encoding, histogram, SQL COUNT, data profiling.