Outcomepurity

Outcomepurity is a metric used in data analysis to quantify the homogeneity of outcomes within predefined groups, such as clusters, bins, or leaves of a decision tree. The premise is that a group with a single dominant outcome is more predictable, whereas mixed outcomes reduce reliability.

Definition and calculation: For a dataset of N items partitioned into K groups G_1 to G_K, with m possible outcome classes, let p_{i,c} be the proportion of items in group i with outcome c. The purity of group i is max_c p_{i,c}, the share of its majority outcome. Outcomepurity is the size-weighted average of the group purities:

OC = sum_{i=1}^K (|G_i| / N) * max_c p_{i,c}

The value ranges from 1/m (outcomes completely mixed within every group) up to 1 (each group contains only one outcome).

Interpretation and relation to other measures: OC is the average probability that a randomly chosen item from a group belongs to that group's majority outcome, so higher values indicate greater within-group predictability. Outcomepurity is related to traditional purity measures used in clustering and classification, but it emphasizes outcome homogeneity within partitions rather than overall error rates.

Applications and limitations: It is used to evaluate and compare partitions produced by clustering, binning, or decision-tree pruning, and it can guide feature engineering by favoring splits that increase within-group outcome consistency. Limitations include sensitivity to group size, potential bias under imbalanced classes, and a focus on within-group homogeneity that may ignore global performance. The metric also depends on how the data are partitioned, so it should be used alongside other performance indicators.

Related metrics: purity, Gini impurity, and entropy. Outcomepurity is a conceptual measure with varying informal definitions in practice.
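The formula above can be sketched in a few lines of Python (a minimal illustration; the function name, toy data, and group labels are my own, not from any standard library):

```python
from collections import Counter, defaultdict

def outcome_purity(groups, outcomes):
    """Outcomepurity (OC): size-weighted majority-outcome share of a partition.

    groups[j] is the group assignment of item j; outcomes[j] is its outcome class.
    """
    if not groups or len(groups) != len(outcomes):
        raise ValueError("groups and outcomes must be non-empty and equal length")
    by_group = defaultdict(list)
    for g, c in zip(groups, outcomes):
        by_group[g].append(c)
    n = len(groups)
    # Sum over groups: (|G_i| / N) * max_c p_{i,c}
    return sum(
        (len(members) / n) * (Counter(members).most_common(1)[0][1] / len(members))
        for members in by_group.values()
    )

# Two equal-sized groups: "a" is pure, "b" is evenly mixed between two classes,
# so OC = 0.5 * 1.0 + 0.5 * 0.5 = 0.75
groups = ["a", "a", "b", "b"]
outcomes = ["x", "x", "x", "y"]
print(outcome_purity(groups, outcomes))  # 0.75
```

With m = 2 classes, this example sits between the bounds given above: 1/m = 0.5 for fully mixed groups and 1.0 for fully pure ones.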
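The related group-level measures named above can be compared on the same class-proportion vector; this sketch (function name mine) contrasts majority-class purity with Gini impurity and entropy for a pure group and a fully mixed one:

```python
import math

def impurity_measures(proportions):
    """Return (purity, Gini impurity, entropy) for a class-proportion vector.

    `proportions` holds p_{i,c} for one group i and must sum to 1.
    """
    purity = max(proportions)                        # majority-class share, max_c p_{i,c}
    gini = 1.0 - sum(p * p for p in proportions)     # Gini impurity: 1 - sum_c p^2
    entropy = sum(-p * math.log2(p) for p in proportions if p > 0)  # in bits
    return purity, gini, entropy

# A pure group versus an evenly mixed two-class group
print(impurity_measures([1.0, 0.0]))  # (1.0, 0.0, 0.0)
print(impurity_measures([0.5, 0.5]))  # (0.5, 0.5, 1.0)
```

Purity rises toward 1 as a group becomes homogeneous, while Gini impurity and entropy fall toward 0, which is why Outcomepurity rewards the same splits that impurity-based criteria penalize.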