Randindeks
Randindeks, or Rand index, is a measure of similarity between two data clusterings or partitions of the same dataset. It evaluates how much two partitions agree in terms of which pairs of elements should be grouped together or separated.
Computation is typically done from a contingency table. Let P and Q be two partitions of n
C = sum_{i,j} binomial(n_ij, 2).
The Rand index is RI = 1 - (A + B - 2C) / N.
Equivalently, RI can be interpreted as the proportion of pairs for which the two partitions agree, i.e.,
Interpretation and use are straightforward but with caveats. The Rand index ranges from 0 to 1, with