Randindex
Randindex, or Rand index, is a measure of the similarity between two data clusterings of the same dataset. It assesses how well two partitions agree on the grouping of items, by considering all pairs of items and counting those that are either clustered together in both partitions or clustered apart in both partitions. The index ranges from 0 to 1, with 1 indicating perfect agreement.
To compute the Rand index, consider all unordered pairs of items. Classify each pair into four categories:
Limitations include its sensitivity to chance: even random labelings can yield nonzero values, especially with many
The Rand index was introduced by William M. Rand in 1971 and remains a foundational tool for