Jaccardsamankaltaisuus
Jaccardsamankaltaisuus, also known as the Jaccard index or Jaccard similarity coefficient, is a statistic used for gauging the similarity and diversity of sample sets. It is defined as the size of the intersection divided by the size of the union of the sample sets. Mathematically, for two sets A and B, the Jaccard index J(A, B) is calculated as: J(A, B) = |A ∩ B| / |A ∪ B|. The Jaccard index is always a value between 0 and 1. A value of 1 means that the two sets are identical, while a value of 0 means that the two sets have no elements in common.
This metric is widely used in various fields, including data mining, information retrieval, and machine learning.