clusterset
A cluster set, or clusterset, is a collection of clusters produced by a clustering analysis. It represents a partition of a dataset into discrete groups or, equivalently, a mapping that assigns each data item to a cluster label. In practice, a clusterset is described by the cluster identifiers, member items, and often a summary of each cluster such as its centroid, size, and dispersion.
Representations of a clusterset can vary. Common formats include a label vector that assigns a cluster id
Clustering algorithms generate clustersets as outputs. Popular methods include k-means, hierarchical clustering, DBSCAN, and spectral clustering,
Evaluation of a clusterset can be internal, external, or relative. Internal measures assess compactness and separation
Applications of clustersets span many domains, including pattern discovery, market segmentation, image and document clustering, and
Variations include soft clustering, where data points have degrees of membership across clusters, and overlapping clustering,
See also: clustering, cluster analysis, cluster validity indices, k-means, hierarchical clustering, DBSCAN, Gaussian mixture models. In