QA250 : Shrinkage estimation in K-means cluster analysis for multivariate normal distribution
Thesis > Central Library of Shahrood University > Mathematical Sciences > MSc > 2014
Authors:
Abstarct: We study a general algorithm to improve the accuracy in cluster analysis that employs
the James-Stein shrinkage effect in k-means clustering. We shrink the centroids of clusters
toward the overall mean of all data using a James-Stein-type adjustment, and then the
James-Stein shrinkage estimators act as the new centroids in the next clustering iteration
until convergence. We compare the shrinkage results to the traditional k-means method.
A Monte Carlo simulation shows that the magnitude of the improvement depends on the
within-cluster variance and especially on the effective dimension of the covariance matrix.
Using the Rand index, we demonstrate that accuracy increases significantly in simulated
data and in a real data example.
Keywords:
#Clustering #K-means #Shrinkage estimator #Baranchik estimator #Rand index
Keeping place: Central Library of Shahrood University
Visitor:
Keeping place: Central Library of Shahrood University
Visitor: