Jan 23, 2015

K-Means, coverings, and Voronoi diagrams

This is the 4th of a series of posts on cluster-algorithms and ideas in data analysis.

The $k$-Means algorithm computes a Voronoi partition of the data set such that each landmark is given by the centroid of the corresponding cell. Let me quickly quote Wikipedia on the history of the algorithm before I explain what it is about: The term "$k$-means" was first used by James MacQueen in 1967, though the idea goes back to Hugo Steinhaus in 1957. The standard algorithm was first proposed by Stuart Lloyd in 1957.