MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

k-Means has Polynomial Smoothed Complexity

Bodo Manthey
Fachrichtung Informatik - Saarbrücken
Talk
AG 1, AG 4, RG1, MMCI, AG 3, AG 5  
AG Audience
English

Date, Time and Location

Thursday, 28 May 2009
09:45
60 Minutes
E2 4 - Math
Seminar Room 10 (012)
Saarbrücken

Abstract

The k-means method is one of the most widely used clustering algorithms, drawing its popularity from its speed in practice. Recently, however, it was shown to have exponential worst-case running time. In order to close the gap between practical performance and theoretical analysis, the k-means method has been studied in the model of smoothed analysis. But even the smoothed analyses so far are unsatisfactory as the bounds are still super-polynomial in the number n of data points.


We settle the smoothed running time of the k-means method. We show that the smoothed number of iterations is bounded by a polynomial in n and 1/sigma, where sigma is the standard deviation of the Gaussian perturbations. This means that if an arbitrary input data set is randomly perturbed, then the k-means method will run in expected polynomial time on that input set.

Contact

Bodo Manthey
5502
--email hidden
passcode not visible
logged in users only

gk-sek, 05/25/2009 11:56 -- Created document.