English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Report

Consistency of Spectral Clustering

MPS-Authors
/persons/resource/persons76237

von Luxburg,  U
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons83824

Bousquet,  O
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

MPIK-TR-134.pdf
(Publisher version), 327KB

Supplementary Material (public)
There is no public supplementary material available
Citation

von Luxburg, U., Belkin, M., & Bousquet, O.(2004). Consistency of Spectral Clustering (134). Tübingen, Germany: Max Planck Institute for Biological Cybernetics.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-D74B-6
Abstract
Consistency is a key property of statistical algorithms, when the data is drawn from some underlying
probability distribution. Surprisingly, despite decades of work, little is known about consistency of most clustering
algorithms. In this paper we investigate consistency of a popular family of spectral clustering algorithms, which
cluster the data with the help of eigenvectors of graph Laplacian matrices. We show that one of the two of major
classes of spectral clustering (normalized clustering) converges under some very general conditions, while the other
(unnormalized), is only consistent under strong additional assumptions, which, as we demonstrate, are not always
satisfied in real data. We conclude that our analysis provides strong evidence for the superiority of normalized
spectral clustering in practical applications. We believe that methods used in our analysis will provide a basis for
future exploration of Laplacian-based methods in a statistical setting.