Beyond the Nystrom Approximation: Speeding up Spectral Clustering using Uniform Sampling and Weighted Kernel k-means

Beyond the Nystrom Approximation: Speeding up Spectral Clustering using Uniform Sampling and Weighted Kernel k-means

Mahesh Mohan, Claire Monteleoni

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 2494-2500. https://doi.org/10.24963/ijcai.2017/347

In this paper we present a framework for spectral clustering based on the following simple scheme: sample a subset of the input points, compute the clusters for the sampled subset using weighted kernel k-means (Dhillon et al. 2004) and use the resulting centers to compute a clustering for the remaining data points. For the case where the points are sampled uniformly at random without replacement, we show that the number of samples required depends mainly on the number of clusters and the diameter of the set of points in the kernel space. Experiments show that the proposed framework outperforms the approaches based on the Nystrom approximation both in terms of accuracy and computation time.
Keywords:
Machine Learning: Data Mining
Machine Learning: Kernel Methods
Machine Learning: Unsupervised Learning