Kernel-based clustering algorithms can capture the non-linear structure of real-world data. Among them, kernel k-means has gained popularity due to its simple iterative nature and ease of implementation. However, its run-time complexity and memory footprint grow quadratically with the size of the data set, so large data sets cannot be clustered efficiently. In this paper, we propose a randomized approximation scheme, called Approximate Kernel k-means. We approximate the cluster centers using the kernel similarity between a small set of sampled points and all the points in the data set. We show that the proposed method achieves better clustering performance than traditional clustering schemes based on low-rank kernel approximation. We also demonstrate that its running time and memory requirements are significantly lower than those of kernel k-means, with only a small reduction in clustering quality on several public-domain large data sets. We then employ ensemble clustering techniques to further enhance the performance of our algorithm.
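
To make the idea concrete, the following is a minimal sketch of a sampling-based approximation in this spirit (an illustration under stated assumptions, not the exact formulation from the paper): cluster centers are restricted to kernel combinations of m randomly sampled points, so only an n x m kernel block is ever computed instead of the full n x n kernel matrix. The RBF kernel, function names, and parameters are assumptions made for this sketch.

```python
# Illustrative sketch only: sampling-based approximate kernel k-means.
# Centers live in the span of m sampled "landmark" points, so the cost
# per iteration depends on the n x m kernel block rather than n x n.
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """Pairwise RBF kernel between rows of A and rows of B (assumed kernel)."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * sq)

def approx_kernel_kmeans(X, k, m, gamma=1.0, n_iter=50, seed=0):
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    S = rng.choice(n, size=m, replace=False)       # sampled landmark points
    K_B = rbf_kernel(X, X[S], gamma)               # n x m kernel block
    K_hat = rbf_kernel(X[S], X[S], gamma)          # m x m kernel block
    K_hat_inv = np.linalg.pinv(K_hat)

    labels = rng.integers(0, k, size=n)            # random initial assignment
    for _ in range(n_iter):
        # Each center is a kernel combination of the m sampled points:
        # alpha[c] = (mean K_B row over cluster c) @ K_hat^{-1}
        alpha = np.zeros((k, m))
        sizes = np.bincount(labels, minlength=k)
        for c in range(k):
            idx = np.flatnonzero(labels == c)
            if idx.size:
                alpha[c] = K_B[idx].mean(0) @ K_hat_inv
        # Squared distance to each center, dropping the K(x_i, x_i) term,
        # which is constant per point and does not affect assignments.
        center_norms = np.einsum('cm,mp,cp->c', alpha, K_hat, alpha)
        dist = center_norms[None, :] - 2 * K_B @ alpha.T
        dist[:, sizes == 0] = np.inf               # ignore empty clusters
        new_labels = dist.argmin(1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```

With m much smaller than n, memory drops from O(n^2) to O(nm), which reflects the trade-off between running time, memory, and clustering quality discussed above.
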