Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

May 29, 2023

Alokendu Mazumder, Pagadala Krishna Murthy, Punit Rathore

Figure 1 for A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

Figure 2 for A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

Figure 3 for A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

Figure 4 for A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

Share this with someone who'll enjoy it:

Abstract:Estimating the number of clusters and underlying cluster structure in a dataset is a crucial task. Real-world data are often unlabeled, complex and high-dimensional, which makes it difficult for traditional clustering algorithms to perform well. In recent years, a matrix reordering based algorithm, called "visual assessment of tendency" (VAT), and its variants have attracted many researchers from various domains to estimate the number of clusters and inherent cluster structure present in the data. However, these algorithms fail when applied to high-dimensional data due to the curse of dimensionality, as they rely heavily on the notions of closeness and farness between data points. To address this issue, we propose a deep-learning based framework for cluster structure assessment in complex, image datasets. First, our framework generates representative embeddings for complex data using a self-supervised deep neural network, and then, these low-dimensional embeddings are fed to VAT/iVAT algorithms to estimate the underlying cluster structure. In this process, we ensured not to use any prior knowledge for the number of clusters (i.e k). We present our results on four real-life image datasets, and our findings indicate that our framework outperforms state-of-the-art VAT/iVAT algorithms in terms of clustering accuracy and normalized mutual information (NMI).

* Submitted to IEEE SMC 2023

View paper on

Share this with someone who'll enjoy it:

Title:A Self-Supervised Approach for Cluster Assessment of High-Dimensional Data

Paper and Code