Title: Categorical data clustering: 25 years beyond K-modes

Aug 30, 2024

Tai Dinh, Wong Hauchi, Philippe Fournier-Viger, Daniil Lisik, Minh-Quyet Ha, Hieu-Chi Dam, Van-Nam Huynh

Abstract: The clustering of categorical data is a common and important task in computer science, offering profound implications across a spectrum of applications. Unlike purely numerical datasets, categorical data often lack inherent ordering as in nominal data, or have varying levels of order as in ordinal data, thus requiring specialized methodologies for efficient organization and analysis. This review provides a comprehensive synthesis of categorical data clustering in the past twenty-five years, starting from the introduction of K-modes. It elucidates the pivotal role of categorical data clustering in diverse fields such as health sciences, natural sciences, social sciences, education, engineering and economics. Practical comparisons are conducted for algorithms having public implementations, highlighting distinguishing clustering methodologies and revealing the performance of recent algorithms on several benchmark categorical datasets. Finally, challenges and opportunities in the field are discussed.
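
To illustrate why unordered categories call for specialized methods, the following is a minimal sketch of the K-modes idea that the review takes as its starting point: distances between records are counted as attribute mismatches (simple matching), and each cluster is summarized by its per-attribute mode rather than a mean. This is not code from the paper. The function names, the toy records, and the choice of passing initial modes in by hand are all assumptions made for illustration.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """Simple matching: number of attributes on which two records differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """Most frequent category per attribute; plays the role of a centroid."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def k_modes(records, initial_modes, max_iter=20):
    """Alternate between assigning records to the nearest mode and
    recomputing modes, stopping when assignments no longer change.

    `initial_modes` is supplied by the caller (an assumption of this sketch;
    practical implementations use an initialization heuristic).
    """
    modes = list(initial_modes)
    labels = None
    for _ in range(max_iter):
        # Assignment step: nearest mode under simple matching.
        new_labels = [
            min(range(len(modes)),
                key=lambda j: matching_dissimilarity(r, modes[j]))
            for r in records
        ]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: recompute each non-empty cluster's mode.
        for j in range(len(modes)):
            members = [r for r, lab in zip(records, labels) if lab == j]
            if members:
                modes[j] = column_modes(members)
    return labels, modes


# Hypothetical toy data: (colour, size, shape)
records = [
    ("red", "small", "round"),
    ("red", "small", "square"),
    ("blue", "large", "round"),
    ("blue", "large", "square"),
]
labels, modes = k_modes(records, initial_modes=[records[0], records[2]])
```

No arithmetic is ever performed on the category values themselves, only equality tests and frequency counts, which is why the approach applies to nominal data where a mean is undefined.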
