Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chaoqi Jia

Efficient Constrained $k$-Center Clustering with Background Knowledge

Jan 23, 2024

Longkun Guo, Chaoqi Jia, Kewen Liao, Zhigang Lu, Minhui Xue

Figure 1 for Efficient Constrained $k$-Center Clustering with Background Knowledge

Figure 2 for Efficient Constrained $k$-Center Clustering with Background Knowledge

Figure 3 for Efficient Constrained $k$-Center Clustering with Background Knowledge

Figure 4 for Efficient Constrained $k$-Center Clustering with Background Knowledge

Abstract:Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted $k$-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including $k$-center are inherently $\mathcal{NP}$-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained $k$-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.

Via

Access Paper or Ask Questions