Advancements in Clustering Algorithm Design

The field of clustering algorithm design is witnessing significant developments, with a focus on enhancing the performance and robustness of existing algorithms. Researchers are exploring new approaches that integrate density and geometry, leading to improved clustering accuracy and efficiency. Noteworthy papers in this area include:

  • CoreSPECT, which leverages the interplay between distribution and geometry to boost the performance of simple algorithms like K-Means and GMM, achieving substantial gains in clustering accuracy.
  • CAS Condensed and Accelerated Silhouette, which introduces an efficient method for determining the optimal K in K-Means clustering, achieving up to 99 percent faster execution times on high-dimensional datasets while retaining precision and scalability.
  • Average Sensitivity of Hierarchical k-Median Clustering, which proposes an efficient algorithm for hierarchical k-median clustering and theoretically proves its low average sensitivity and high clustering quality.
  • On Tight Robust Coresets for k-Medians Clustering, which obtains new constructions for coresets in various metric spaces, with optimal coreset sizes up to logarithmic factors.
  • Ranking Vectors Clustering, which establishes the NP-hardness of the k-centroids ranking vectors clustering problem and develops an efficient approximation algorithm with theoretical error bounds.

Sources

CoreSPECT: Enhancing Clustering Algorithms via an Interplay of Density and Geometry

CAS Condensed and Accelerated Silhouette: An Efficient Method for Determining the Optimal K in K-Means Clustering

Two-cluster test

Average Sensitivity of Hierarchical $k$-Median Clustering

On Tight Robust Coresets for $k$-Medians Clustering

Ranking Vectors Clustering: Theory and Applications

Built with on top of