Demystifying Information-Theoretic Clustering

Greg Ver Steeg; Aram Galstyan; Fei Sha; Simon DeDeo

Back to ICML

ICML 2014

Demystifying Information-Theoretic Clustering

Conference Paper Cycle 1 Papers Artificial Intelligence · Machine Learning

Details

Abstract

We propose a novel method for clustering data which is grounded in information-theoretic principles and requires no parametric assumptions. Previous attempts to use information theory to define clusters in an assumption-free way are based on maximizing mutual information between data and cluster labels. We demonstrate that this intuition suffers from a fundamental conceptual flaw that causes clustering performance to deteriorate as the amount of data increases. Instead, we return to the axiomatic foundations of information theory to define a meaningful clustering measure based on the notion of consistency under coarse-graining for finite data.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: International Conference on Machine Learning
Archive span: 1993-2025
Indexed papers: 16471
Paper id: 270330096991199066