TY - GEN
T1 - Minimum description length and clustering with exemplars
AU - Lai, Po Hsiang
AU - O'Sullivan, Joseph A.
AU - Pless, Robert
PY - 2009
Y1 - 2009
N2 - We propose an information-theoretic clustering framework for density-based clustering and similarity or distance-based clustering with objective functions of clustering performance derived from stochastic complexity and minimum description length (MDL) arguments. Under this framework, the number of clusters and parameters can be determined in a principled way without prior knowledge from users. We show that similarity-based clustering can be viewed as combinatorial optimization on graphs. We propose two clustering algorithms, one of which relies on a minimum arborescence tree algorithm which returns optimal clustering under the proposed MDL objective function for similarity-based clustering. We demonstrate clustering performance on synthetic data.
AB - We propose an information-theoretic clustering framework for density-based clustering and similarity or distance-based clustering with objective functions of clustering performance derived from stochastic complexity and minimum description length (MDL) arguments. Under this framework, the number of clusters and parameters can be determined in a principled way without prior knowledge from users. We show that similarity-based clustering can be viewed as combinatorial optimization on graphs. We propose two clustering algorithms, one of which relies on a minimum arborescence tree algorithm which returns optimal clustering under the proposed MDL objective function for similarity-based clustering. We demonstrate clustering performance on synthetic data.
UR - https://www.scopus.com/pages/publications/70449482067
U2 - 10.1109/ISIT.2009.5205937
DO - 10.1109/ISIT.2009.5205937
M3 - Conference contribution
AN - SCOPUS:70449482067
SN - 9781424443130
T3 - IEEE International Symposium on Information Theory - Proceedings
SP - 1318
EP - 1322
BT - 2009 IEEE International Symposium on Information Theory, ISIT 2009
T2 - 2009 IEEE International Symposium on Information Theory, ISIT 2009
Y2 - 28 June 2009 through 3 July 2009
ER -