PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Block clustering of contingency table and Mixture Model
Mohamed Nadif and Gérard Govaert
In: Advances in Intelligent Data Analysis, 6th International Symposium on Data Analysis, IDA 2005, 8-10 Sep 2005, Madrid, Spain.

Abstract

Block clustering, or simultaneous clustering, has become an important challenge in data mining. It is of practical importance in a wide variety of applications, such as text, web-log, and market basket data analysis. Typically, the data arising in these applications are arranged as a two-way contingency or co-occurrence table. In this paper, we embed the block clustering problem in the mixture approach. We propose a Poisson block mixture model and, adopting the classification maximum likelihood principle, derive a new algorithm. Simplicity, fast convergence, and scalability are the major advantages of the proposed approach.

EPrint Type: Conference or Workshop Item (Talk)
Project Keyword: UNSPECIFIED
Subjects: Computational, Information-Theoretic Learning with Statistics
ID Code: 1951
Deposited By: Gérard Govaert
Deposited On: 30 December 2005