PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

An Extension of PLSA for Document Clustering
Youngmin Kim, Jean-François Pessiot, Massih Amini and Patrick Gallinari
In: CIKM 2008, 26-30 Oct 2008, Napa Valley, California, USA.


In this paper, we propose an extension of the PLSA model in which an extra latent variable allows the model to co-cluster documents and terms simultaneously. We show on three datasets that the extended model produces statistically significant improvements, with respect to two clustering measures, over the original PLSA and the multinomial mixture (MM) models.

EPrint Type: Conference or Workshop Item (Poster)
Project Keyword: UNSPECIFIED
Subjects: Information Retrieval & Textual Information Access
ID Code: 4159
Deposited By: Massih Amini
Deposited On: 22 August 2008