PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Learning from partially labelled data -- with confidence
Eric Gaussier and Cyril Goutte
In: Learning with Partially Classified Training Data - ICML 2005 workshop, 7 August, 2005, Bonn, Germany.


In this paper, we propose a unifying treatment of several strategies for training mixture models from label-deficient data. After a review of different approaches to estimating classification models on partially labelled data using mixture models, we identify a number of problems which lead us to propose a new EM variant. The aim is to better handle unlabelled data and provide a more confident discrimination decision. This is illustrated by an experimental comparison of the different models on the Leptograpsus crab data.

PDF - PASCAL Members only - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
ID Code:1025
Deposited By:Cyril Goutte
Deposited On:22 July 2005