Reducing the Annotation Burden in Text Classification
Anastasia Krithara, Cyril Goutte, Massih Amini and Jean-Michel Renders
In: I International Conference on Multidisciplinay Information Sciences and Technologies (InSciT2006), 25-28 October 2006, Merida, Spain.
In this paper we describe a method which combines semi-supervised and active learning for the classification task. In particular, we propose a semi-supervised PLSA (Probabilistic Latent Semantic Analysis) algorithm, combined with a pool-based active learning method, in order to classify text documents.