PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Online Learning for Multi-label and Multi-variate Performance Measures
X Zhang, T Graepel and R Herbrich
In: 13th International Conference on Artificial Intelligence and Statistics (AISTATS)(2010).

Abstract

Many real world applications employ multi-variate performance measures and each example can belong to multiple classes. The currently most popular approaches train an SVM for each class, followed by ad hoc thresholding. Probabilistic models using Bayesian decision theory are also commonly adopted. In this paper, we propose a Bayesian online multi-label classication framework (BOMC) which learns a probabilistic linear classier. The likelihood is modeled by a graphical model similar to TrueSkill TM , and inference is based on Gaussian density lntefering with expectation propagation. Using samples from the posterior, we label the testing data by maximizing the expected F1-score. Our experiments on Reuters1-v2 dataset show BOMC compares favorably to the state-of-the-art online learners in macro-averaged F1-score and training time.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
Natural Language Processing
ID Code:7318
Deposited By:Wray Buntine
Deposited On:17 March 2011