Estimating Labels from Label Proportions
Novi Quadrianto, Alex J. Smola, Tiberio S. Caetano and Quoc V. Le
In: ICML 2008, Helsinki(2008).
Consider the following problem: given sets of unlabeled observations, each set with known label proportions, predict the labels of another
set of observations, also with known label proportions. This problem appears in areas like e-commerce, spam ltering and improper
content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform
convergence sense. Experiments show that our method works well in practice.