Fast Inference in Conditional Topic Models
Philipp Hennig, David Stern, Thore Graepel and Ralf Herbrich
In: ICML 2011, Jun 28 - Jul 2 2011, Bellevue, Washington, USA.
Topic models use word frequencies to describe semantic similarity between documents in a low-dimensional latent space. Modern document repositories often record metadata in addition to the words themselves, which can convey important semantic information. Because such corpora can also be very large, inference should be computationally lightweight. We construct a fast approximate inference scheme for topic models conditional on arbitrary features of the document. We also study the viability of single pass inference in such models, and show experimental results from large online document corpora. The result is the first “web-scale” conditional topic model.