PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Large Scale Non-parametric Inference: Data Parallelisation in the Indian Buffet Process
Finale Doshi, David Knowles, Shakir Mohamed and Zoubin Ghahramani
In: NIPS 2009, 7-12 December 2009, Vancouver, BC, Canada.

Abstract

Nonparametric Bayesian models provide a framework for flexible probabilistic modelling of complex datasets. Unfortunately, the high-dimensional averages required for Bayesian methods can be slow to compute, especially with the unbounded representations used by nonparametric models. We address the challenge of scaling Bayesian inference to the increasingly large datasets found in real-world applications. We focus on parallelisation of inference in the Indian Buffet Process (IBP), which allows data points to have an unbounded number of sparse latent features. Our novel MCMC sampler divides a large dataset between multiple processors and uses message passing to compute the global likelihoods and posteriors. This algorithm, the first parallel inference scheme for IBP-based models, scales to datasets orders of magnitude larger than was previously possible.
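
The idea of dividing the data between processors and passing messages to obtain global quantities can be illustrated with a small sketch. It is not the sampler from the paper: it assumes a linear-Gaussian likelihood X = Z A + noise with a fixed binary feature matrix Z, and it simulates the processors with a plain loop over shards. The function names (`local_stats`, `combine`, `posterior_mean_A`, `demo`) and the noise parameters `sigma_x` and `sigma_a` are hypothetical, chosen only for illustration. Under these assumptions the statistics Z^T Z and Z^T X are sums over data points, so each shard can compute its own contribution and only these small matrices need to be exchanged.

```python
import numpy as np

def local_stats(Z_p, X_p):
    """Per-shard statistics for the assumed model X = Z A + noise."""
    return Z_p.T @ Z_p, Z_p.T @ X_p

def combine(messages):
    """Sum the per-shard messages into global statistics."""
    ZtZ = sum(m[0] for m in messages)
    ZtX = sum(m[1] for m in messages)
    return ZtZ, ZtX

def posterior_mean_A(ZtZ, ZtX, sigma_x=1.0, sigma_a=1.0):
    """Posterior mean of A under a zero-mean Gaussian prior with std sigma_a."""
    K = ZtZ.shape[0]
    ridge = (sigma_x ** 2 / sigma_a ** 2) * np.eye(K)
    return np.linalg.solve(ZtZ + ridge, ZtX)

def demo(n=600, k=4, d=5, n_shards=3, seed=0):
    """Compare statistics assembled from shards with those from the full data."""
    rng = np.random.default_rng(seed)
    Z = (rng.random((n, k)) < 0.3).astype(float)   # synthetic binary features
    A = rng.normal(size=(k, d))                    # synthetic feature loadings
    X = Z @ A + 0.1 * rng.normal(size=(n, d))      # synthetic observations
    shards = zip(np.array_split(Z, n_shards), np.array_split(X, n_shards))
    messages = [local_stats(Z_p, X_p) for Z_p, X_p in shards]
    return combine(messages), (Z.T @ Z, Z.T @ X)
```

Calling `demo()` returns the statistics assembled from the shards alongside those computed on the full dataset; the two agree up to floating-point error, and either pair can be passed to `posterior_mean_A`. A full sampler would also have to resample Z and handle newly created features, which this sketch leaves out.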

EPrint Type: Conference or Workshop Item (Poster)
Project Keyword: UNSPECIFIED
Subjects: Learning/Statistics & Optimisation; Theory & Algorithms
ID Code: 5524
Deposited By: Shakir Mohamed
Deposited On: 21 January 2010