PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Nonparametric Bayesian Sparse Factor Models with application to Gene Expression modelling
Zoubin Ghahramani and David Knowles
Annals of Applied Statistics 2010.


A nonparametric Bayesian extension of Factor Analysis (FA) is proposed where observed data Y is modeled as a linear superposition, G, of a potentially infinite number of hidden factors, X. The Indian Buffet Process (IBP) is used as a prior on G to incorporate sparsity and to allow the number of latent features to be inferred. The model's utility for modeling gene expression data is investigated using randomly generated datasets based on a known sparse connectivity matrix for E. Coli, and on three biological datasets of increasing complexity.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
ID Code:7466
Deposited By:David Knowles
Deposited On:17 March 2011