PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Hierarchical generative biclustering for MicroRNA Expression Analysis
Jose Caldas and Samuel Kaski
Journal of Computational Biology pp. 251-261, 2011.

Abstract

Clustering methods are a useful and common first step in gene expression studies, but the results may be hard to interpret. We bring in explicitly an indicator of which genes tie each cluster, changing the setup to biclustering. Furthermore, we make the indicators hierarchical, resulting in a hierarchy of progressively more specific biclusters. A non-parametric Bayesian formulation makes the model rigorous yet flexible and computations feasible. The model can additionally be used in information retrieval for relating relevant samples. We show that the model outperforms four other biclustering procedures on a large miRNA data set. We also demonstrate the model's added interpretability and information retrieval capability in a case study. Software is publicly available at http://research.ics.tkk.fi/mi/software/treebic/.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Theory & Algorithms
ID Code:7588
Deposited By:Samuel Kaski
Deposited On:21 February 2012