PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Evolutionary approaches for the reverse-engineering of gene regulatory networks: a study on a biologically realistic dataset
Cédric,C. Auliac, Vincent, V. Frouin, Xavier, X. Gidrol and Florence d'Alché-Buc
BMC Bioinformatics Volume 9, Number 91, 2008.

Abstract

Background: Inferring gene regulatory networks from data requires the development of algorithms devoted to structure extraction. When only static data are available, gene interactions may be modelled by a Bayesian Network (BN) that represents the presence of direct interactions from regulators to regulees by conditional probability distributions. We used enhanced evolutionary algorithms to stochastically evolve a set of candidate BN structures and found the model that best fits data without prior knowledge. Results: We proposed various evolutionary strategies suitable for the task and tested our choices using simulated data drawn from a given bio-realistic network of $35$ nodes, the so-called insulin network, which has been used in the literature for benchmarking. We assessed the inferred models against this reference to obtain statistical performance results. We then compared performances of evolutionary algorithms using two kinds of recombination operators that operate at different scales in the graphs. We introduced a niching strategy that reinforces diversity through the population and avoided trapping of the algorithm in one local minimum in the early steps of learning. We show the limited effect of the mutation operator when niching is applied. Finally, we compared our best evolutionary approach with various well known learning algorithms (MCMC, K$2$, greedy search) devoted to BN structure learning. Conclusions: We studied the behaviour of an evolutionary approach enhanced by niching for the learning of gene regulatory networks with BN. We show that this approach outperforms classical structure learning methods in elucidating the original model. These results were obtained for the learning of a bio-realistic network and, more importantly, on various small datasets. This is a suitable approach for learning transcriptional regulatory networks from real datasets without prior knowledge.

EPrint Type:Article
Additional Information:The paper can be downloaded freely at : http://www.biomedcentral.com/1471-2105/9/91
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
ID Code:4060
Deposited By:Florence d'Alché-Buc
Deposited On:25 February 2008