PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

A mathematical model for the validation of gene selection methods
Marco Muselli, Alberto Bertoni, Marco Frasca, Alessandro Beghini, Francesca Ruffino and Giorgio Valentini
IEEE ACM Transactions on Computational Biology and Bioinformatics 2010.

There is a more recent version of this eprint available. Click here to view it.

Abstract

Gene selection methods aim at determining biologically relevant subsets of genes in DNA microarray experiments. However, their assessment and validation represent a major difficulty since the subset of biologically relevant genes is usually unknown. To solve this problem a novel procedure for generating biologically plausible synthetic gene expression data is proposed. It is based on a proper mathematical model representing gene expression signatures and expression profiles through Boolean threshold functions. The results show that the proposed procedure can be successfully adopted to analyze the quality of statistical and machine learning-based gene selection algorithms.

PDF - PASCAL Members only - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
ID Code:6295
Deposited By:Giorgio Valentini
Deposited On:08 March 2010

Available Versions of this Item