PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Characterization of Lung tumor subtypes through gene expression cluster validity assessment
Giorgio Valentini and Francesca Ruffino
RAIRO - Theoretical Informatics and Applications Volume 40, pp. 163-176, 2006.


The problem of assessing the reliability of clusters patients identified by clustering algorithms is crucial to estimate the significance of subclasses of diseases detectable at bio-molecular level, and more in general to support bio-medical discovery of patterns in gene expression data. In this paper we present an experimental analysis of the reliability of clusters discovered in lung tumor patients using DNA microarray data. In particular we investigate if subclasses of lung adenocarcinoma can be detected with high reliability at bio-molecular level. To this end we apply cluster validity measures based on random projections recently proposed by Bertoni and coworkers. The results show that at least two subclasses of lung adenocarcinoma can be detected with relatively high reliability, confirming and extending previous findings reported in the literature.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
ID Code:2244
Deposited By:Giorgio Valentini
Deposited On:08 October 2006