An algorithm to assess the reliability of hierarchical clusters in gene expression data
R. Avogadri, M. Brioschi, F. Ruffino, F. Ferrazzi, A. Beghini and Giorgio Valentini
Knowledge-Based Intelligent Information and Engineering Systems, 12th International Conference
Springer, Lecture Notes in Computer Science
The validation of clusters discovered in bio-molecular data is
a central issue in bioinformatics. Recently, stability-based methods have been successfully applied to the analysis of the reliability of clusterings characterized by a relatively low number of examples and clusters. Nevertheless, several problems in functional genomics are characterized by a very large number of examples and clusters. We present a stability-based algorithm to discover significant clusters in hierarchical clusterings with
a large number of examples and clusters. Preliminary results on gene expression data of patients affected by Human Myeloid Leukemia, show how to apply the proposed method when thousands of gene clusters are involved.