PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Visualized atlas of a gene expression databank
Jarkko Venna and Samuel Kaski
In: KRBIO'05, 16 June 2005, Espoo, Finland.


We construct an atlas of a gene expression databank, to visualize similarity relationships between expression data sets. Such an atlas could be used as an interface to the databank, for users searching for relevant background data or data for their own in-silico analyses. The two main research problems in constructing an atlas are (1) to preprocess the data to make different sets commensurable, and (2) to visualize the data. In this work we use only very simple preprocessing to study its feasibility, and focus on the visualization. We compare several recently introduced methods in the task, and show that a method called curvilinear components analysis outperforms the newer ones in terms of trustworthiness of the projections. The visualizations reveal the main sources of variation in the data, namely the differences between data sets, different labs, and different measurement methods, which supports feasibility of the visualization method in the task. The other conclusion is that better methods are needed for making the data sets commensurable.

EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
Information Retrieval & Textual Information Access
ID Code:949
Deposited By:Samuel Kaski
Deposited On:02 March 2005