PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Protein function prediction via graph kernels
Karsten M. Borgwardt, Cheng Soon Ong, Stefan Schoenauer, S V N Vishwanathan, Alex Smola and Hans-Peter Kriegel
Bioinformatics Volume 21, Number Supplement 1, i47-i56, 2005. ISSN 1460-2059


Motivation: Computational approaches to protein function prediction infer protein function by finding proteins with similar sequence, structure, surface clefts, chemical properties, amino acid motifs, interaction partners or phylogenetic profiles. We present a new approach that combines sequential, structural and chemical information into one graph model of proteins. We predict functional class membership of enzymes and non-enzymes using graph kernels and Support Vector Machine classification on these protein graphs. Results: Our graph model, derivable from protein sequence and structure only, is competitive with vector models that require additional protein information such as the size of surface pockets. If we include this extra information into our graph model, our classifier yields significantly higher accuracy levels than the vector models. Hyperkernels allow us to select and to optimally combine the most relevant node attributes in our protein graphs. We have laid the foundation for a protein function prediction system that integrates protein information from various sources efficiently and effectively.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
ID Code:1269
Deposited By:Cheng Soon Ong
Deposited On:28 November 2005