PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Computing Atom Mappings for Biochemical Reactions without Subgraph Isomorphism
Markus Heinonen, Sampsa Lappalainen, Taneli Mielikäinen and Juho Rousu
Journal of Computational Biology Volume 18, Number 1, pp. 43-58, 2011.

Abstract

The ability to trace the fate of individual atoms through the metabolic pathways is needed in many applications of systems biology and drug discovery. However, this information is not immediately available from the most common metabolome studies and needs to be separately acquired. Automatic discovery of correspondence of atoms in biochemical reactions is called the “atom mapping problem.” We suggest an efficient approach for solving the atom mapping problem exactly—finding mappings of minimum edge edit distance. The algorithm is based on A* search equipped with sophisticated heuristics for pruning the search space. This approach has clear advantages over the commonly used heuristic approach of iterative maximum common subgraph (MCS) algorithm: we explicitly minimize an objective function, and we produce solutions that typically require less manual curation. The two methods are similar in computational resource demands. We compare the performance of the proposed algorithm against several alternatives on data obtained from the KEGG LIGAND and RPAIR databases: greedy search, bi-partite graph matching, and the MCS approach. Our experiments show that alternative approaches often fail in finding mappings with minimum edit distance.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
ID Code:8117
Deposited By:Juho Rousu
Deposited On:22 April 2011