PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

gBoost: A mathematical programming approach to graph classification and regression
Hiroto Saigo, Sebastian Nowozin, Tadashi Kadowaki, Taku Kudo and Koji Tsuda
Machine Learning Volume 75, pp. 69-89, 2009.

Abstract

Graph mining methods enumerate frequently appearing subgraph patterns, which can be used as features for subsequent classification or regression. However, frequent pat- terns are not necessarily informative for the given learning problem. We propose a mathe- matical programming boosting method (gBoost) that progressively collects informative pat- terns. Compared to AdaBoost, gBoost can build the prediction rule with fewer iterations. To apply the boosting method to graph data, a branch-and-bound pattern search algorithm is developed based on the DFS code tree. The constructed search space is reused in later iterations to minimize the computation time. Our method can learn more efficiently than the simpler method based on frequent substructure mining, because the output labels are used as an extra information source for pruning the search space. Furthermore, by engineering the mathematical program, a wide range of machine learning problems can be solved without modifying the pattern search algorithm.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
ID Code:4401
Deposited By:Koji Tsuda
Deposited On:13 March 2009