PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Detecting Triangle Inequality Violations in Internet Coordinate Systems by Supervised Learning
Yongjun Liao, Mohamed Ali Kaafar, Bamba Gueye, François Cantin, Pierre Geurts and Guy Leduc
Lecture Notes in Computer Science: NETWORKING 2009 Volume 5550, pp. 352-363, 2009. ISSN 1611-3349


Internet Coordinates Systems (ICS) are used to predict Internet distances with limited measurements. However the precision of an ICS is degraded by the presence of Triangle Inequality Violations (TIVs). Simple methods have been proposed to detect TIVs, based e.g. on the empirical observation that a TIV is more likely when the distance is underestimated by the coordinates. In this paper, we apply supervised machine learning techniques to try and derive more powerful criteria to detect TIVs. We first show that (ensembles of) Decision Trees (DTs) learnt on our datasets are very good models for this problem. Moreover, our approach brings out a discriminative variable (called OREE), which combines the classical estimation error with the variance of the estimated distance. This variable alone is as good as an ensemble of DTs, and provides a much simpler criterion. If every node of the ICS sorts its neighbours according to OREE, we show that cutting these lists after a given number of neighbours, or when OREE crosses a given threshold value, achieves very good performance to detect TIVs.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
ID Code:6031
Deposited By:Pierre Geurts
Deposited On:08 March 2010