PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

From outliers to prototypes: Ordering data
Stefan Harmeling, Guido Dornhege, David Tax, Frank Meinecke and Klaus-Robert Müller
Neurocomputing 2005.


We propose simple and fast methods based on nearest neighbors that order objects from high-dimensional data sets from typical points to untypical points. On the one hand, we show that these easy-to-compute orderings allow us to detect outliers (i.e.~very untypical points) with a performance comparable to or better than other often much more sophisticated methods. On the other hand, we show how to use these orderings to detect prototypes (very typical points) which facilitate exploratory data analysis algorithms such as noisy nonlinear dimensionality reduction and clustering. Comprehensive experiments demonstrate the validity of our approach.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
ID Code:1909
Deposited By:Stefan Harmeling
Deposited On:29 December 2005