PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

A Supervised Discretization Method for Quantitative and Qualitative Ordered Variables
Francisco Ruiz, Cecilio Angulo and Núria Agell
Computación y Sistemas Volume 9, Number 4, 2006. ISSN 1405-5546

Abstract

In this work, a new technique to define cut-points in the discretization process of a continuous attribute is presented. This method is used as a prior step in a regression problem, considered as a learning problem in which the output variable can be either quantitative (continuous or discreet) or qualitative defined over an ordinal scale. The proposed method emphasizes the concept of location to determine discretization cut-points. In the case of continuous outputs, the method is based on the maximization of the difference between distributions by using intervalar distances. In the case of qualitative outputs, a qualitative distance is defined over a structure of absolute orders of magnitude. The main characteristics of the method presented are illustrated through three examples, two for continuous outputs and the last for a qualitative output.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Theory & Algorithms
ID Code:2939
Deposited By:Cecilio Angulo
Deposited On:26 December 2006