Relevance and Redundancy Analysis for Ensemble Classifiers
R Duangsoithong and Terry Windeatt
In: MLDM 2009, Leipzig, Germany(2009).
In machine learning systems, especially in medical applications,clinical datasets usually contain high dimensional feature spaces with relatively few samples that lead to poor classifier performance. To overcome this problem, feature selection and ensemble classification are applied in order to improve accuracy and stability. This research presents an analysis of the effect of removing irrelevant and redundant
features with ensemble classifiers using five datasets and compared with floating search method. Eliminating redundant features provides better accuracy and computational time than removing irrelevant features of the ensemble.