PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Text Categorization via Ellipsoid Separation
Andriy Kharechko, John Shawe-Taylor, Ralf Herbrich and Thore Graepel
In: Learning Methods for Text Understanding and Mining, 26 - 29 January 2004, Grenoble, France.

Abstract

We present a new batch learning algorithm for text classification in the vector space of document representations. The algorithm uses ellipsoid separation in the feature space which leads to a semidefinite program. An approximation of the latent semantic feature extraction approach using Gram-Schmidt orthogonalization is used for the feature extraction. Preliminary results demonstrate some potential for the presented approach.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Theory & Algorithms
Information Retrieval & Textual Information Access
ID Code:20
Deposited By:Steve Gunn
Deposited On:09 May 2004