PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Adaptive XML Tree Classification on Evolving Data Streams
Albert Bifet and Ricard Gavaldà
In: Machine Learning and Knowledge Discovery in Databases European Conference, ECML PKDD 2009, September 7-11, 2009, Bled, Slovenia.

Abstract

We propose a new method to classify patterns, using closed and maximal frequent patterns as features. Generally, classification requires a previous mapping from the patterns to classify to vectors of features, and frequent patterns have been used as features in the past. Closed patterns maintain the same information as frequent patterns using less space and maximal patterns maintain approximate information. We use them to reduce the number of classification features. We present a new framework for XML tree stream classification. For the first component of our classification framework, we use closed tree mining algorithms for evolving data streams. For the second component, we use state of the art classification methods for data streams. To the best of our knowledge this is the first work on tree classification in streaming data varying with time. We give a first experimental evaluation of the proposed classification method.

EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Theory & Algorithms
ID Code:5622
Deposited By:Albert Bifet
Deposited On:08 March 2010