PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Online Suffix Trees with Counts
Breanndan O Nuallain and Steven de Rooij
In: Data Compression Conference, 23 March - 25 March 2004, Snowbird, Utah, USA.

Abstract

We extend Ukkonen's online suffix tree construction algorithm to support frequency queries, by adding count fields to the internal nodes of the tree, as suggested in the PhD thesis of N. Jesper Larsson. This may be useful in the field of sequential data compression, for example in the implementation of an efficient PPM-style compression algorithm with unbounded context length. We show that due to the onlineness requirement the algorithm's worst case time complexity is O(n^2). However, our approach remains acceptable since its average case performance can be shown to be ~ n log n under some reasonable assumptions.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Spotlight)
Additional Information:A full article on this material has been submitted to Algorithmica.
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
Information Retrieval & Textual Information Access
ID Code:131
Deposited By:Steven de Rooij
Deposited On:28 May 2004