PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Morfessor in the Morpho Challenge
Mathias Creutz and Krista Lagus
In: PASCAL Challenge Workshop on Unsupervised Segmentation of Words into Morphemes, 12 Apr 2006, Venice, Italy.

Abstract

This work outlines Morfessor, a morpheme segmentation model and algorithm developed by the organizers of the Morpho Challenge, and refers to earlier work. Although Morfessor does not take part in the official Challenge competition, we report experimental results for the morpheme segmentation of English, Finnish and Turkish words. Morfessor outperforms the other algorithms in the Finnish and Turkish tasks and comes second in the English task. In the Finnish speech recognition task, Morfessor achieves the lowest letter error rate.

EPrint Type: Conference or Workshop Item (Paper)
Subjects: Computational, Information-Theoretic Learning with Statistics; Natural Language Processing; Speech
ID Code: 2392
Deposited By: Mathias Creutz
Deposited On: 22 November 2006