PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Extensions of absolute discounting (kneser-ney method).
Jesús Andrés-Ferrer and Hermann Ney
In: 34th IEEE International Conference on Acoustics, Speech, and Signal Processing, 19-24 April 2009, Taiwan.


The problem of estimating the parameters of an n-gram language model is a typical problem of estimating small probabilities. So far, two methods have been proposed and used to handle this problem: 1. the empirical Bayes method resulting in the Turing-Good estimates. Theses estimates do not have any constraints and tend to be very noisy. 2. discounting models like absolute (or linear) discounting. The discounting models are heavily constrained and typically have only a single free parameter. Both methods can be formulated in a leaving-one-out framework. In this paper, we study methods that lie between these two extremes. We design models with various types of constraints and derive efficient algorithms for estimating the parameters of these models. We propose two novel types of constraints or models: interval constraints and the exact extended Kneser-Ney model. The proposed methods are implemented and applied to language modelling in order to compare the methods in terms of perplexities. The results show that the new constrained methods outperforms other unconstrained methods.

EPrint Type:Conference or Workshop Item (Oral)
Additional Information:This paper was the winner of one of the two the IEEE Spoken Language Processing Student Travel Grant
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Natural Language Processing
ID Code:5662
Deposited By:Alfons Juan
Deposited On:08 March 2010