PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Making Faces -- State Space Models applied to multi modal signal processing
Tue Lehn-Schiøler
(2005) PhD thesis, The Technical University of Denmark.

Abstract

The two main focus areas of this thesis are State-Space Models and multi modal signal processing. The general State-Space Model is investigated, and an addition to the class of sequential sampling methods is proposed: the Parzen Particle Filter. Furthermore, the Markov Chain Monte Carlo (MCMC) approach to filtering is examined, and a scheme for using MCMC in on-line applications is proposed. For parameter estimation, it is shown that the EM algorithm converges slowly, especially in the low-noise limit, and it is demonstrated how a general gradient optimizer can be applied to speed up convergence. The linear version of the State-Space Model, the Kalman Filter, is applied to multi modal signal processing, and it is demonstrated how a State-Space Model can be used to map from speech to lip movements. Besides the State-Space Model and the multi modal application, an information theoretic vector quantizer is also proposed. Based on interactions between particles, it is shown how a quantizing scheme based on an analytic cost function can be derived.

EPrint Type: Thesis (PhD)
Project Keyword: UNSPECIFIED
Subjects: Computational, Information-Theoretic Learning with Statistics
          User Modelling for Computer Human Interaction
          Learning/Statistics & Optimisation
          Multimodal Integration
ID Code: 1819
Deposited By: Tue Lehn-Schiøler
Deposited On: 28 November 2005