PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

A probabilistic model for generating realistic lip movements from speech
Gwenn Englebienne, Tim F Cootes and Magnus Rattray
In: Advances in Neural Information Processing Systems (2008) MIT Press .

Abstract

The present work aims to model the correspondence between facial motion and speech. The face and sound are modelled separately, with phonemes being the link between both. We propose a sequential model and evaluate its suitability for the generation of the facial animation from a sequence of phonemes, which we obtain from speech. We evaluate the results both by computing the error between generated sequences and real video, as well as with a rigorous double-blind test with human subjects. Experiments show that our model compares favourably to other existing methods and that the sequences generated are comparable to real video sequences.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Book Section
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Machine Vision
Speech
Multimodal Integration
ID Code:3502
Deposited By:Magnus Rattray
Deposited On:24 March 2009