PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis
Yanqiu Shao, Zhuoran Wang, Jiqing Han and Ting Liu
In: The First International Conference on Affective Computing and Intelligent Interaction, 22-24 Oct 2005, Beijing, China.

Abstract

Both of the prosody and spectral features are important for emotional speech synthesis. Besides prosody effects, voice quality and articulation parameters are the factors that should be considered to modify in emotional speech synthetic systems. Generally, rules and filters are designed to process these parameters respectively. This paper proves that by modifying spectral envelope, the voice quality and articulation could be adjusted as a whole. Thus, it will not need to modify each of the parameter separately depending on rules. Accordingly, it will make the synthetic system more flexible by designing an automatic spectral envelope model based on some machine learning methods. The perception test in this paper also shows that when prosody and spectral features are all modified, the best emotional synthetic speech will be obtained.

EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Speech
ID Code:7032
Deposited By:Zhuoran Wang
Deposited On:05 December 2010