Tuning Support Vector Machines for Robust Phoneme Classification with Acoustic Waveforms
Jibran Yousafzai, Zoran Cvetkovic and Peter Sollich
In: INTERSPEECH 2009, 6-10 Sept. 2009, Brighton, United Kingdom.
This work focuses on the robustness of phoneme classification to additive noise in the acoustic waveform domain using support vector machines (SVMs). We address the issue of designing kernels for acoustic waveforms which imitate the state-of-the-art representations such as PLP and MFCC and are tuned to the physical properties of speech. For comparison, classification results in the PLP representation domain with cepstral mean-and-variance normalization (CMVN) using standard kernels are also reported. It is shown that our custom-designed kernels achieve better classification performance at high noise. Finally, we combine the PLP and acoustic waveform representations to attain better classification than either of the individual representations over the entire range of noise levels tested, from quiet condition up to -18dB SNR.