Technical Program

Paper Detail

Paper:	SP-P14.5
Session:	Acoustic Modeling: Tone, Prosody, and Features
Time:	Thursday, May 20, 15:30 - 17:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	HIDDEN SPECTRAL PEAK TRAJECTORY MODEL FOR PHONE CLASSIFICATION
Authors:	Yiu-Pong Lai; Hong Kong University of Science and Technology
	Man-Hung Siu; Hong Kong University of Science and Technology
Abstract:	It is well known that spectrogram readers can classify different phones from their spectral-time characteristics, such as the formants. In this paper we present a novel acoustic model for phone classification based on the implicit estimation of the spectral peak trajectory as a polynomial time function. By making use of the known relationship between the spectral peak information and the cepstral coefficients, cepstral-based phone trajectories are built as functions of the hidden spectral trajectories. This captures the intuitive formant trajectories in the spectral domain while allowing speech modeling to be done in the more familiar cepstral domain. We have evaluated this hidden spectral peak trajectory model on both vowel classification and phone classification tasks. With a simple single-Gaussian model, the hidden spectral peak trajectory model outperforms the HMM on both tasks. The new model can also be combined with the HMM. This combination performs better than a more complex HMM with a similar number of parameters.
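
The idea of building cepstral trajectories as functions of hidden spectral peak trajectories can be illustrated with a minimal sketch. It assumes each peak frequency follows a polynomial in normalized time and uses the standard all-pole resonator relationship between peak frequencies, bandwidths, and cepstral coefficients, c_n = sum_k (2/n) exp(-pi n b_k / fs) cos(2 pi n f_k / fs). Whether the paper uses exactly this mapping is an assumption; the function names, fixed bandwidths, sampling rate, and cepstral order are all hypothetical choices made for illustration.

```python
import numpy as np

def peak_trajectory(coeffs, num_frames):
    """Evaluate a polynomial time function on normalized time t in [0, 1].

    coeffs are ordered highest power first, as np.polyval expects.
    """
    t = np.linspace(0.0, 1.0, num_frames)
    return np.polyval(coeffs, t)

def peaks_to_cepstrum(freqs_hz, bandwidths_hz, num_ceps, sample_rate_hz):
    """Map spectral peaks to cepstral coefficients c_1..c_num_ceps.

    Assumed mapping (all-pole resonator model, not taken from the paper):
    c_n = sum_k (2/n) * exp(-pi*n*b_k/fs) * cos(2*pi*n*f_k/fs)
    """
    n = np.arange(1, num_ceps + 1)[:, None]              # shape (num_ceps, 1)
    f = np.asarray(freqs_hz, dtype=float)[None, :]       # shape (1, num_peaks)
    b = np.asarray(bandwidths_hz, dtype=float)[None, :]  # shape (1, num_peaks)
    damping = np.exp(-np.pi * n * b / sample_rate_hz)
    oscillation = np.cos(2.0 * np.pi * n * f / sample_rate_hz)
    return ((2.0 / n) * damping * oscillation).sum(axis=1)

def cepstral_trajectory(poly_coeffs_per_peak, bandwidths_hz, num_frames,
                        num_ceps=12, sample_rate_hz=16000.0):
    """Build a (num_frames, num_ceps) cepstral trajectory from hidden peaks."""
    # One column per spectral peak, one row per frame.
    peaks = np.stack(
        [peak_trajectory(c, num_frames) for c in poly_coeffs_per_peak], axis=1
    )
    return np.stack(
        [peaks_to_cepstrum(p, bandwidths_hz, num_ceps, sample_rate_hz)
         for p in peaks]
    )

# Hypothetical example: two peaks with linear trajectories over 10 frames.
example = cepstral_trajectory(
    poly_coeffs_per_peak=[[200.0, 500.0], [-300.0, 1500.0]],
    bandwidths_hz=[80.0, 120.0],
    num_frames=10,
)
```

In a classifier along these lines, the polynomial coefficients would be the hidden quantities estimated per phone segment, and the resulting cepstral trajectory would serve as the time-varying mean of a Gaussian over the observed cepstra. That estimation step is not shown here.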


©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004