Technical Program

Paper Detail

Paper:	SP-P14.4
Session:	Acoustic Modeling: Tone, Prosody, and Features
Time:	Thursday, May 20, 15:30 - 17:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	DECISION TREE BASED TONE MODELING FOR CHINESE SPEECH RECOGNITION
Authors:	Pui-Fung Wong; Hong Kong University of Science and Technology
	Man-Hung Siu; Hong Kong University of Science and Technology
Abstract:	Because of the tonal nature of Chinese languages, correct recognition of lexical tones is necessary for Chinese speechrecognition. In order to corporate tone information into Chinese speech recognition, three issues need to be addressed: i) the representation of the syllable pitch contour as well as the tone contour, ii) the lexical tone probability estimation and iii) the integration of tone probabilities into the Viterbi recognition process. In this paper we propose a robust polynomial segmental representation of the pitch contour coupled with a decision tree based tone classifier. We also propose a novel approach of integrating the decision tree tone classifier directly into asingle pass recognition process. The proposed approaches were evaluated on tasks of tone classification and tonal-syllable recognition. In regard to tone classification, the robust decision tree gave a tone classification accuracy of 89% for isolated syllables and 71.2% for the continuous speech. Moreover, by incorporating the decision tree tone classifier into the Viterbi search, the tonal-syllable recognition error rate in continuous speech was reduced by 19.71%.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004