Technical Program

Paper Detail

Paper:	SP-P14.7
Session:	Acoustic Modeling: Tone, Prosody, and Features
Time:	Thursday, May 20, 15:30 - 17:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	CHINESE-ENGLISH BILINGUAL PHONE MODELING FOR CROSS-LANGUAGE SPEECH RECOGNITION
Authors:	Shengmin Yu; Chinese Academy of Sciences
	Shuwu Zhang; Chinese Academy of Sciences
	Bo Xu; Chinese Academy of Sciences
Abstract:	In this paper, three different approaches of Chinese-English bilingual phone modeling are investigated and compared. The first approach is to simply combine Chinese and English phone inventories together without phone shared across the languages. The second one is to map language-dependent phones to the inventory of the International Phonetic Association (IPA) based on phonetic knowledge to construct the bilingual phone inventory. The third one is to merge the language-dependent phone models by hierarchical phone clustering algorithm to get a compact bilingual inventory. In the third approach, two distance measures are used to perform the bottom-up clustering. One is Bhattacharyya distance. The other is acoustic likelihood distance. Experimental results show that phone clustering approach outperforms IPA-based phone mapping approach, and it can also achieve comparable performance to the simple combination of language-dependent phone inventories with less model parameters, especially when using acoustic likelihood distance measurement.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004