Paper: SP-P14.7
Session: Acoustic Modeling: Tone, Prosody, and Features
Time: Thursday, May 20, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Acoustic Modeling for Speech Recognition
Title: CHINESE-ENGLISH BILINGUAL PHONE MODELING FOR CROSS-LANGUAGE SPEECH RECOGNITION
Authors: Shengmin Yu; Chinese Academy of Sciences
Shuwu Zhang; Chinese Academy of Sciences
Bo Xu; Chinese Academy of Sciences
Abstract: In this paper, three approaches to Chinese-English bilingual phone modeling are investigated and compared. The first approach simply combines the Chinese and English phone inventories, with no phones shared across the two languages. The second maps language-dependent phones to the inventory of the International Phonetic Association (IPA), based on phonetic knowledge, to construct the bilingual phone inventory. The third merges the language-dependent phone models using a hierarchical phone clustering algorithm to obtain a compact bilingual inventory. In the third approach, two distance measures are used to perform the bottom-up clustering: the Bhattacharyya distance and an acoustic likelihood distance. Experimental results show that the phone clustering approach outperforms the IPA-based phone mapping approach. It also achieves performance comparable to the simple combination of language-dependent phone inventories while using fewer model parameters, especially when the acoustic likelihood distance is used.
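
The bottom-up clustering with the Bhattacharyya distance mentioned in the abstract can be illustrated roughly as follows. This is only a minimal sketch under strong assumptions, not the authors' implementation: each phone is represented by a single diagonal-covariance Gaussian (the abstract does not specify the model structure), merged clusters are re-estimated by size-weighted moment matching, and the phone names, feature values, and function names (`bhattacharyya`, `merge`, `cluster_phones`) are hypothetical. The acoustic likelihood distance is not shown.

```python
import math
from itertools import combinations


def bhattacharyya(m1, v1, m2, v2):
    """Bhattacharyya distance between two diagonal-covariance Gaussians."""
    d = 0.0
    for a, va, b, vb in zip(m1, v1, m2, v2):
        v = 0.5 * (va + vb)                           # averaged variance
        d += 0.125 * (a - b) ** 2 / v                 # mean-separation term
        d += 0.5 * math.log(v / math.sqrt(va * vb))   # variance-mismatch term
    return d


def merge(c1, c2):
    """Merge two clusters by size-weighted moment matching (an assumption)."""
    names1, m1, v1, n1 = c1
    names2, m2, v2, n2 = c2
    n = n1 + n2
    mean = [(n1 * a + n2 * b) / n for a, b in zip(m1, m2)]
    var = [
        (n1 * (va + a * a) + n2 * (vb + b * b)) / n - mu * mu
        for a, va, b, vb, mu in zip(m1, v1, m2, v2, mean)
    ]
    return (names1 + names2, mean, var, n)


def cluster_phones(phones, target_size):
    """Greedily merge the closest pair of clusters until target_size remain."""
    clusters = [((name,), list(m), list(v), 1) for name, (m, v) in phones.items()]
    while len(clusters) > target_size:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: bhattacharyya(
                clusters[p[0]][1], clusters[p[0]][2],
                clusters[p[1]][1], clusters[p[1]][2],
            ),
        )
        merged = merge(clusters[i], clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return [set(c[0]) for c in clusters]


# Hypothetical one-dimensional toy models: name -> (mean vector, variance vector).
toy_phones = {
    "zh_a": ([0.0], [1.0]),
    "en_aa": ([0.1], [1.0]),
    "zh_s": ([5.0], [1.0]),
    "en_s": ([5.2], [1.0]),
}
bilingual_inventory = cluster_phones(toy_phones, target_size=2)
```

In this toy setup the target inventory size is the knob that trades compactness against modeling detail: a smaller `target_size` forces more cross-language sharing and fewer model parameters.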