Paper: | SP-P2.7 | ||
Session: | Speaker Adaptation | ||
Time: | Tuesday, May 18, 13:00 - 15:00 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Adaptation/Normalization | ||
Title: | PRIOR KNOWLEDGE GUIDED MEL BASED MODEL SELECTION AND ADAPTATION FOR NONNATIVE SPEECH RECOGNITION | ||
Authors: | Xiaodong He; Microsoft | ||
Yunxin Zhao; University of Missouri-Columbia | |||
Abstract: | In this paper, an improved method of model complexity selection for nonnative speech recognition is proposed by using maximum a posteriori estimation of bias distributions. An algorithm is described for estimating the hyper-parameters of the prior distributions, and an automatic accent detection algorithm is also proposed for integration with dynamic model selection and adaptation. Experiments were performed on the WSJ1 task with American English speech, British accent speech, and mandarin Chinese accent speech. Results show that the use of prior knowledge of accents enabled reliable estimation of bias distributions in the case of very small amount of adaptation speech, or without adaptation speech. Recognition results show that the new approach is superior to the previous MEL method, especially when the adaptation data are extremely limited. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops