Paper: | SP-P15.11 | ||
Session: | Robustness in Noisy Environments | ||
Time: | Friday, May 21, 15:30 - 17:30 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Robust Speech Recognition | ||
Title: | A TREE-STRUCTURED CLUSTERING METHOD INTEGRATING NOISE AND SNR FOR PIECEWISE LINEAR-TRANSFORMATION-BASED NOISE ADAPTATION | ||
Authors: | Zhipeng Zhang; NTT DoCoMo | ||
Toshiaki Sugimura; NTT DoCoMo | |||
Sadaoki Furui; Tokyo Institute of Technology | |||
Abstract: | This paper proposes the application of a tree-structured clustering method that integrates the effects of noise as well as SNR variation in the framework of piecewise-linear transformation (PLT)-based noise adaptation for robust speech recognition. According to the clustering results, a noisy speech HMM is made for each node of the tree structure. An HMM that best matches the input speech is selected based on the likelihood maximization criterion by tracing the tree downward from the top (root), and the selected HMM is further adapted by linear transformation. The proposed method is evaluated by applying it to a Japanese dialogue recognition system. Experimental results confirm that the proposed method is effective in recognizing numerically noise-added speech and actual noisy speech uttered by a wide range of speakers under various noise conditions. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops