Paper: | SP-L9.2 | ||
Session: | Robust Features for Speech Recognition | ||
Time: | Friday, May 21, 13:20 - 13:40 | ||
Presentation: | Lecture | ||
Topic: | Speech Processing: Robust Speech Recognition | ||
Title: | HIGHER ORDER CEPSTRAL MOMENT NORMALIZATION (HOCMN) FOR ROBUST SPEECH RECOGNITION | ||
Authors: | Chang-wen Hsu; National Taiwan University | ||
Lin-shan Lee; National Taiwan University | |||
Abstract: | Cepstral mean subtraction (CMS) and cepstral normalization (CN) have been popularly used to normalize the first and the second moments of cepstral coefficients, and proved to be very helpful for robust speech recognition [1, 2]. In this paper, a unified formulation for Higher Order Cepstral Moment Normalization (HOCMN) is developed by extending the concept of CMS and CN to orders much higher than three. A whole family of normalization techniques for different orders is thus proposed. Preliminary experimental results based on Aurora 2.0 showed that the recognition accuracy can be significantly improved with this approach under all noisy conditions. For example, HOCMN[1,5,100] (normalization of the first, fifth and 100-th order cepstral moments) is shown to offer an error rate reduction of 32.83% as compared to the conventional CN with a full-utterance processing interval, or an error rate reduction of 20.78% as compared to CN with a segmental processing interval. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops