Paper: SP-L6.1
Session: Feature Analysis for Speech Recognition
Time: Thursday, May 20, 13:00 - 13:20
Presentation: Lecture
Topic: Speech Processing: Speech Analysis
Title: NON-UNIFORM SPEAKER NORMALIZATION USING AFFINE-TRANSFORMATION
Authors: Bharath Kumar SV; General Electric - Global Research
Umesh S.; Indian Institute of Technology
Rohit Sinha; Indian Institute of Technology
Abstract: In this paper, we propose a mathematical model to describe the relation between the formant frequencies of speakers and show that, with the proposed affine model, speaker differences separate out as translation factors when a "mel-like" warping is performed. Using speech data, we estimate the parameters of this warping function and show that it is close to the usual mel formula. This model is motivated by the shift-based non-uniform speaker-normalization method of Rohit et al. [1], which provides improvement over conventional maximum-likelihood-based speaker-normalization methods. We therefore provide a unified mathematical framework that relates the relationship between the formants of speakers to the method of removing speaker differences (which involves mel-warping), and we substantiate it with our recognition experiments.
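
The central claim of the abstract, that an affine relation between formants turns into a pure translation after a mel-like warping, can be checked numerically with a small sketch. The abstract does not give the model's exact form or the estimated parameters, so everything specific here is an assumption made for illustration: the affine relation is taken to be f_A = alpha * f_B + (alpha - 1) * c, the warping is taken to be log(1 + f / c), and c = 700 Hz is borrowed from the commonly used mel formula rather than from the paper. The function names and the formant values are hypothetical.

```python
import math

# Assumed offset in Hz, borrowed from the common mel formula
# 2595 * log10(1 + f / 700); the paper estimates its own parameters from data.
OFFSET_HZ = 700.0


def mel_like_warp(freq_hz, offset_hz=OFFSET_HZ):
    """Mel-like warping log(1 + f / c), without the usual scale constant."""
    return math.log(1.0 + freq_hz / offset_hz)


def affine_map(freq_hz, alpha, offset_hz=OFFSET_HZ):
    """Assumed affine speaker relation: f_A = alpha * f_B + (alpha - 1) * c."""
    return alpha * freq_hz + (alpha - 1.0) * offset_hz


# Hypothetical speaker-dependent factor and formant frequencies (Hz).
alpha = 1.15
formants_b = [500.0, 1500.0, 2500.0]
formants_a = [affine_map(f, alpha) for f in formants_b]

# In the warped domain, each formant pair should differ by the same constant,
# because 1 + f_A / c = alpha * (1 + f_B / c) under the assumed relation.
shifts = [mel_like_warp(fa) - mel_like_warp(fb)
          for fa, fb in zip(formants_a, formants_b)]

for fb, fa, s in zip(formants_b, formants_a, shifts):
    print(f"f_B = {fb:7.1f} Hz  f_A = {fa:7.1f} Hz  warped shift = {s:.6f}")
print(f"log(alpha) = {math.log(alpha):.6f}")
```

Under these assumptions, every formant pair shows the same warped-domain shift, equal to log(alpha), which is the sense in which the speaker difference becomes a translation factor.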