Technical Program

Paper Detail

Paper: SP-P2.9
Session: Speaker Adaptation
Time: Tuesday, May 18, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Adaptation/Normalization
Title: AN INVESTIGATION INTO FRONT-END SIGNAL PROCESSING FOR SPEAKER NORMALIZATION
Authors: S. Umesh; Indian Institute of Technology, Kanpur 
 Rohit Sinha; Indian Institute of Technology, Kanpur 
 Bharath Kumar SV; General Electric - Global Research 
Abstract: Our investigation into front-end signal processing for maximum-likelihood-based speaker normalization reveals that, in the linear scaling model, it is more appropriate (and evidently more correct) to assume that the spectral envelopes of any two speakers for the same sound are linearly scaled versions of one another, rather than assuming that the whole magnitude spectra (including pitch harmonics) are scaled. The use of the proposed model and its implementation results in about 4% and 7% relative improvement for adults and children, respectively, on a digit recognition task.
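
Illustration (not from the paper): the linear scaling assumption in the abstract can be read as S_A(f) = S_B(alpha * f), applied to the spectral envelope rather than to the full magnitude spectrum. The sketch below only shows what linearly scaling an envelope along the frequency axis means. The function name warp_envelope, the warp factor alpha, and the uniform frequency grid are assumptions made for this example. How the envelope is separated from the pitch harmonics and how alpha is chosen by maximum likelihood are not described in the abstract and are not shown here.

```python
import math


def warp_envelope(envelope, alpha):
    """Sample a spectral envelope at alpha * f for every frequency bin f.

    `envelope` holds envelope magnitudes on a uniform frequency grid
    (bin k corresponds to frequency k * df). `alpha` is a hypothetical
    warp factor; values between bins use linear interpolation, and
    positions past the last bin are clamped to the last value.
    """
    n = len(envelope)
    warped = []
    for k in range(n):
        pos = min(alpha * k, n - 1)      # warped position on the original grid
        lo = int(math.floor(pos))        # bin just below the warped position
        hi = min(lo + 1, n - 1)          # bin just above, clamped at the edge
        frac = pos - lo                  # interpolation weight for the upper bin
        warped.append((1 - frac) * envelope[lo] + frac * envelope[hi])
    return warped


# Toy envelope with a single formant-like peak at bin 20 (assumed data).
toy_envelope = [math.exp(-((k - 20) / 3.0) ** 2) for k in range(64)]
scaled = warp_envelope(toy_envelope, 2.0)
peak_bin = scaled.index(max(scaled))     # bin where the scaled envelope peaks
```

With alpha = 2 the warped value at bin k is the original value at bin 2k, so a peak in the toy envelope moves to half its original bin index. With alpha = 1 the envelope is returned unchanged.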
 

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004