Technical Program

Paper Detail

Paper:SP-P16.6
Session:Speech Modeling for Robust Speech Recognition
Time:Friday, May 21, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Robust Speech Recognition
Title: MULTI-ENVIRONMENT MODELS BASED LINEAR NORMALIZATION FOR SPEECH RECOGNITION IN CAR CONDITIONS
Authors: Luis Buera; University of Zaragoza 
 Eduardo Lleida; University of Zaragoza 
 Antonio Miguel; University of Zaragoza 
 Alfonso Ortega; University of Zaragoza 
Abstract: In this paper a multi-environment adaptation technique based on minimum mean squared error estimation is proposed. MEMLIN, Multi-Environment Models based LInear Normalization, consists on a feature adaptation using stereo data and several basic defined environments. The target of this algorithm is to learn the difference between clean and noisy feature vectors associated to a pair of gaussians (one for a clean model, and the other one for a noisy model), for each basic environment. This knowledge, the gaussians associated, the conditional probability between clean and noisy gaussians, and the environments are the data used to compensate the mismatch between clean and noisy vectors. This algorithm obtains important improvements regarding other techniques that look for similar targets. The experimental results with the SpeechDat Car database shows an average improvement of more than 68%, concerning the baseline, over 7 different defined environments.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004