Technical Program

Paper Detail

Paper: SP-P3.5
Session: Topics in Speaker and Language Recognition
Time: Tuesday, May 18, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Speaker Recognition
Title: ENHANCEMENT OF MISMATCHED CONDITIONS IN SPEAKER RECOGNITION FOR MULTIMEDIA APPLICATIONS
Authors: Waleed Fakhr; Arab Academy for Science & Technology 
 Ahmed Abdelsalam; Arab Academy for Science & Technology 
 Nadder Hamdy; Arab Academy for Science & Technology 
Abstract: This paper investigates the performance of an HMM-based text-independent speaker recognition system under different model and feature combinations for matched and mismatched speech coding conditions. The effects of changing the HMM topology and the acoustic features are investigated first. Training and testing the models using only the voiced segments of the samples is then considered. The best model structure in each topology is then used to test how speech codecs used in multimedia applications, such as G.729 at 8 kb/s and G.723.1 at 5.3 and 6.3 kb/s, affect performance in both matched and mismatched conditions. To improve performance in mismatched conditions, two techniques are investigated: MAP-based adaptation with different amounts of coded training data, and a diagonal affine transform that adapts the coded cepstral features to the original PCM cepstral features. Results show that the proposed techniques improve speaker recognition performance and produce results comparable to those of the matched-condition test.
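Illustration: the abstract does not say how the diagonal affine transform is estimated. The sketch below is only one plausible reading, assuming time-aligned pairs of coded and PCM cepstral frames and a per-coefficient least-squares fit. The function names fit_diagonal_affine and apply_diagonal_affine are hypothetical and are not taken from the paper.

```python
from statistics import fmean


def fit_diagonal_affine(coded, pcm):
    """Fit a per-dimension scale a[d] and offset b[d] by least squares,
    so that a[d] * coded[t][d] + b[d] approximates pcm[t][d].

    Assumption (not from the paper): coded and pcm are time-aligned
    lists of cepstral frames with the same length and dimensionality.
    """
    if not coded or len(coded) != len(pcm):
        raise ValueError("need equally many time-aligned coded and PCM frames")
    dims = len(coded[0])
    scales, offsets = [], []
    for d in range(dims):
        x = [frame[d] for frame in coded]
        y = [frame[d] for frame in pcm]
        mean_x, mean_y = fmean(x), fmean(y)
        var_x = sum((xi - mean_x) ** 2 for xi in x)
        cov_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        # Fall back to a pure offset when the coded coefficient is constant.
        a = cov_xy / var_x if var_x > 0 else 1.0
        scales.append(a)
        offsets.append(mean_y - a * mean_x)
    return scales, offsets


def apply_diagonal_affine(frames, scales, offsets):
    """Map coded cepstral frames toward the PCM feature space."""
    return [[a * v + b for v, a, b in zip(frame, scales, offsets)]
            for frame in frames]
```

"Diagonal" here means that each cepstral coefficient is scaled and shifted independently, so a D-dimensional feature vector needs only 2D parameters instead of a full D-by-D matrix plus an offset vector.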
 

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004