Paper: | SP-P3.5 | ||
Session: | Topics in Speaker and Language Recognition | ||
Time: | Tuesday, May 18, 15:30 - 17:30 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Speaker Recognition | ||
Title: | ENHANCEMENT OF MISMATCHED CONDITIONS IN SPEAKER RECOGNITION FOR MULTIMEDIA APPLICATIONS | ||
Authors: | Waleed Fakhr; Arab Academy for Science & Technology | ||
Ahmed Abdelsalam; Arab Academy for Science & Technology | |||
Nadder Hamdy; Arab Academy for Science & Technology | |||
Abstract: | This paper investigates the performance of an HMM-based text-independent speaker recognition system under different model and feature combinations for matched and mismatched speech coding conditions. The effects of changing the HMM topology and the acoustic features are investigated first. Training and testing the models using only the voiced segments of the samples is then considered. The best model structure in each topology is then used to test how speech codecs used in multimedia applications, such as G.729 at 8 kb/s and G.723.1 at 5.3 and 6.3 kb/s, affect performance in both matched and mismatched conditions. To improve performance in mismatched conditions, two techniques are investigated: MAP-based adaptation with different amounts of coded training data, and a diagonal affine transform that adapts the coded cepstral features to the original PCM cepstral features. Results show that the proposed techniques improve speaker recognition performance and produce results comparable to the matched-condition test. | ||
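As an illustration of the diagonal affine transform mentioned in the abstract, the sketch below fits one scale and one offset per cepstral dimension so that scale * coded + offset approximates the PCM feature. The abstract does not say how the transform is estimated; the least-squares fit on time-aligned coded/PCM frame pairs used here is an assumption, and the function names `fit_diagonal_affine` and `apply_diagonal_affine` are hypothetical.

```python
def fit_diagonal_affine(coded, pcm):
    """Fit a per-dimension scale and offset mapping coded features to PCM features.

    coded, pcm: lists of equal-length frames (lists of floats), assumed to be
    time-aligned pairs of the same utterances. This pairing and the
    least-squares criterion are assumptions, not details from the abstract.
    """
    n = len(coded)
    dims = len(coded[0])
    scales, offsets = [], []
    for d in range(dims):
        x = [frame[d] for frame in coded]
        y = [frame[d] for frame in pcm]
        mean_x = sum(x) / n
        mean_y = sum(y) / n
        var_x = sum((xi - mean_x) ** 2 for xi in x)
        cov_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        # Fall back to an identity scale if this dimension is constant.
        a = cov_xy / var_x if var_x > 0 else 1.0
        scales.append(a)
        offsets.append(mean_y - a * mean_x)
    return scales, offsets


def apply_diagonal_affine(frames, scales, offsets):
    """Apply the fitted transform independently to each cepstral dimension."""
    return [[a * v + b for v, a, b in zip(frame, scales, offsets)]
            for frame in frames]
```

"Diagonal" here means each cepstral coefficient is corrected independently, so the fit needs only two parameters per dimension rather than a full matrix.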