Paper: | SP-L2.2 | ||
Session: | Modeling Approaches in Speaker Recognition | ||
Time: | Wednesday, May 19, 09:50 - 10:10 | ||
Presentation: | Lecture | ||
Topic: | Speech Processing: Speaker Recognition | ||
Title: | PARAMETER SHARING AND MINIMUM CLASSIFICATION ERROR TRAINING OF MIXTURES OF FACTOR ANALYZERS FOR SPEAKER IDENTIFICATION | ||
Authors: | Hiroyoshi Yamamoto; Nagoya Institute of Technology | ||
Yoshihoko Nankaku; Nagoya Institute of Technology | |||
Chiyomi Miyajima; Nagoya University | |||
Keiichi Tokuda; Nagoya Institute of Technology | |||
Tadashi Kitamura; Nagoya Institute of Technology | |||
Abstract: | This paper investigates the parameter tying strategies of mixtures of factor analyzers (MFA) and discriminative training of MFA for speaker identification. The parameters of factor loading matrices or diagonal matrices are shared in different mixtures of MFA. The minimum classification error (MCE) training is applied to the MFA parameters to enhance the discrimination abilities. The results of text-independent speaker identification experiments show that MFA outperforms the conventional Gaussian mixture models (GMMs) with diagonal or full covariance matrices and achieve the best performance when sharing the diagonal matrices, resulting in a relative gain of 26% over the GMM with diagonal covariance matrices. The recognition performance is further improved by the MCE training with an additional 3% error reduction. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops