Technical Program

Paper Detail

Paper:	SP-P12.5
Session:	Acoustic Modeling: Model Complexity, General Topics
Time:	Thursday, May 20, 09:30 - 11:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	AUTOMATIC DETERMINATION OF ACOUSTIC MODEL TOPOLOGY USING VARIATIONAL BAYESIAN ESTIMATION AND CLUSTERING
Authors:	Shinji Watanabe; NTT Corporation
	Atsushi Sako; Ryukoku University
	Atsushi Nakamura; NTT Corporation
Abstract:	We describe the automatic determination of an acoustic model for speech recognition, which is very complicated and includes latent variables, using VBEC: Variational Bayesian Estimation and Clustering for speech recognition. We propose an efficient Gaussian Mixture Model (GMM) based phonetic decision tree construction within the VBEC framework. The proposed method features a novel approach to reduce the unrealistically large number of computations needed for iterative calculations in the GMM-based phonetic decision tree method to a practical level by assuming that each Gaussian per state has the same occupancy and is represented by the same posterior distribution for the covariance parameter. The experimental results confirmed that VBEC automatically provided a globally optimum model topology with the highest performance level.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004