Technical Program

Paper Detail

Paper:	SP-P14.2
Session:	Acoustic Modeling: Tone, Prosody, and Features
Time:	Thursday, May 20, 15:30 - 17:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	DISCRIMINATIVE FEATURE TRANSFORMATION BY GUIDED DISCRIMINATIVE TRAINING
Authors:	Roger Hsiao; Hong Kong University of Science and Technology
	Brian Mak; Hong Kong University of Science and Technology
Abstract:	In this paper, we investigate a special form of discriminative training which we call guided discriminative training in the context of multi-class classification problems. We are interested in applications that require improving the classification performance of only a subset of the classes at the expense of possibly (but not necessarily) poorer classification performance of the remaining classes. However, should the classification of the remaining classes get worse, it is guaranteed not to be worse than an extent that the user specifies. The problem is formulated as a nonlinear programming problem, which can be translated to a unconstrained nonlinear optimization problem using the barrier method that, in turn, can be solved by gradient descent method. To prove the concept, we applied the guided discriminative training to derive an optimal linear transformation on the mel-filterbank log power spectra to improve TIMIT phoneme classification. Encouraging results are obtained.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004