Technical Program

Paper Detail

Paper:SP-P14.12
Session:Acoustic Modeling: Tone, Prosody, and Features
Time:Thursday, May 20, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Acoustic Modeling for Speech Recognition
Title: MINIMUM CLASSIFICATION ERROR TRAINING OF LANDMARK MODELS FOR REAL-TIME CONTINUOUS SPEECH RECOGNITION
Authors: Erik McDermott; NTT Corporation 
 Timothy Hazen; Massachusetts Institute of Technology 
Abstract: Though many studies have confirmed the effectiveness of the MinimumClassification Error (MCE) framework for discriminative training ofHMMs applied to speech recognition, few if any have reported MCEresults for large (> 100 hours) training sets in the context ofreal-world, continuous speech recognition. Here we report substantial gains in performance for the MIT JUPITER weather information task as aresult of MCE-based optimization of acoustic models. Investigation ofword error rate vs. computation time showed that small MCE modelssignificantly outperform the Maximum Likelihood (ML) baseline at allpoints of equal computation time, resulting in up to 20% word errorrate reduction for in-vocabulary utterances. The overall MCE lossfunction was minimized using Quickprop, a simple but effectivesecond-order optimization method suited to parallelization over largetraining sets.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004