Technical Program

Paper Detail

Paper: ITT-P1.3
Session: Speech and Language Applications
Time: Thursday, May 20, 15:30 - 17:30
Presentation: Poster
Topic: Industry Technology Track: Speech Recognition
Title: AUTOMATIC AND LANGUAGE INDEPENDENT TRIPHONE TRAINING USING PHONETIC TABLES
Authors: Lorin Netsch; Texas Instruments, Inc. 
 Alexis Bernard; Texas Instruments, Inc. 
Abstract: Training triphone acoustic models for speech recognition is time-consuming and requires substantial manual intervention. We present an alternative: automatic training using a pronunciation phonetic table that summarizes the articulatory characteristics of the target language. Given an existing set of reference monophones in one or more languages, the method can train triphones for any language by automatically performing monophone seeding, triphone clustering, and the other training steps. Because the training algorithm is automatic, its parameters can be optimized, which can further improve recognition accuracy relative to manually trained models. In a continuous digit recognition experiment, automatically generated triphone models gave a 1.26% error rate, compared to a 2.30% error rate for their manually trained counterparts.
 

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004