Technical Program

Paper Detail

Paper:SP-L6.3
Session:Feature Analysis for Speech Recognition
Time:Thursday, May 20, 13:40 - 14:00
Presentation: Lecture
Topic: Speech Processing: Feature Extraction
Title: THE ETSI EXTENDED DISTRIBUTED SPEECH RECOGNITION (DSR) STANDARDS: CLIENT SIDE PROCESSING AND TONAL LANGUAGE RECOGNITION EVALUATION
Authors: Alexander Sorin; IBM Labs 
 Tenkasi Ramabadran; Motorola Labs 
 Dan Chazan; IBM Labs 
 Ron Hoory; IBM Labs 
 Michael McLaughlin; Motorola Labs 
 David Pearce; Motorola Labs 
 Fan Wang; IBM Labs 
 Yaxin Zhang; Motorola Labs 
Abstract: In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212 [1][2]. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the client-side estimation of pitch and voicing class parameters whereas a companion paper discusses the server-side speech reconstruction. Experimental results show enhancement of tonal language recognition rates of proprietary recognition engines, when the standard extensions are used.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004