Technical Program

Paper Detail

Paper:	SP-L6.3
Session:	Feature Analysis for Speech Recognition
Time:	Thursday, May 20, 13:40 - 14:00
Presentation:	Lecture
Topic:	Speech Processing: Feature Extraction
Title:	THE ETSI EXTENDED DISTRIBUTED SPEECH RECOGNITION (DSR) STANDARDS: CLIENT SIDE PROCESSING AND TONAL LANGUAGE RECOGNITION EVALUATION
Authors:	Alexander Sorin; IBM Labs
	Tenkasi Ramabadran; Motorola Labs
	Dan Chazan; IBM Labs
	Ron Hoory; IBM Labs
	Michael McLaughlin; Motorola Labs
	David Pearce; Motorola Labs
	Fan Wang; IBM Labs
	Yaxin Zhang; Motorola Labs
Abstract:	In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212 [1][2]. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the client-side estimation of pitch and voicing class parameters whereas a companion paper discusses the server-side speech reconstruction. Experimental results show enhancement of tonal language recognition rates of proprietary recognition engines, when the standard extensions are used.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004