Technical Program

Paper Detail

Paper:	SP-P13.13 (ICASSP 2003 Paper)
Session:	General Topics in Robust Speech Recognition
Time:	Thursday, May 20, 13:00 - 15:00
Presentation:	Poster (ICASSP 2003 Presentation)
Topic:	Speech Processing: Large Vocabulary Recognition/Search
Title:	TOWARDS AUTOMATIC TRANSCRIPTION OF LARGE SPOKEN ARCHIVES - ENGLISH ASR FOR THE MALACH PROJECT
Authors:	Bhuvana Ramabhadran; IBM T. J. Watson Research Center
	Jing Huang; IBM T. J. Watson Research Center
	Michael Picheny; IBM T. J. Watson Research Center
Abstract:	Digital archives have emerged as the pre-eminent method for capturing the human experience. Before such archives can be used efficiently, their contents must be described. The NSF-funded MALACH project aims to provide improved access to large spoken archives by advancing the state of the art in automatic speech recognition (ASR), information retrieval (IR), and related technologies [1, 2] for multiple languages. This paper describes the ASR research for the English speech in the MALACH corpus. The corpus consists of unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulations, un-cued speaker and language switching, and emotional speech, collected in the form of interviews from over 52,000 speakers in 32 languages. We describe this new testbed for developing speech recognition algorithms and report on the performance of well-known techniques for building better acoustic models for the speaking styles seen in this corpus. The best English ASR system to date has a word error rate of 43.8% on this corpus.
