Paper: | SP-P13.5 | ||
Session: | General Topics in Robust Speech Recognition | ||
Time: | Thursday, May 20, 13:00 - 15:00 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Robust Speech Recognition | ||
Title: | A FACTORIAL HMM APPROACH TO SIMULTANEOUS RECOGNITION OF ISOLATED DIGITS SPOKEN BY MULTIPLE TALKERS ON ONE AUDIO CHANNEL | ||
Authors: | Ameya Deoras; University of Illinois at Urbana-Champaign | ||
Mark Hasegawa-Johnson; University of Illinois at Urbana-Champaign | |||
Abstract: | This paper addresses the novel problem of recognizing digits spoken simultaneously by two different talkers. A Factorial Hidden Markov Model architecture is proposed to accurately model the simultaneous utterance of two digits. Nadas’ MIXMAX approximation is extended to a mixture of Gaussians observation PDF which enables the implementation of the proposed system. The multiple digit recognizer is found to successfully recognize pairs of simultaneous utterances of digits at 0db SNR with up to 89% accuracy. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops