Paper: | SP-P8.8 | ||
Session: | Voice Activity Detection and Speech Segmentation | ||
Time: | Wednesday, May 19, 13:00 - 15:00 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Speech Analysis | ||
Title: | A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST | ||
Authors: | Beena Ahmed; RMIT University | ||
W. Harvey Holmes; University of New South Wales | |||
Abstract: | This paper proposes a voice activity detector (VAD) that makes the speech/noise classification by applying the statistical chi-square test to each frame. It also uses a continuous update of the background noise estimate. The speech is first enhanced using a noise reduction system, with noise estimates also obtained with the help of the chi-square test. The noise-reduced signal is decomposed into sub-bands, and the chi-square test is used again in another form to compare the observed signal distribution to the estimated noise distribution. If the chi-square test determines that they are close, the frame is declared to be noise, otherwise speech. The performance of this VAD was found to be significantly superior to several benchmark VADs, with accuracies above 89% even at a SNR of 0 dB, which is up to 25% better than the others. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops