Technical Program

Paper Detail

Paper:	MLSP-P3.2
Session:	Speech and Audio Processing
Time:	Wednesday, May 19, 15:30 - 17:30
Presentation:	Poster
Topic:	Machine Learning for Signal Processing: Signal detection, Pattern Recognition and Classification
Title:	LOGISTIC DISCRIMINATIVE SPEECH DETECTORS USING POSTERIOR SNR
Authors:	Arun Surendran; Microsoft Research
	Somsak Sukittanon; University of Washington
	John Platt; Microsoft Research
Abstract:	We introduce an elegant and novel design for a speech detector which estimates the probability of the presence of speech in each time-frequency bin, as well as in each frame. The proposed system uses discriminative estimators based on logistic regression, and incorporates spectral and temporal correlations in the same framework. The detector is flexible enough to be configured in a single level or a ``stacked'' bi-level architecture depending on the needs of the application. An important part of the proposed design is the use of a new set of features: the normalized logarithm of the estimated posterior signal-to-noise ratio. These can be easily and automatically generated by tracking the noise spectrum online. We present results on the AURORA database to demonstrate that the overall design is simple, flexible and effective.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004