Technical Program

Paper Detail

Paper:SP-P8.5
Session:Voice Activity Detection and Speech Segmentation
Time:Wednesday, May 19, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Speech Analysis
Title: SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELS
Authors: Shahab Oveisgharan; Sharif University of Technology 
 Mohammad Bagher Shamsollahi; Sharif University of Technology 
Abstract: The ARMA filtered fractionally differenced Gaussian Noise (FdGn) model and a new AR Filtered FdGn Added up model are applied to speech signal and performance of their parameters on speech Unvoiced/Voiced/Mixed/Silence classification is evaluated against Zero Crossing Rate (ZCR) feature. For parameter estimation of AR filtered FdGn model two methods were applied: iterative Maximum Likelihood (ML) method of Tewfik [2] and a new computationally efficient Linear Minimum Square Error (LMSE) algorithm. Also for parameters estimation of new Added up model two approaches were implemented: an Expectation-Maximization (EM) based approach and an iterative MSE approach. The described models and methods were applied to speech signal and also its real Cepstrum. The performance of described models on V/U/M/S speech classification was obtained based on J1 parameter in this order: Added up model on real Cepstrum of speech, Filtered FdGn model on real Cepstrum of speech (LMSE method), Filtered FdGn model on speech (LMSE method), ZCR, and Filtered FdGn model on speech (Tewfik method).
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004