Paper: SP-P8.5
Session: Voice Activity Detection and Speech Segmentation
Time: Wednesday, May 19, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Speech Analysis
Title: SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELS
Authors: Shahab Oveisgharan; Sharif University of Technology
Mohammad Bagher Shamsollahi; Sharif University of Technology
Abstract: The ARMA filtered fractionally differenced Gaussian noise (FdGn) model and a new AR filtered FdGn Added up model are applied to the speech signal, and the performance of their parameters in Voiced/Unvoiced/Mixed/Silence (V/U/M/S) speech classification is evaluated against the Zero Crossing Rate (ZCR) feature. Two methods were used to estimate the parameters of the AR filtered FdGn model: the iterative Maximum Likelihood (ML) method of Tewfik [2] and a new, computationally efficient Linear Minimum Square Error (LMSE) algorithm. Two approaches were likewise implemented to estimate the parameters of the new Added up model: an Expectation-Maximization (EM) based approach and an iterative MSE approach. The models and methods were applied both to the speech signal and to its real cepstrum. Based on the J1 parameter, the performance of the models in V/U/M/S speech classification was obtained in this order: Added up model on the real cepstrum of speech, filtered FdGn model on the real cepstrum of speech (LMSE method), filtered FdGn model on speech (LMSE method), ZCR, and filtered FdGn model on speech (Tewfik method).
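For readers unfamiliar with the two basic ingredients named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It shows one common way to compute a per-frame zero crossing rate and one common way to turn white Gaussian noise into fractionally differenced Gaussian noise, by applying a truncated expansion of the filter (1 - B)^(-d). The function names `zero_crossing_rate` and `fractional_integrate`, the truncation at the frame length, and the convention that a zero sample counts as positive are all assumptions made for illustration; none of the paper's estimation methods (ML, LMSE, EM) are reproduced here.

```python
import numpy as np


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ.

    Assumes the frame has at least two samples; a sample equal to
    +0.0 is treated as positive (illustrative convention).
    """
    signs = np.signbit(np.asarray(frame, dtype=float))
    return float(np.mean(signs[1:] != signs[:-1]))


def fractional_integrate(white, d):
    """Apply a truncated (1 - B)^(-d) filter to a white-noise sequence.

    The filter weights follow the standard recurrence
    psi_0 = 1, psi_k = psi_{k-1} * (k - 1 + d) / k,
    truncated here at the input length (an assumption of this sketch).
    """
    white = np.asarray(white, dtype=float)
    n = white.size
    k = np.arange(1, n)
    psi = np.concatenate(([1.0], np.cumprod((k - 1 + d) / k)))
    # Causal convolution, keeping only the first n output samples.
    return np.convolve(white, psi)[:n]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(1024)
    fdgn = fractional_integrate(noise, d=0.3)
    print("ZCR of white noise:", zero_crossing_rate(noise))
    print("ZCR of FdGn (d=0.3):", zero_crossing_rate(fdgn))
```

With d = 0 the filter reduces to the identity, so the output equals the input noise. Values of d between 0 and 0.5 give a stationary long-memory process; the AR or ARMA filtering stage described in the abstract would be applied on top of this sequence.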