Paper: SP-P5.5
Session: Topics in Speech Coding
Time: Wednesday, May 19, 09:30 - 11:30
Presentation: Poster
Topic: Speech Processing: Narrowband Speech Coding
Title: AUTOMATICALLY DERIVED UNITS FOR SEGMENT VOCODERS
Authors: V. Ramasubramanian; Indian Institute of Science
Thippur V. Sreenivas; Indian Institute of Science
Abstract: Segment vocoders play a special role in very low bitrate speech coding, achieving intelligible speech at bitrates of 300 bits/sec. In this paper, we explore the definition and use of automatically derived units for segment quantization in segment vocoders. We consider three automatic segmentation techniques for defining diphone-like and phone-like units: the spectral transition measure (STM), unconstrained maximum-likelihood (ML) segmentation, and duration-constrained ML segmentation. We show that the ML segmentations yield phone-like units that are significantly better than those obtained by STM, both in match accuracy against the TIMIT phone segmentation and in actual vocoder performance measured by segmental SNR. Moreover, the phone-like units of the ML segmentations also outperform the diphone-like units obtained using STM in early vocoders. We also show that the segment vocoder can operate at very high intelligibility when used in a single-speaker mode.
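
The duration-constrained ML segmentation named in the abstract can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation. It assumes the common formulation in which each segment is modeled by its centroid, so that maximizing likelihood under a unit-variance Gaussian reduces to minimizing the total squared distance of frames to their segment centroid. The function names `segment_cost` and `ml_segment`, the parameters `min_len` and `max_len`, and the toy one-dimensional "frames" are all hypothetical choices made for illustration.

```python
from typing import List, Optional, Sequence


def segment_cost(frames: Sequence[Sequence[float]], start: int, end: int) -> float:
    """Sum of squared distances of frames[start:end] to their centroid."""
    n = end - start
    dim = len(frames[0])
    centroid = [sum(frames[t][d] for t in range(start, end)) / n for d in range(dim)]
    return sum(
        (frames[t][d] - centroid[d]) ** 2
        for t in range(start, end)
        for d in range(dim)
    )


def ml_segment(
    frames: Sequence[Sequence[float]],
    num_segments: int,
    min_len: int = 1,
    max_len: Optional[int] = None,
) -> List[int]:
    """Return exclusive end indices of the segments with minimum total distortion.

    Assumption: the segment count is given, and min_len/max_len bound each
    segment's duration in frames (defaults give the unconstrained case).
    """
    total = len(frames)
    if max_len is None:
        max_len = total
    inf = float("inf")
    # best[k][t]: minimum cost of splitting frames[0:t] into exactly k segments
    best = [[inf] * (total + 1) for _ in range(num_segments + 1)]
    back = [[-1] * (total + 1) for _ in range(num_segments + 1)]
    best[0][0] = 0.0
    for k in range(1, num_segments + 1):
        for t in range(1, total + 1):
            # The last segment is frames[s:t]; its length must lie in [min_len, max_len]
            for s in range(max(0, t - max_len), t - min_len + 1):
                if best[k - 1][s] == inf:
                    continue
                cost = best[k - 1][s] + segment_cost(frames, s, t)
                if cost < best[k][t]:
                    best[k][t] = cost
                    back[k][t] = s
    if best[num_segments][total] == inf:
        raise ValueError("no segmentation satisfies the duration constraints")
    # Trace the stored start indices back from the final frame
    bounds = []
    t = total
    for k in range(num_segments, 0, -1):
        bounds.append(t)
        t = back[k][t]
    return bounds[::-1]
```

Calling `ml_segment(frames, 2)` gives the unconstrained segmentation, while `ml_segment(frames, 2, min_len=3)` forces every segment to span at least three frames. The sketch recomputes each segment cost from scratch for clarity; a practical version would cache these costs or use running sums.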