Paper: | SP-L1.6 | ||
Session: | Voice Conversion and Morphing Algorithms for TTS Systems | ||
Time: | Tuesday, May 18, 17:10 - 17:30 | ||
Presentation: | Lecture | ||
Topic: | Speech Processing: Speech Synthesis (including TTS) | ||
Title: | VOICE CONVERSION THROUGH TRANSFORMATION OF SPECTRAL AND INTONATION FEATURES | ||
Authors: | Dimitrios Rentzos; Brunel University | ||
Saeed Vaseghi; Brunel University | |||
Qin Yan; Brunel University | |||
Ching-Hsiang Ho; Fortune Institute of Technology | |||
Abstract: | This paper presents a voice conversion method based on transformation of the characteristic features of a source speaker towards a target. Voice characteristic features are grouped into two main categories: (a) the spectral features at formants and (b) the pitch and intonation patterns. Signal modelling and transformation methods for each group of voice features are outlined. The spectral features at formants are modelled using a set of two-dimensional phoneme-dependent HMMs. Subband frequency warping is used for spectrum transformation with the subbands centred on the estimates of the formant trajectories. The F0 contour is used for modelling the pitch and intonation patterns of speech. A PSOLA based method is employed for transformation of pitch, intonation patterns and speaking rate. The experiments present illustrations and perceptual evaluations of the results of transformations of the various voice features. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops