Paper: | SP-P9.12 | ||
Session: | Topics in Speech Synthesis | ||
Time: | Wednesday, May 19, 15:30 - 17:30 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Speech Synthesis (including TTS) | ||
Title: | MODELING PRONUNCIATION VARIATION FOR SPONTANEOUS SPEECH SYNTHESIS | ||
Authors: | Steffen Werner; Dresden University of Technology | ||
Matthias Wolff; Dresden University of Technology | |||
Matthias Eichner; Dresden University of Technology | |||
RĂ¼diger Hoffmann; Dresden University of Technology | |||
Abstract: | Integration of pronunciation modeling into speech synthesis makes synthetic speech more natural and colloquial. Pronunciation variation as one observable effect in spontaneous speech is a step towards spontaneous speech synthesis. In previous works we introduced different duration control methods in speech synthesis. These methods based on the observation that words, which are very likely to occur in a given context are pronounced faster and less accurate than improbable ones. Therefore we use the probability of a word in its context either to control directly the local speaking rate or to select appropriate pronunciation variants to realize the change in the local speaking rate. Extending these methods by a pronunciation sequence model, we involve knowledge about how well two subsequent variants fit together. With the here proposed algorithm we could further improve the natural and colloquial listening impression. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops