Emotional Festival-Mbrola TTS Synthesis

The topic of this work is an extension of our previous research on the development of a general data-driven procedure for creating a neutral "narrative-style" prosodic module for the Italian FESTIVAL Text-To-Speech (TTS) synthesizer, and it is focused on investigating and implementing new strategies for building a new emotional FESTIVAL TTS. The new emotional prosodic modules, similarly to the neutral case, are still based on the "Classification And Regression Tree" (CART) theory. The extension to the emotional speech synthesis is obtained using a differential approach: the emotional prosodic modules learn the differences between the neutral (without emotions) and the emotional prosodic data. Moreover, due to the fact that Voice Quality (VQ) is known to play an important role in emotive speech, a rule-based FESTIVAL-MBROLA VQ-modification module, for control of temporal and spectral characteristics of the synthesis, has also been implemented. Even if emotional synthesis still remains an attractive open issue, our preliminary evaluation results underline the effectiveness of the proposed solution.

Tipo Pubblicazione: 
Contributo in atti di convegno
Author or Creator: 
Tesser F.
Cosi P.
Drioli C.
Tisato G.
Publisher: 
ISCA c/o Institut fuer Kommunikationsforschung und Phonetik Universitaet Bonn - Poppelsdorfer Allee 47, D-53115, Bonn, DEU
Source: 
Eurospeech/Interspeech 2005 - 9th European Conference on Speech Communication Technology, pp. 505–508, Lisbon, PORTUGAL, 4-8 Settembre 2005
Date: 
2005
Resource Identifier: 
http://www.cnr.it/prodotto/i/181146
http://www.isca-speech.org/archive/interspeech_2005/
urn:isbn:978-1-60423-448-0
Language: 
Eng