Spanish Emotional Speech Synthesis

Loading...
Thumbnail Image
Journal Title
Journal ISSN
Volume Title
Sähkötekniikan korkeakoulu |
Date
2014-05-05
Department
Major/Subject
Signal processing
Mcode
S3013
Degree programme
TLT - Master’s Programme in Communications Engineering
Language
en
Pages
8+51
Series
Abstract
In this project a text-to-speech (TTS) HMM-based speech system (HTS) has been used to create emotional synthetic speech in Spanish. Nowadays the synthetic voices have high quality, but this is not enough, they must be able to capture the natural expressiveness of the human speech. Giving this expressiveness to the synthetic voices will lead to a much more natural voice, that is the goal of these systems.To achieve this, both male and female voices will be used and two different techniques will be applied: dependent models and average voice models with adaptation.In this TTS system diffeerent vocoders can be used. For this project GlottHMM has been used and then three perceptual test have been carried out to compare it with STRAIGHT vocoder.The results of the perceptual tests shows that STRAIGHT is very robust and that GlottHMM is not yet at its level regarding the emotional speech synthesis.
Description
Supervisor
Alku, Paavo
Thesis advisor
Raitio, Tuomo
Keywords
emotional speech synthesis, synthetic speech, vocoder, HMM-based, GlottHMM, STRAIGHT
Other note
Citation