Speaking style conversion from normal to Lombard speech using a glottal vocoder and Bayesian GMMs

Loading...
Thumbnail Image
Access rights
openAccess
Journal Title
Journal ISSN
Volume Title
Conference article in proceedings
This publication is imported from Aalto University research portal.
View publication in the Research portal
View/Open full text file from the Research portal
Date
2017-08
Major/Subject
Mcode
Degree programme
Language
en
Pages
5
1363-1367
Series
Proceedings of Interspeech 2017, Interspeech: Annual Conference of the International Speech Communication Association
Abstract
Speaking style conversion is the technology of converting natural speech signals from one style to another. In this study, we focus on normal-to-Lombard conversion. This can be used, for example, to enhance the intelligibility of speech in noisy environments. We propose a parametric approach that uses a vocoder to extract speech features. These features are mapped using Bayesian GMMs from utterances spoken in normal style to the corresponding features of Lombard speech. Finally, the mapped features are converted to a Lombard speech waveform with the vocoder. Two vocoders were compared in the proposed normal-to-Lombard conversion: a recently developed glottal vocoder that decomposes speech into glottal flow excitation and vocal tract, and the widely used STRAIGHT vocoder. The conversion quality was evaluated in two subjective listening tests measuring subjective similarity and naturalness. The similarity test results show that the system is able to convert normal speech into Lombard speech for the two vocoders. However, the subjective naturalness of the converted Lombard speech was clearly better using the glottal vocoder in comparison to STRAIGHT.
Description
Keywords
speaking style conversion, vocal effort, Lombard speech, glottar vocoder, Bayesian GMM
Other note
Citation
Ramirez Lopez , A , Seshadri , S , Juvela , L , Räsänen , O & Alku , P 2017 , Speaking style conversion from normal to Lombard speech using a glottal vocoder and Bayesian GMMs . in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH . vol. 2017-August , Interspeech: Annual Conference of the International Speech Communication Association , International Speech Communication Association (ISCA) , pp. 1363-1367 , Interspeech , Stockholm , Sweden , 20/08/2017 . https://doi.org/10.21437/Interspeech.2017-400