Title: | Voice source modelling techniques for statistical parametric speech synthesis Puheen äänilähteen mallintaminen tilastollisessa parametrisessa puhesynteesissä |
Author(s): | Raitio, Tuomo |
Date: | 2015 |
Language: | en |
Pages: | 182 + app. 105 |
Department: | Signaalinkäsittelyn ja akustiikan laitos Department of Signal Processing and Acoustics |
ISBN: | 978-952-60-6137-5 (electronic) 978-952-60-6136-8 (printed) |
Series: | Aalto University publication series DOCTORAL DISSERTATIONS, 40/2015 |
ISSN: | 1799-4942 (electronic) 1799-4934 (printed) 1799-4934 (ISSN-L) |
Supervising professor(s): | Alku, Paavo, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Thesis advisor(s): | Alku, Paavo, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Subject: | Acoustics |
Keywords: | statistical parametric speech synthesis, voice source modelling, glottal inverse filtering, voice quality, expressive speech synthesis, tilastollinen parametrinen puhesynteesi, äänilähteen mallintaminen, äänilähteen käänteissuodatus, äänenlaatu, ekspressiivinen puhesynteesi |
Archive | yes |
OEVS yes | |
|
|
Abstract:Puhe on ihmisten luonnollisin tapa kommunikoida, ja siksi puhetta tuottavan koneen suunnittelu on jo kauan kiehtonut ihmisiä. Kuitenkin vasta viime vuosikymmeninä puhesynteesistä on tullut käytännössä mahdollista, mikä suureksi osaksi on johtunut puheen digitaalisesta esitysmuodosta ja kasvaneesta laskentatehosta. Vaikka puhesynteesiä käytetään nykyään monenlaisissa sovelluksissa, kuten ihmisen ja tietokoneen vuorovaikutuksessa sekä avustavassa teknologiassa, nykyiset puhesyntetisaattorit ovat kuitenkin vielä kaukana ihmisten monipuolisesta puheentuottokyvystä. |
|
Parts:[Publication 1]: Harri Auvinen, Tuomo Raitio, Samuli Siltanen, Brad H. Story, and Paavo Alku. Automatic glottal inverse filtering with the Markov chain Monte Carlo method. Computer Speech and Language, vol. 28, no. 5, pp. 1139–1155, September 2014. DOI: 10.1016/j.csl.2013.09.004 View at Publisher [Publication 2]: Manu Airaksinen, Tuomo Raitio, Brad Story, and Paavo Alku. Quasi closed phase glottal inverse filtering analysis with weighted linear prediction. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 3, pp. 596–607, March 2014. DOI: 10.1109/TASLP.2013.2294585 View at Publisher [Publication 3]: Tuomo Raitio, Antti Suni, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio, and Paavo Alku. HMM-based speech synthesis utilizing glottal inverse filtering. IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 1, pp. 153–165, January 2011. DOI: 10.1109/TASL.2010.2045239 View at Publisher [Publication 4]: Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, and Paavo Alku. Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, pp. 4564–4567, May 2011. DOI: 10.1109/ICASSP.2011.5947370 View at Publisher [Publication 5]: Tuomo Raitio, Antti Suni, Lauri Juvela, Martti Vainio, and Paavo Alku. Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort. In Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech), Singapore, pp. 1969–1973, September 2014.[Publication 6]: Thomas Drugman and Tuomo Raitio. Excitation modeling for HMMbased speech synthesis: Breaking down the impact of periodic and aperiodic components. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, pp. 260–264, May 2014. DOI: 10.1109/ICASSP.2014.6853598 View at Publisher [Publication 7]: Tuomo Raitio, Antti Suni, Martti Vainio, and Paavo Alku. Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise. Computer Speech and Language, vol. 28, no. 2, pp. 648–664, March 2014. DOI: 10.1016/j.csl.2013.03.003 View at Publisher [Publication 8]: Tuomo Raitio, Antti Suni, Jouni Pohjalainen, Manu Airaksinen, Martti Vainio, and Paavo Alku. Analysis and synthesis of shouted speech. In Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech), Lyon, France, pp. 1544–1548, August 2013.[Publication 9]: Tuomo Raitio, John Kane, Thomas Drugman, and Christer Gobl. HMM-based synthesis of creaky voice. In Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech), Lyon, France, pp. 2316–2320, August 2013. |
|
|
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Page content by: Aalto University Learning Centre | Privacy policy of the service | About this site