Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network

Loading...
Thumbnail Image
Access rights
openAccess
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
Date
2017-04-01
Major/Subject
Mcode
Degree programme
Language
en
Pages
4
EL327-EL330
Series
Journal of the Acoustical Society of America, Volume 141, issue 4
Abstract
Estimation of the spectral tilt of the glottal source has several applications in speech analysis and modification. However, direct estimation of the tilt from telephone speech is challenging due to vocal tract resonances and distortion caused by speech compression. In this study, a deep neural network is used for the tilt estimation from telephone speech by training the network with tilt estimates computed by glottal inverse filtering. An objective evaluation shows that the proposed technique gives more accurate estimates for the spectral tilt than previously used techniques that estimate the tilt directly from telephone speech without glottal inverse filtering.
Description
Keywords
Other note
Citation
Jokinen , E & Alku , P 2017 , ' Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network ' , Journal of the Acoustical Society of America , vol. 141 , no. 4 , pp. EL327-EL330 . https://doi.org/10.1121/1.4979162