Formant tracking using quasi-closed phase forward-backward linear prediction analysis and deep neural networks

Loading...
Thumbnail Image
Access rights
openAccess
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal
View/Open full text file from the Research portal
Date
2021
Major/Subject
Mcode
Degree programme
Language
en
Pages
10
151631-151640
Series
IEEE Access, Volume 9
Abstract
Formant tracking is investigated in this study by using trackers based on dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach, six formant estimation methods were first compared. The six methods include linear prediction (LP) algorithms, weighted LP algorithms and the recently developed quasi-closed phase forward-backward (QCP-FB) method. QCP-FB gave the best performance in the comparison. Therefore, a novel formant tracking approach, which combines benefits of deep learning and signal processing based on QCP-FB, was proposed. In this approach, the formants predicted by a DNN-based tracker from a speech frame are refined using the peaks of the all-pole spectrum computed by QCP-FB from the same frame. Results show that the proposed DNN-based tracker performed better both in detection rate and estimation error for the lowest three formants compared to reference formant trackers. Compared to the popular Wavesurfer, for example, the proposed tracker gave a reduction of 29%, 48%, and 35% in the estimation error for the lowest three formants, respectively.
Description
Keywords
Speech analysis, formant tracking, linear prediction, dynamic programming, deep neural network
Other note
Citation
Gowda, D, Bollepalli, B, Kadiri, S & Alku, P 2021, ' Formant tracking using quasi-closed phase forward-backward linear prediction analysis and deep neural networks ', IEEE Access, vol. 9, pp. 151631-151640 . https://doi.org/10.1109/ACCESS.2021.3126280