Mel-frequency cepstral coefficients of voice source waveforms for classification of phonation types in speech
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Authors
Date
2019-01-01
Major/Subject
Mcode
Degree programme
Language
en
Pages
5
Series
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Volume 2019-September, pp. 2508-2512, Interspeech - Annual Conference of the International Speech Communication Association, INTERSPEECH
Abstract
Voice source characteristics in different phonation types vary due to the tension of laryngeal muscles along with the respiratory effort. This study investigates the use of mel-frequency cepstral coefficients (MFCCs) derived from voice source waveforms for classification of phonation types in speech. The cepstral coefficients are computed using two source waveforms: (1) glottal flow waveforms estimated by the quasi-closed phase (QCP) glottal inverse filtering method and (2) approximate voice source waveforms obtained using the zero frequency filtering (ZFF) method. QCP estimates voice source waveforms based on the source-filter decomposition while ZFF yields source waveforms without explicitly computing the source-filter decomposition. Experiments using MFCCs computed from the two source waveforms show improved accuracy in classification of phonation types compared to the existing voice source features and conventional MFCC features. Further, it is observed that the proposed features have complimentary information to the existing features.Description
Keywords
Glottal inverse filtering, Phonation type, Speech analysis, Voice quality, Voice source, Zero frequency filtering
Other note
Citation
Kadiri, S R & Alku, P 2019, Mel-frequency cepstral coefficients of voice source waveforms for classification of phonation types in speech . in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH . vol. 2019-September, Interspeech - Annual Conference of the International Speech Communication Association, INTERSPEECH, International Speech Communication Association (ISCA), pp. 2508-2512, Interspeech, Graz, Austria, 15/09/2019 . https://doi.org/10.21437/Interspeech.2019-2863