aalto1 untyped-item.component.html
Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing
Loading...
Access rights
openAccess
acceptedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Authors
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
Series
Journal of the Acoustical Society of America, Volume 146, issue 5, pp. EL418-EL423
Abstract
Existing studies in classification of phonation types in singing use voice source features and Mel-frequency cepstral coefficients (MFCCs) showing poor performance due to high pitch in singing. In this study, high-resolution spectra obtained using the zero-time windowing (ZTW) method is utilized to capture the effect of voice excitation. ZTW does not call for computing the source-filter decomposition (which is needed by many voice source features) which makes it robust to high pitch. For the classification, the study proposes extracting MFCCs from the ZTW spectrum. The results show that the proposed features give a clear improvement in classification accuracy compared to the existing features.
Description
Keywords
Other note
Citation
Kadiri, S R & Alku, P 2019, 'Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing', Journal of the Acoustical Society of America, vol. 146, no. 5, pp. EL418-EL423. https://doi.org/10.1121/1.5131043