Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Kadiri, Sudarsana Reddy | en_US |
dc.contributor.author | Alku, Paavo | en_US |
dc.contributor.department | Department of Signal Processing and Acoustics | en |
dc.contributor.groupauthor | Speech Communication Technology | en |
dc.date.accessioned | 2019-11-15T08:12:45Z | |
dc.date.available | 2019-11-15T08:12:45Z | |
dc.date.embargo | info:eu-repo/date/embargoEnd/2020-05-09 | en_US |
dc.date.issued | 2019-11-08 | en_US |
dc.description.abstract | Existing studies in classification of phonation types in singing use voice source features and Mel-frequency cepstral coefficients (MFCCs) showing poor performance due to high pitch in singing. In this study, high-resolution spectra obtained using the zero-time windowing (ZTW) method is utilized to capture the effect of voice excitation. ZTW does not call for computing the source-filter decomposition (which is needed by many voice source features) which makes it robust to high pitch. For the classification, the study proposes extracting MFCCs from the ZTW spectrum. The results show that the proposed features give a clear improvement in classification accuracy compared to the existing features. | en |
dc.description.version | Peer reviewed | en |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Kadiri, S R & Alku, P 2019, 'Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing', Journal of the Acoustical Society of America, vol. 146, no. 5, pp. EL418-EL423. https://doi.org/10.1121/1.5131043 | en |
dc.identifier.doi | 10.1121/1.5131043 | en_US |
dc.identifier.issn | 1520-8524 | |
dc.identifier.other | PURE UUID: d2476c19-3afd-46ac-9c58-aba164faadc8 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/d2476c19-3afd-46ac-9c58-aba164faadc8 | en_US |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/38524297/ELEC_Kadiri_Mel_frequency_cepstral_JasaEL.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/41274 | |
dc.identifier.urn | URN:NBN:fi:aalto-201911156279 | |
dc.language.iso | en | en |
dc.publisher | Acoustical Society of America | |
dc.relation.ispartofseries | Journal of the Acoustical Society of America | en |
dc.relation.ispartofseries | Volume 146, issue 5, pp. EL418-EL423 | en |
dc.rights | openAccess | en |
dc.title | Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing | en |
dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
dc.type.version | acceptedVersion |