Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorKadiri, Sudarsana Reddyen_US
dc.contributor.authorAlku, Paavoen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorSpeech Communication Technologyen
dc.date.accessioned2019-11-15T08:12:45Z
dc.date.available2019-11-15T08:12:45Z
dc.date.embargoinfo:eu-repo/date/embargoEnd/2020-05-09en_US
dc.date.issued2019-11-08en_US
dc.description.abstractExisting studies in classification of phonation types in singing use voice source features and Mel-frequency cepstral coefficients (MFCCs) showing poor performance due to high pitch in singing. In this study, high-resolution spectra obtained using the zero-time windowing (ZTW) method is utilized to capture the effect of voice excitation. ZTW does not call for computing the source-filter decomposition (which is needed by many voice source features) which makes it robust to high pitch. For the classification, the study proposes extracting MFCCs from the ZTW spectrum. The results show that the proposed features give a clear improvement in classification accuracy compared to the existing features.en
dc.description.versionPeer revieweden
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationKadiri, S R & Alku, P 2019, 'Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing', Journal of the Acoustical Society of America, vol. 146, no. 5, pp. EL418-EL423. https://doi.org/10.1121/1.5131043en
dc.identifier.doi10.1121/1.5131043en_US
dc.identifier.issn1520-8524
dc.identifier.otherPURE UUID: d2476c19-3afd-46ac-9c58-aba164faadc8en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/d2476c19-3afd-46ac-9c58-aba164faadc8en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/38524297/ELEC_Kadiri_Mel_frequency_cepstral_JasaEL.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/41274
dc.identifier.urnURN:NBN:fi:aalto-201911156279
dc.language.isoenen
dc.publisherAcoustical Society of America
dc.relation.ispartofseriesJournal of the Acoustical Society of Americaen
dc.relation.ispartofseriesVolume 146, issue 5, pp. EL418-EL423en
dc.rightsopenAccessen
dc.titleMel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singingen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionacceptedVersion

Files