End-to-end Pathological Speech Detection using Wavelet Scattering Network

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorMittapalle, Kiranen_US
dc.contributor.authorYagnavajjula, Madhuen_US
dc.contributor.authorAlku, Paavoen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorSpeech Communication Technologyen
dc.date.accessioned2023-01-18T09:23:01Z
dc.date.available2023-01-18T09:23:01Z
dc.date.issued2022-08-17en_US
dc.description.abstractIn recent years, developing robust systems for automatic detection of pathological speech has attracted increasing interest among researchers and clinicians. This study proposes an end-to-end approach based on wavelet scattering network (WSN) for detection of pathological speech. In the proposed approach, the WSN (which involves no learning) extracts suitable information from the input raw speech signal and this information is then passed through a multi-layer perceptron (MLP) in order to classify the speech signal as either healthy or pathological. The results show that the proposed approach outperformed a convolutional neural network (CNN) based end-to-end system in distinguishing pathological speech from healthy speech. Furthermore, the proposed system achieved comparable performance with a state-of-the-art traditional system based on hand-crafted features for uncompressed speech, but gave better performance than the traditional system for compressed speech of low bit rates.en
dc.description.versionPeer revieweden
dc.format.extent5
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationMittapalle, K, Yagnavajjula, M & Alku, P 2022, 'End-to-end Pathological Speech Detection using Wavelet Scattering Network', IEEE Signal Processing Letters, vol. 29, pp. 1863-1867. https://doi.org/10.1109/LSP.2022.3199669en
dc.identifier.doi10.1109/LSP.2022.3199669en_US
dc.identifier.issn1070-9908
dc.identifier.issn1558-2361
dc.identifier.otherPURE UUID: 5f32aa88-e7ca-47d5-abad-471d24a4b7f8en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/5f32aa88-e7ca-47d5-abad-471d24a4b7f8en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/98082403/End_to_End_Pathological_Speech_Detection_Using_Wavelet_Scattering_Network.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/118864
dc.identifier.urnURN:NBN:fi:aalto-202301181220
dc.language.isoenen
dc.publisherIEEE
dc.relation.ispartofseriesIEEE Signal Processing Lettersen
dc.relation.ispartofseriesVolume 29, pp. 1863-1867en
dc.rightsopenAccessen
dc.subject.keywordWavelet scattering networken_US
dc.subject.keywordCNNen_US
dc.subject.keywordpathological speechen_US
dc.subject.keywordMFCCen_US
dc.subject.keywordopenSMILE featuresen_US
dc.subject.keywordMP3 compressionen_US
dc.titleEnd-to-end Pathological Speech Detection using Wavelet Scattering Networken
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
End_to_End_Pathological_Speech_Detection_Using_Wavelet_Scattering_Network.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format