End-to-end Pathological Speech Detection using Wavelet Scattering Network
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Mittapalle, Kiran | en_US |
| dc.contributor.author | Yagnavajjula, Madhu | en_US |
| dc.contributor.author | Alku, Paavo | en_US |
| dc.contributor.department | Department of Signal Processing and Acoustics | en |
| dc.contributor.groupauthor | Speech Communication Technology | en |
| dc.date.accessioned | 2023-01-18T09:23:01Z | |
| dc.date.available | 2023-01-18T09:23:01Z | |
| dc.date.issued | 2022-08-17 | en_US |
| dc.description.abstract | In recent years, developing robust systems for automatic detection of pathological speech has attracted increasing interest among researchers and clinicians. This study proposes an end-to-end approach based on wavelet scattering network (WSN) for detection of pathological speech. In the proposed approach, the WSN (which involves no learning) extracts suitable information from the input raw speech signal and this information is then passed through a multi-layer perceptron (MLP) in order to classify the speech signal as either healthy or pathological. The results show that the proposed approach outperformed a convolutional neural network (CNN) based end-to-end system in distinguishing pathological speech from healthy speech. Furthermore, the proposed system achieved comparable performance with a state-of-the-art traditional system based on hand-crafted features for uncompressed speech, but gave better performance than the traditional system for compressed speech of low bit rates. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 5 | |
| dc.format.mimetype | application/pdf | en_US |
| dc.identifier.citation | Mittapalle, K, Yagnavajjula, M & Alku, P 2022, 'End-to-end Pathological Speech Detection using Wavelet Scattering Network', IEEE Signal Processing Letters, vol. 29, pp. 1863-1867. https://doi.org/10.1109/LSP.2022.3199669 | en |
| dc.identifier.doi | 10.1109/LSP.2022.3199669 | en_US |
| dc.identifier.issn | 1070-9908 | |
| dc.identifier.issn | 1558-2361 | |
| dc.identifier.other | PURE UUID: 5f32aa88-e7ca-47d5-abad-471d24a4b7f8 | en_US |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/5f32aa88-e7ca-47d5-abad-471d24a4b7f8 | en_US |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/98082403/End_to_End_Pathological_Speech_Detection_Using_Wavelet_Scattering_Network.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/118864 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202301181220 | |
| dc.language.iso | en | en |
| dc.publisher | IEEE | |
| dc.relation.ispartofseries | IEEE Signal Processing Letters | en |
| dc.relation.ispartofseries | Volume 29, pp. 1863-1867 | en |
| dc.rights | openAccess | en |
| dc.subject.keyword | Wavelet scattering network | en_US |
| dc.subject.keyword | CNN | en_US |
| dc.subject.keyword | pathological speech | en_US |
| dc.subject.keyword | MFCC | en_US |
| dc.subject.keyword | openSMILE features | en_US |
| dc.subject.keyword | MP3 compression | en_US |
| dc.title | End-to-end Pathological Speech Detection using Wavelet Scattering Network | en |
| dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
| dc.type.version | publishedVersion |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- End_to_End_Pathological_Speech_Detection_Using_Wavelet_Scattering_Network.pdf
- Size:
- 1.35 MB
- Format:
- Adobe Portable Document Format