Automatic Classification of Neurological Voice Disorders Using Wavelet Scattering Features
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2024-02
Major/Subject
Mcode
Degree programme
Language
en
Pages
10
Series
Speech Communication, Volume 157
Abstract
Neurological voice disorders are caused by problems in the nervous system as it interacts with the larynx. In this paper, we propose to use wavelet scattering transform (WST)-based features in automatic classification of neurological voice disorders. As a part of WST, a speech signal is processed in stages with each stage consisting of three operations – convolution, modulus and averaging – to generate low-variance data representations that preserve discriminability across classes while minimizing differences within a class. The proposed WST-based features were extracted from speech signals of patients suffering from either spasmodic dysphonia (SD) or recurrent laryngeal nerve palsy (RLNP) and from speech signals of healthy speakers of the Saarbruecken voice disorder (SVD) database. Two machine learning algorithms (support vector machine (SVM) and feed forward neural network (NN)) were trained separately using the WST-based features, to perform two binary classification tasks (healthy vs. SD and healthy vs. RLNP) and one multi-class classification task (healthy vs. SD vs. RLNP). The results show that WST-based features outperformed state-of-the-art features in all three tasks. Furthermore, the best overall classification performance was achieved by the NN classifier trained using WST-based features.Description
Keywords
MFCC, neural network, neurological voice disorder, spasmodic dysphonia, support vector machine, wavelet scattering transform, Spasmodic dysphonia, Neurological voice disorder, Neural network, Support vector machine, Wavelet scattering transform
Other note
Citation
Yagnavajjula, M, Mittapalle, K, Alku, P, Rao, K S & Mitra, P 2024, ' Automatic Classification of Neurological Voice Disorders Using Wavelet Scattering Features ', Speech Communication, vol. 157, 103040 . https://doi.org/10.1016/j.specom.2024.103040