Automatic Classification of Neurological Voice Disorders Using Wavelet Scattering Features

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Date

2024-02

Major/Subject

Mcode

Degree programme

Language

en

Pages

10

Series

Speech Communication, Volume 157

Abstract

Neurological voice disorders are caused by problems in the nervous system as it interacts with the larynx. In this paper, we propose to use wavelet scattering transform (WST)-based features in automatic classification of neurological voice disorders. As a part of WST, a speech signal is processed in stages with each stage consisting of three operations – convolution, modulus and averaging – to generate low-variance data representations that preserve discriminability across classes while minimizing differences within a class. The proposed WST-based features were extracted from speech signals of patients suffering from either spasmodic dysphonia (SD) or recurrent laryngeal nerve palsy (RLNP) and from speech signals of healthy speakers of the Saarbruecken voice disorder (SVD) database. Two machine learning algorithms (support vector machine (SVM) and feed forward neural network (NN)) were trained separately using the WST-based features, to perform two binary classification tasks (healthy vs. SD and healthy vs. RLNP) and one multi-class classification task (healthy vs. SD vs. RLNP). The results show that WST-based features outperformed state-of-the-art features in all three tasks. Furthermore, the best overall classification performance was achieved by the NN classifier trained using WST-based features.

Description

Keywords

MFCC, neural network, neurological voice disorder, spasmodic dysphonia, support vector machine, wavelet scattering transform, Spasmodic dysphonia, Neurological voice disorder, Neural network, Support vector machine, Wavelet scattering transform

Other note

Citation

Yagnavajjula, M, Mittapalle, K, Alku, P, Rao, K S & Mitra, P 2024, ' Automatic Classification of Neurological Voice Disorders Using Wavelet Scattering Features ', Speech Communication, vol. 157, 103040 . https://doi.org/10.1016/j.specom.2024.103040