Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorNonavinakere Prabhakera, Narendraen_US
dc.contributor.authorAlku, Paavoen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorSpeech Communication Technologyen
dc.date.accessioned2020-06-25T08:35:01Z
dc.date.available2020-06-25T08:35:01Z
dc.date.embargoinfo:eu-repo/date/embargoEnd/2022-02-01en_US
dc.date.issued2021-01en_US
dc.description.abstractIn clinical practice, assessment of intelligibility in speakers with dysarthria is performed by speech-language pathologists through auditory perceptual tests which demand patients’ presence at hospital and involve time-consuming examinations. Frequent clinical monitoring can be costly and logistically inconvenient both for patients and medical experts. Here, we aim to automate the procedure of assessment of intelligibility in dysarthric speakers with an objective, speech-based method that can be employed in a telescreening application. The proposed method predicts the level of intelligibility in dysarthric speakers using four levels of speech intelligibility (very low, low, mediocre and high). The study compares several automatic methods to assess the intelligibility level in speakers with dysarthria by utilizing information generated at the level of the vocal folds through glottal features and by using coded telephone speech (i.e. speech that is used in telescreening applications). In addition to the glottal features, the openS-MILE features are used as acoustic baseline features. Using features obtained from coded speech utterances and the corresponding intelligibility level labels, multiclass-support vector machine (SVM) classifiers are trained. A separate set of multiclass-SVMs are trained using both individual glottal and acoustic features as well as their combinations. Coded telephone speech is generated with the adaptive multi-rate codec with two operational bandwidths (narrowband and wideband), from utterances of an open database of dysarthric speech (Universal Access-Speech). Experimental results showed good classification accuracies for the glottal features, indicating their effectiveness in the intelligibility level assessment in speakers with dysarthria even in the challenging coded condi-tion. Improvement in classification accuracy was obtained when the glottal features were combined with the openSMILE acoustic features, which validate the complimentary nature of the glottal features.en
dc.description.versionPeer revieweden
dc.format.extent14
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationNonavinakere Prabhakera, N & Alku, P 2021, 'Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features', Computer Speech and Language, vol. 65, 101117. https://doi.org/10.1016/j.csl.2020.101117en
dc.identifier.doi10.1016/j.csl.2020.101117en_US
dc.identifier.issn0885-2308
dc.identifier.issn1095-8363
dc.identifier.otherPURE UUID: 0664365c-ff33-4382-94e9-9c8ce502dd39en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/0664365c-ff33-4382-94e9-9c8ce502dd39en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/43187100/Automatic_assessment_of_intelligibility_in_speakers_with_dysarthria.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/45064
dc.identifier.urnURN:NBN:fi:aalto-202006254021
dc.language.isoenen
dc.publisherElsevier
dc.relation.ispartofseriesComputer Speech and Languageen
dc.relation.ispartofseriesVolume 65en
dc.rightsopenAccessen
dc.subject.keywordDysarthric speechen_US
dc.subject.keywordglottal featuresen_US
dc.subject.keywordglottal inverse filteringen_US
dc.subject.keywordglottal source estimationen_US
dc.subject.keywordopenSMILEen_US
dc.subject.keywordsupport vector machineen_US
dc.subject.keywordcoded telephone speechen_US
dc.titleAutomatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal featuresen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionacceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Automatic_assessment_of_intelligibility_in_speakers_with_dysarthria.pdf
Size:
722.17 KB
Format:
Adobe Portable Document Format