Speech biomarkers for automated depression level detection

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorAharonson, Vered
dc.contributor.authorCoopoo, Verushen
dc.contributor.authorCarlson, Craig S.
dc.contributor.authorPostema, Michiel
dc.contributor.departmentDepartment of Electrical Engineering and Automationen
dc.contributor.groupauthorComputational Electromechanicsen
dc.contributor.organizationUniversity of the Witwatersrand, Johannesburg
dc.contributor.organizationTampere University
dc.date.accessioned2025-11-19T09:42:46Z
dc.date.available2025-11-19T09:42:46Z
dc.date.issued2025-09-12
dc.description.abstractThis study investigates the contribution of speech audio and speech verbal content in the automated detection of depression levels. Recordings from the Distress Analysis Interview Corpus Wizard-of-Oz dataset and the depression severity labels of the recordings were used to extract acoustic features. A transcription of the recordings was used to extract textual features. The acoustic set included prosodic, cepstral, and glottal feature categories. The textual features consisted of semantic and syntactic categories. Mutual information feature selection, followed by a random forest classifier identified the set of features which optimised the depression level classification. The optimised binary classification of depression from non-depressed yielded an accuracy of 0.89 and an F1 score of 0.87. A classification of the five depression levels yielded an accuracy of 0.79 and an F1 score of 0.72. The ratio of importance scores of acoustic to textual of the speech acoustic features was greater than 3:1. Our method thus provided acoustic and textual indicators in depressed speech. These might increase the acceptability of automated depression detection by healthcare professionals. Our initial findings indicate a select set of features that can improve the effectiveness of automated depression detection and monitoring tools.en
dc.description.versionPeer revieweden
dc.format.extent4
dc.format.mimetypeapplication/pdf
dc.identifier.citationAharonson, V, Coopoo, V, Carlson, C S & Postema, M 2025, 'Speech biomarkers for automated depression level detection', Current Directions in Biomedical Engineering, vol. 11, no. 1, pp. 282-285. https://doi.org/10.1515/cdbme-2025-0172en
dc.identifier.doi10.1515/cdbme-2025-0172
dc.identifier.issn2364-5504
dc.identifier.otherPURE UUID: 7822e6e8-092a-44e0-afdf-44f714f3aafe
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/7822e6e8-092a-44e0-afdf-44f714f3aafe
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/201053497/Speech_biomarkers_for_automated_depression_level_detection.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/140681
dc.identifier.urnURN:NBN:fi:aalto-202511198822
dc.language.isoenen
dc.publisherDe Gruyter
dc.relation.ispartofseriesCurrent Directions in Biomedical Engineeringen
dc.relation.ispartofseriesVolume 11, issue 1, pp. 282-285en
dc.rightsopenAccessen
dc.rightsCC BY
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleSpeech biomarkers for automated depression level detectionen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Speech_biomarkers_for_automated_depression_level_detection.pdf
Size:
1.43 MB
Format:
Adobe Portable Document Format