Speech biomarkers for automated depression level detection
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Aharonson, Vered | |
| dc.contributor.author | Coopoo, Verushen | |
| dc.contributor.author | Carlson, Craig S. | |
| dc.contributor.author | Postema, Michiel | |
| dc.contributor.department | Department of Electrical Engineering and Automation | en |
| dc.contributor.groupauthor | Computational Electromechanics | en |
| dc.contributor.organization | University of the Witwatersrand, Johannesburg | |
| dc.contributor.organization | Tampere University | |
| dc.date.accessioned | 2025-11-19T09:42:46Z | |
| dc.date.available | 2025-11-19T09:42:46Z | |
| dc.date.issued | 2025-09-12 | |
| dc.description.abstract | This study investigates the contribution of speech audio and verbal content to the automated detection of depression levels. The study used recordings from the Distress Analysis Interview Corpus Wizard-of-Oz dataset together with their depression severity labels. Acoustic features were extracted from the recordings, and textual features from a transcription of the recordings. The acoustic set included prosodic, cepstral, and glottal feature categories. The textual features consisted of semantic and syntactic categories. Mutual information feature selection, followed by a random forest classifier, identified the set of features that optimised the depression level classification. The optimised binary classification (depressed versus non-depressed) yielded an accuracy of 0.89 and an F1 score of 0.87. A classification of the five depression levels yielded an accuracy of 0.79 and an F1 score of 0.72. The ratio of importance scores of acoustic to textual features was greater than 3:1. Our method thus provided acoustic and textual indicators in depressed speech. These might increase the acceptability of automated depression detection by healthcare professionals. Our initial findings indicate a select set of features that can improve the effectiveness of automated depression detection and monitoring tools. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 4 | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Aharonson, V, Coopoo, V, Carlson, C S & Postema, M 2025, 'Speech biomarkers for automated depression level detection', Current Directions in Biomedical Engineering, vol. 11, no. 1, pp. 282-285. https://doi.org/10.1515/cdbme-2025-0172 | en |
| dc.identifier.doi | 10.1515/cdbme-2025-0172 | |
| dc.identifier.issn | 2364-5504 | |
| dc.identifier.other | PURE UUID: 7822e6e8-092a-44e0-afdf-44f714f3aafe | |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/7822e6e8-092a-44e0-afdf-44f714f3aafe | |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/201053497/Speech_biomarkers_for_automated_depression_level_detection.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/140681 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202511198822 | |
| dc.language.iso | en | en |
| dc.publisher | De Gruyter | |
| dc.relation.ispartofseries | Current Directions in Biomedical Engineering | en |
| dc.relation.ispartofseries | Volume 11, issue 1, pp. 282-285 | en |
| dc.rights | openAccess | en |
| dc.rights | CC BY | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.title | Speech biomarkers for automated depression level detection | en |
| dc.type | A1 Original article in a scientific journal | en |
| dc.type.version | publishedVersion | |
Files
Original bundle
- Name: Speech_biomarkers_for_automated_depression_level_detection.pdf
- Size: 1.43 MB
- Format: Adobe Portable Document Format