Towards robust heart failure detection in digital telephony environments by utilizing transformer-based codec inversion
Loading...
Access rights
openAccess
CC BY
CC BY
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
15
Series
Speech Communication, Volume 173
Abstract
This study introduces the Codec Transformer Network (CTN) to enhance the reliability of automatic heart failure (HF) detection from coded telephone speech by addressing codec-related challenges in digital telephony. The study specifically addresses the codec mismatch between training and inference in HF detection. CTN is designed to map the mel-spectrogram representations of encoded speech signals back to their original, non-encoded forms, thereby recovering HF-related discriminative information. The effectiveness of CTN is demonstrated in conjunction with three HF detectors, based on Support Vector Machine, Random Forest, and K-Nearest Neighbors classifiers. The results show that CTN effectively retrieves the discriminative information between patients and controls, and performs comparably to or better than a baseline approach, based on multi-condition training.Description
Publisher Copyright: © 2025 The Authors
Other note
Citation
Tirronen, S, Javanmardi, F, Pohjalainen, H, Kadiri, S R, Mittapalle, K R, Helkkula, P, Kaitue, K, Minkkinen, M, Tolppanen, H, Nieminen, T & Alku, P 2025, 'Towards robust heart failure detection in digital telephony environments by utilizing transformer-based codec inversion', Speech Communication, vol. 173, 103279. https://doi.org/10.1016/j.specom.2025.103279