Spectral modification for recognition of children’s speech under mismatched conditions
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
7
Series
Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 94–100, Linköping electronic conference proceedings ; 178
Abstract
n this paper, we propose spectral modification by sharpening formants and by reducing the spectral tilt to recognize children’s speech by automatic speech recognition (ASR) systems developed using adult speech. In this type of mismatched condition, the ASR performance is degraded due to the acoustic and linguistic mismatch in the attributes between children and adult speakers. The proposed method is used to improve the speech intelligibility to enhance the children’s speech recognition using an acoustic model trained on adult speech. In the experiments, WSJCAM0 and PFSTAR are used as databases for adults’ and children’s speech, respectively. The proposed technique gives a significant improvement in the context of the DNN-HMM-based ASR. Furthermore, we validate the robustness of the technique by showing that it performs well also in mismatched noise conditions.Description
Keywords
Other note
Citation
Kathania, H, Kadiri, S, Alku, P & Kurimo, M 2021, Spectral modification for recognition of children’s speech under mismatched conditions. in Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Linköping electronic conference proceedings, no. 178, Linköping University Electronic Press, Sweden, pp. 94–100, Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 31/05/2021. < https://www.aclweb.org/anthology/2021.nodalida-main.10 >