Study of Formant Modification for Children ASR
No Thumbnail Available
Access rights
openAccess
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2020-05
Major/Subject
Mcode
Degree programme
Language
en
Pages
7429-7433
Series
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
Abstract
The performance of automatic speech recognition systems for children’s speech is known to suffer from the large variation and mismatch in the acoustic and linguistic attributes between children’s and adults’ speech. One of the various identified sources of mismatch is the difference in formant frequencies between adults and children. In this paper, we propose a formant modification method to mitigate differences between adults’ and children’s speech and to improve the performance of ASR for children. The explored technique gives a relative 27% improvement in system performance compared to a hybrid DNN-HMM baseline. We also compare the system performance with related speaker adaptation methods like vocal tract length normalization (VTLN) and speaking rate adapta- tion (SRA) and find that the proposed method gives improvements over them, as well. Combining the proposed method with VTLN and SRA results in a further reduction of WER. We also found that the proposed method performs well even for noisy speech.Description
Keywords
hildren speech recognition, Formant modification, DNN
Other note
Citation
Kathania, H, Kadiri, S, Alku, P & Kurimo, M 2020, Study of Formant Modification for Children ASR . in 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings ., 9053334, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, pp. 7429-7433, IEEE International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, 04/05/2020 . https://doi.org/10.1109/ICASSP40776.2020.9053334