Oversampling, Augmentation and Curriculum Learning for Speaking Assessment with Limited Training Data
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
5
Series
Interspeech 2024, pp. 4019-4023, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Abstract
Automated assessment systems for spontaneous speech are an increasingly important component in language proficiency tests and learning platforms. These systems have seen remarkable development in recent years, driven by advances in self-supervised learning. Nevertheless, in languages such as Finnish and Finland Swedish, their performance is still limited by the low-resource and imbalance nature of their data. To alleviate these issues, this work evaluates two data-level methods: oversampling and curriculum learning. Our results reveal that combining these methods results in the greatest boost to model performance, achieved without additional data or modification to the model structure.Description
Publisher Copyright: © 2024 International Speech Communication Association. All rights reserved.
Other note
Citation
Lun, T M, Voskoboinik, E, Al-Ghezi, R, Grosz, T & Kurimo, M 2024, Oversampling, Augmentation and Curriculum Learning for Speaking Assessment with Limited Training Data. in Interspeech 2024. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, International Society for Computers and Their Applications (ISCA), pp. 4019-4023, Interspeech, Kos Island, Greece, 01/09/2024. https://doi.org/10.21437/Interspeech.2024-760