Listening like a speech-training app: Expert and non-expert listeners’ goodness ratings of children’s speech
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Strömbergsson, Sofia | en_US |
dc.contributor.author | Fröjdh, Molly | en_US |
dc.contributor.author | Pettersson, Magdalena | en_US |
dc.contributor.author | Grósz, Tamás | en_US |
dc.contributor.author | Getman, Yaroslav | en_US |
dc.contributor.author | Kurimo, Mikko | en_US |
dc.contributor.department | Department of Information and Communications Engineering | en |
dc.contributor.groupauthor | Speech Recognition | en |
dc.contributor.organization | Karolinska Institutet | en_US |
dc.date.accessioned | 2024-06-20T08:16:18Z | |
dc.date.available | 2024-06-20T08:16:18Z | |
dc.date.issued | 2025 | en_US |
dc.description | Publisher Copyright: © 2024 The Author(s). Published with license by Taylor & Francis Group, LLC. | |
dc.description.abstract | Speech training apps are being developed that provide automatic feedback concerning children’s production of known target words, as a score on a 1–5 scale. However, this ‘goodness’ scale is still poorly understood. We investigated listeners’ ratings of ‘how many stars the app should provide as feedback’ on children’s utterances, and whether listener agreement is affected by clinical experience and/or access to anchor stimuli. In addition, we explored the association between goodness ratings and clinical measures of speech accuracy: the Percentage of Consonants Correct (PCC) and the Percentage of Phonemes Correct (PPC). Twenty speech-language pathologists and 20 non-expert listeners participated; half of the listeners in each group had access to anchor stimuli. The listeners rated 120 words, collected from children with and without speech sound disorder. Concerning reliability, intra-rater agreement was generally high, whereas inter-rater agreement was moderate. Access to anchor stimuli was associated with higher agreement, but only for non-expert listeners. Concerning the association between goodness ratings and the PCC/PPC, correlations were moderate for both listener groups, under both conditions. The results indicate that the task of rating goodness is difficult, regardless of clinical experience, and that access to anchor stimuli is insufficient for achieving reliable ratings. This raises concerns regarding the 1–5 rating scale as the means of feedback in speech training apps. More specific listener instructions, particularly regarding the intended context for the app, are suggested in the collection of human ratings underlying the development of speech training apps. Until then, alternative means of feedback should be preferred. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | 22 | |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Strömbergsson, S, Fröjdh, M, Pettersson, M, Grósz, T, Getman, Y & Kurimo, M 2025, 'Listening like a speech-training app: Expert and non-expert listeners’ goodness ratings of children’s speech', Clinical Linguistics and Phonetics, vol. 39, no. 2, pp. 144-165. https://doi.org/10.1080/02699206.2024.2355470 | en |
dc.identifier.doi | 10.1080/02699206.2024.2355470 | en_US |
dc.identifier.issn | 0269-9206 | |
dc.identifier.issn | 1464-5076 | |
dc.identifier.other | PURE UUID: 19fffe10-a35f-4bc1-8c13-c9070b9f9e75 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/19fffe10-a35f-4bc1-8c13-c9070b9f9e75 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85195454367&partnerID=8YFLogxK | |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/148759138/Listening_like_a_speech-training_app_Expert_and_non-expert_listeners_goodness_ratings_of_children_s_speech.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/129021 | |
dc.identifier.urn | URN:NBN:fi:aalto-202406204607 | |
dc.language.iso | en | en |
dc.publisher | Informa Healthcare | |
dc.relation.ispartofseries | Clinical Linguistics and Phonetics | en |
dc.relation.ispartofseries | Volume 39, issue 2, pp. 144-165 | en |
dc.rights | openAccess | en |
dc.subject.keyword | automatic assessment | en_US |
dc.subject.keyword | perceptual assessment | en_US |
dc.subject.keyword | Speech accuracy | en_US |
dc.subject.keyword | speech sound disorder | en_US |
dc.title | Listening like a speech-training app: Expert and non-expert listeners’ goodness ratings of children’s speech | en |
dc.type | A1 Original article in a scientific journal (A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä) | fi |
dc.type.version | publishedVersion |