LLMs’ morphological analyses of complex FST-generated Finnish words

dc.contributor: Aalto-yliopisto (fi)
dc.contributor: Aalto University (en)
dc.contributor.author: Moisio, Anssi
dc.contributor.author: Creutz, Mathias
dc.contributor.author: Kurimo, Mikko
dc.contributor.department: Department of Information and Communications Engineering (en)
dc.contributor.department: Tietojenkäsittelytieteen laitos (en)
dc.contributor.editor: Kuribayashi, Tatsuki
dc.contributor.editor: Rambelli, Giulia
dc.contributor.editor: Takmaz, Ece
dc.contributor.editor: Wicke, Philipp
dc.contributor.editor: Oseki, Yohei
dc.contributor.groupauthor: Speech Recognition (en)
dc.date.accessioned: 2025-01-15T06:36:38Z
dc.date.available: 2025-01-15T06:36:38Z
dc.date.issued: 2024
dc.description: Publisher Copyright: ©2024 Association for Computational Linguistics.
dc.description.abstract: Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbo has some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely. (en)
dc.description.version: Peer reviewed (en)
dc.format.extent: 13
dc.format.mimetype: application/pdf
dc.identifier.citation: Moisio, A, Creutz, M & Kurimo, M 2024, LLMs’ morphological analyses of complex FST-generated Finnish words. in T Kuribayashi, G Rambelli, E Takmaz, P Wicke & Y Oseki (eds), CMCL 2024 - 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop. CMCL 2024 - 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop, Association for Computational Linguistics, pp. 242-254, Workshop on Cognitive Modeling and Computational Linguistics, Bangkok, Thailand, 15/08/2024. https://doi.org/10.18653/v1/2024.cmcl-1.21 (en)
dc.identifier.doi: 10.18653/v1/2024.cmcl-1.21
dc.identifier.isbn: 979-8-89176-143-8
dc.identifier.other: PURE UUID: f2b27caa-b8ab-46fe-bfdf-aedbbd683c0c
dc.identifier.other: PURE ITEMURL: https://research.aalto.fi/en/publications/f2b27caa-b8ab-46fe-bfdf-aedbbd683c0c
dc.identifier.other: PURE LINK: http://www.scopus.com/inward/record.url?scp=85204300266&partnerID=8YFLogxK
dc.identifier.other: PURE FILEURL: https://research.aalto.fi/files/170078014/2024.cmcl-1.21.pdf
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/132957
dc.identifier.urn: URN:NBN:fi:aalto-202501151250
dc.language.iso: en (en)
dc.relation.ispartof: Workshop on Cognitive Modeling and Computational Linguistics (en)
dc.relation.ispartofseries: CMCL 2024 - 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop (en)
dc.relation.ispartofseries: pp. 242-254 (en)
dc.rights: openAccess (en)
dc.rights: CC BY
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: LLMs’ morphological analyses of complex FST-generated Finnish words (en)
dc.type: A4 Artikkeli konferenssijulkaisussa (fi)
dc.type.version: publishedVersion
