Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A4 Artikkeli konferenssijulkaisussa
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2021-05-01
Major/Subject
Mcode
Degree programme
Language
en
Pages
6
Series
Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 345-350, Linköping Electronic Conference Proceedings ; 178, NEALT Proceedings Series ; Volume 45
Abstract
Forced alignment is an effective process to speed up linguistic research. However, most forced aligners are language-dependent, and under-resourced languages rarely have enough resources to train an acoustic model for an aligner. We present a new Finnish grapheme-based forced aligner and demonstrate its performance by aligning multiple Uralic languages and English as an unrelated language. We show that even a simple non-expert created grapheme-to-phoneme mapping can result in useful word alignments.Description
Keywords
Other note
Citation
Leinonen, J, Virpioja, S & Kurimo, M 2021, Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages . in Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa) . Linköping Electronic Conference Proceedings, no. 178, NEALT Proceedings Series, vol. 45, Linköping University Electronic Press, pp. 345-350, Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 31/05/2021 . < https://www.aclweb.org/anthology/2021.nodalida-main.36.pdf >