Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Major/Subject

Mcode

Degree programme

Language

en

Pages

6

Series

Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 345-350, Linköping Electronic Conference Proceedings ; 178, NEALT Proceedings Series ; Volume 45

Abstract

Forced alignment is an effective process to speed up linguistic research. However, most forced aligners are language-dependent, and under-resourced languages rarely have enough resources to train an acoustic model for an aligner. We present a new Finnish grapheme-based forced aligner and demonstrate its performance by aligning multiple Uralic languages and English as an unrelated language. We show that even a simple non-expert created grapheme-to-phoneme mapping can result in useful word alignments.

Description

Keywords

Other note

Citation

Leinonen, J, Virpioja, S & Kurimo, M 2021, Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages. in Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Linköping Electronic Conference Proceedings, no. 178, NEALT Proceedings Series, vol. 45, Linköping University Electronic Press, pp. 345-350, Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 31/05/2021. < https://www.aclweb.org/anthology/2021.nodalida-main.36.pdf >