Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages

Access rights

openAccess
publishedVersion

A4 Article in conference proceedings

Date

2021-05-01

Language

en

Pages

6

Series

Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 345-350, Linköping Electronic Conference Proceedings ; 178, NEALT Proceedings Series ; Volume 45

Abstract

Forced alignment is an effective way to speed up linguistic research. However, most forced aligners are language-dependent, and under-resourced languages rarely have enough data to train an acoustic model for an aligner. We present a new Finnish grapheme-based forced aligner and demonstrate its performance by aligning multiple Uralic languages, plus English as an unrelated language. We show that even a simple grapheme-to-phoneme mapping created by a non-expert can result in useful word alignments.

Citation

Leinonen, J, Virpioja, S & Kurimo, M 2021, Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages. in Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Linköping Electronic Conference Proceedings, no. 178, NEALT Proceedings Series, vol. 45, Linköping University Electronic Press, pp. 345-350, Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 31/05/2021. <https://www.aclweb.org/anthology/2021.nodalida-main.36.pdf>