Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies
Loading...
Access rights
openAccess
(c) 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
acceptedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Date
2017-11
Major/Subject
Mcode
Degree programme
Language
en
Pages
13
Series
IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 25, issue 11, pp. 2085-2097
Abstract
Today, the vocabulary size for language models in large vocabulary speech recognition is typically several hundreds of thousands of words. While this is already sufficient in some applications, the out-of-vocabulary words are still limiting the usability in others. In agglutinative languages the vocabulary for conversational speech should include millions of word forms to cover the spelling variations due to colloquial pronunciations, in addition to the word compounding and inflections. Very large vocabularies are also needed, for example, when the recognition of rare proper names is important.Description
Keywords
Other note
Citation
Enarvi, S, Smit, P, Virpioja, S & Kurimo, M 2017, ' Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies ', IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 11, pp. 2085-2097 . https://doi.org/10.1109/TASLP.2017.2743344