Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorEnarvi, Seppoen_US
dc.contributor.authorSmit, Peteren_US
dc.contributor.authorVirpioja, Samien_US
dc.contributor.authorKurimo, Mikkoen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorCentre of Excellence in Computational Inference, COINen
dc.contributor.groupauthorSpeech Recognitionen
dc.date.accessioned2017-10-15T20:39:42Z
dc.date.available2017-10-15T20:39:42Z
dc.date.issued2017-11en_US
dc.description.abstractToday, the vocabulary size for language models in large vocabulary speech recognition is typically several hundreds of thousands of words. While this is already sufficient in some applications, the out-of-vocabulary words are still limiting the usability in others. In agglutinative languages the vocabulary for conversational speech should include millions of word forms to cover the spelling variations due to colloquial pronunciations, in addition to the word compounding and inflections. Very large vocabularies are also needed, for example, when the recognition of rare proper names is important.en
dc.description.versionPeer revieweden
dc.format.extent13
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationEnarvi, S, Smit, P, Virpioja, S & Kurimo, M 2017, 'Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies', IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 11, pp. 2085-2097. https://doi.org/10.1109/TASLP.2017.2743344en
dc.identifier.doi10.1109/TASLP.2017.2743344en_US
dc.identifier.issn2329-9290
dc.identifier.otherPURE UUID: 74066940-5e5d-4208-af53-e61615e0603cen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/74066940-5e5d-4208-af53-e61615e0603cen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/15343327/taslp2017.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/28219
dc.identifier.urnURN:NBN:fi:aalto-201710157079
dc.language.isoenen
dc.publisherIEEE
dc.relation.ispartofseriesIEEE/ACM Transactions on Audio, Speech, and Language Processingen
dc.relation.ispartofseriesVolume 25, issue 11, pp. 2085-2097en
dc.rightsopenAccessen
dc.rights.copyright(c) 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.en_US
dc.titleAutomatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabulariesen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionacceptedVersion

Files