Title: | Modeling Conversational Finnish for Automatic Speech Recognition Suomen puhekielen mallintaminen automaattista puheentunnistusta varten |
Author(s): | Enarvi, Seppo |
Date: | 2018 |
Language: | en |
Pages: | 117 + app. 73 |
Department: | Signaalinkäsittelyn ja akustiikan laitos Department of Signal Processing and Acoustics |
ISBN: | 978-952-60-7908-0 (electronic) 978-952-60-7907-3 (printed) |
Series: | Aalto University publication series DOCTORAL DISSERTATIONS, 52/2018 |
ISSN: | 1799-4942 (electronic) 1799-4934 (printed) 1799-4934 (ISSN-L) |
Supervising professor(s): | Kurimo, Mikko, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Thesis advisor(s): | Virpioja, Sami, Dr., Aalto University, Department of Signal Processing and Acoustics, Finland |
Subject: | Acoustics, Electrical engineering, Linguistics |
Keywords: | automatic speech recognition, language modeling, word classes, artificial neural networks, data collection, automaattinen puheentunnistus, kielen mallintaminen, sanaluokat, neuroverkot, tiedonkeruu |
Archive | yes |
|
|
Abstract:Automaattisen puheentunnistuksen tarkkuus on jatkuvasti parantunut viimeisten vuosikymmenien aikana. Aalto-yliopistossa on kehitetty automaattista puheentunnistusta suomen kielelle ja päästy hyvin pieniin virheprosentteihin selkeästi puhutun kirjakielen tunnistuksessa, esimerkiksi uutislähetyksistä. Luonnolliten keskustelujen tunnistaminen on paljon haastavampaa. Suomen puhekieli eroaa myös monella tavalla kirjakielestä, ja sen tunnistamiseen tarvitaan tietoaineistoa, jota ei aikaisemmin ole ollut saatavilla. |
|
Parts:[Publication 1]: Seppo Enarvi and Mikko Kurimo. A Novel Discriminative Method for Pruning Pronunciation Dictionary Entries. In Proceedings of the 7th International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Cluj-Napoca, Romania, pages 113–116, October 2013. Full text in Aaltodoc/Acris: http://urn.fi/URN:NBN:fi:aalto-201708036410. DOI: 10.1109/SpeD.2013.6682659 View at Publisher [Publication 2]: Seppo Enarvi and Mikko Kurimo. Studies on Training Text Selection for Conversational Finnish Language Modeling. In Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT), Heidelberg, Germany, pages 256–263, December 2013. Fulltext in Aaltodoc/Acris: http://urn.fi/URN:NBN:fi:aalto-201708036342. [Publication 3]: Mikko Kurimo, Seppo Enarvi, Ottokar Tilk, Matti Varjokallio, André Mansikkaniemi, and Tanel Alumäe. Modeling under-resourced languages for speech recognition. Language Resources and Evaluation, volume 51, issue 4, pages 961–987, December 2017. Fulltext in Aaltodoc/Acris: http://urn.fi/URN:NBN:fi:aalto-201708036363. DOI: 10.1007/s10579-016-9336-9 View at Publisher [Publication 4]: Seppo Enarvi and Mikko Kurimo. TheanoLM — An Extensible Toolkit for Neural Network Language Modeling. In Proceedings of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), San Francisco, CA, USA, pages 3052–3056, September 2016. Fulltext in Aaltodoc/Acris: http://urn.fi/URN:NBN:fi:aalto-201708036333. [Publication 5]: Seppo Enarvi, Peter Smit, Sami Virpioja, and Mikko Kurimo. Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 25, issue 11, pages 2085–2097, November 2017. Fulltext in Aaltodoc/Acris: http://urn.fi/URN:NBN:fi:aalto-201710157079. DOI: 10.1109/TASLP.2017.2743344 View at Publisher [Errata file]: Errata Seppo Enarvi DD-52/2018 Publications P1, P3, P4, P5 |
|
|
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Page content by: Aalto University Learning Centre | Privacy policy of the service | About this site