Title: | Modern subword-based models for automatic speech recognition |
Author(s): | Smit, Peter |
Date: | 2019 |
Language: | en |
Pages: | 62 + app. 136 |
Department: | Signaalinkäsittelyn ja akustiikan laitos Department of Signal Processing and Acoustics |
ISBN: | 978-952-60-8566-1 (electronic) 978-952-60-8565-4 (printed) |
Series: | Aalto University publication series DOCTORAL DISSERTATIONS, 97/2019 |
ISSN: | 1799-4942 (electronic) 1799-4934 (printed) 1799-4934 (ISSN-L) |
Supervising professor(s): | Kurimo, Mikko, Assoc. Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Thesis advisor(s): | Virpioja, Sami, Dr., Aalto University, Department of Signal Processing and Acoustics, Finland |
Subject: | Electrical engineering |
Keywords: | automatic speech recognition, language modeling, subword models |
Archive | yes |
|
|
Abstract:In today's society, speech recognition systems have reached a mass audience, especially in the field of personal assistants such as Amazon Alexa or Google Home. Yet, this does not mean that speech recognition has been solved. On the contrary, for many domains, tasks, and languages such systems do not exist.
|
|
Parts:[Publication 1]: Sami Virpioja, Peter Smit, Stig-Arne Grönroos, Mikko Kurimo. Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline. Full text in Acris/Aaltodoc: http://urn.fi/URN:ISBN:978-952-60-5501-5. Aalto University publication series SCIENCE + TECHNOLOGY, 25/2013.[Publication 2]: Peter Smit, Sami Virpioja, Mikko Kurimo. Improved Subword Modeling for WFST-Based Speech Recognition. In Annual Conference of the International Speech Communication Association (INTERSPEECH), Stockholm, pages 2551–2555 , August 2017. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201710157202. DOI: 10.21437/Interspeech.2017-103 View at Publisher [Publication 3]: Peter Smit, Juho Leinonen, Kristiina Jokinen, Mikko Kurimo. Automatic Speech Recognition for Northern Sámi with comparison to other Uralic Languages. In Proceedings of the Second International Workshop on Computational Linguistics for Uralic Languages, Szeged, pages 80–91, January 2016. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201701191109. [Publication 4]: Juho Leinonen, Peter Smit, Sami Virpioja, Mikko Kurimo. New Baseline in Automatic Speech Recognition for Northern Sámi. In Fourth International Workshop on Computational Linguistics for Uralic Languages, Helsinki, pages 89–99, January 2018. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201802091229. DOI: 10.18653/v1/W18-0208 View at Publisher [Publication 5]: Peter Smit, Siva Reddy Gangireddy, Seppo Enarvi, Sami Virpioja, Mikko Kurimo. Character-based units for Unlimited Vocabulary Continuous Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, pages 149–156, December 2017. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201802091465. DOI: 10.1109/ASRU.2017.8268929 View at Publisher [Publication 6]: Peter Smit, Siva Reddy Gangireddy, Seppo Enarvi, Sami Virpioja, Mikko Kurimo. Aalto system for the 2017 Arabic multi-genre broadcast challenge. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, pages 338–345, December 2017. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201802091512. DOI: 10.1109/ASRU.2017.8268955 View at Publisher [Publication 7]: Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo. Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 25, issue 11, pages 2085–2097, November 2017. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201710157079. DOI: 10.1109/TASLP.2017.2743344 View at Publisher [Publication 8]: Peter Smit, Sami Virpioja, Mikko Kurimo. Advances in Subword-based HMM-DNN Speech Recognition Across Languages. Submitted to Language Resources and Evaluation, 29 November 2018. |
|
|
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Page content by: Aalto University Learning Centre | Privacy policy of the service | About this site