Title: | Robust and Efficient Methods for Distributed Speech Processing - Perspectives on Coding, Enhancement and Privacy |
Author(s): | Das, Sneha |
Date: | 2021 |
Language: | en |
Pages: | 70 + app. 70 |
Department: | Department of Signal Processing and Acoustics (Signaalinkäsittelyn ja akustiikan laitos) |
ISBN: | 978-952-64-0576-6 (electronic) 978-952-64-0575-9 (printed) |
Series: | Aalto University publication series DOCTORAL DISSERTATIONS, 152/2021 |
ISSN: | 1799-4942 (electronic) 1799-4934 (printed) 1799-4934 (ISSN-L) |
Supervising professor(s): | Bäckström, Tom, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Thesis advisor(s): | Bäckström, Tom, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland |
Subject: | Electrical engineering |
Keywords: | speech, speech-coding, privacy, postfiltering, speech-interfaces |
Archive | yes |
Abstract: Computers and technology are so deeply embedded in our lives today that people spend a considerable part of their day communicating with technology. Conventional modes of human-technology interaction have predominantly been device-centric, which requires users to be in the vicinity of the device. This can become cumbersome as the number of personal devices owned by an individual increases. A recent positive trend is the evolution towards user-centric modes of communication with technology, enabled by the growing adoption of speech user interfaces. Furthermore, developments in virtual and ad hoc microphone networks and sensor technology are supporting this evolution. As a result, speech processing methods are moving towards a more distributed and collaborative approach. However, this has resulted in new challenges and technical problems in managing speech enhancement, coding and user privacy in acoustic sensor networks.
Description: Defence is held on 26.11.2021, 12:00 – 15:00
Parts:
[Publication 1]: Sneha Das, Tom Bäckström. Postfiltering with Complex Spectral Correlations for Speech and Audio Coding. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Hyderabad, pp. 3538-3542, September 2018. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201812106350. DOI: 10.21437/Interspeech.2018-1026
[Publication 2]: Sneha Das, Tom Bäckström. Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Hyderabad, pp. 3543-3547, September 2018. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201812106385. DOI: 10.21437/Interspeech.2018-1027
[Publication 3]: Sneha Das, Tom Bäckström, Guillaume Fuchs. Fundamental Frequency Model for Postfiltering at Low Bitrates in a Transform-Domain Speech and Audio Codec. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Shanghai, China, pp. 2837-2841, October 2020. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202101251402. DOI: 10.21437/Interspeech.2020-1067
[Publication 4]: Sneha Das, Tom Bäckström. Enhancement by postfiltering for speech and audio coding in ad-hoc sensor networks. The Journal of the Acoustical Society of America Express Letters, pp. 015206, January 2021. DOI: 10.1121/10.0003208
[Publication 5]: Sneha Das, Tom Bäckström. Postfiltering Using Source Modeling for Speech and Audio Coding in Ad Hoc Sensor Networks. Submitted to IEEE Access, 2021.
[Publication 6]: Pablo Perez Zarazaga, Sneha Das, Tom Bäckström, V. V. Vidyadhara Raju, Anil Kumar Vuppala. Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Graz, Austria, pp. 3720-3724, September 2019. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-201909255496.
[Publication 7]: Anna Leschanowsky, Sneha Das, Tom Bäckström. Perception of Privacy Measured in the Crowd - Paired Comparison on the Effect of Background Noises. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Shanghai, China, pp. 4651-4654, October 2020. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202101251597. DOI: 10.21437/Interspeech.2020-2299
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for your own personal use. Commercial use is prohibited.