Postfiltering with Complex Spectral Correlations for Speech and Audio Coding

Loading...
Thumbnail Image
Access rights
openAccess
Journal Title
Journal ISSN
Volume Title
Conference article in proceedings
This publication is imported from Aalto University research portal.
View publication in the Research portal
View/Open full text file from the Research portal
Date
2018-09
Major/Subject
Mcode
Degree programme
Language
en
Pages
3538-3542
Series
Interspeech
Abstract
State-of-the-art speech codecs achieve a good compromise between quality, bitrate and complexity. However, retaining performance outside the target bitrate range remains challenging. To improve performance, many codecs use pre- and post-filtering techniques to reduce the perceptual effect of quantization-noise. In this paper, we propose a postfiltering method to attenuate quantization noise which uses the complex spectral correlations of speech signals. Since conventional speech codecs cannot transmit information with temporal dependencies as transmission errors could result in severe error propagation, we model the correlation offline and employ them at the decoder, hence removing the need to transmit any side information. Objective evaluation indicates an average 4 dB improvement in the perceptual SNR of signals using the context-based post-filter, with respect to the noisy signal and an average 2 dB improvement relative to the conventional Wiener filter. These results are confirmed by an improvement of up to 30 MUSHRA points in a subjective listening test.
Description
Keywords
Other note
Citation
Das , S & Bäckström , T 2018 , Postfiltering with Complex Spectral Correlations for Speech and Audio Coding . in Interspeech : Annual Conference of the International Speech Communication Association . , 1026 , Interspeech , International Speech Communication Association (ISCA) , pp. 3538-3542 , Interspeech , Hyderabad , India , 02/09/2018 . https://doi.org/10.21437/Interspeech.2018-1026