Postfiltering with Complex Spectral Correlations for Speech and Audio Coding

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.author Das, Sneha
dc.contributor.author Bäckström, Tom
dc.date.accessioned 2018-12-10T10:34:16Z
dc.date.available 2018-12-10T10:34:16Z
dc.date.issued 2018-09
dc.identifier.citation Das, S & Bäckström, T 2018, Postfiltering with Complex Spectral Correlations for Speech and Audio Coding. in Interspeech: Annual Conference of the International Speech Communication Association., 1026, Interspeech, International Speech Communication Association, pp. 3538-3542, Interspeech, Hyderabad, India, 02/09/2018. DOI: 10.21437/Interspeech.2018-1026 en
dc.identifier.issn 1990-9772
dc.identifier.other PURE UUID: ebd19b9b-6465-4afe-902c-a3f710ad9f72
dc.identifier.other PURE ITEMURL: https://research.aalto.fi/en/publications/postfiltering-with-complex-spectral-correlations-for-speech-and-audio-coding(ebd19b9b-6465-4afe-902c-a3f710ad9f72).html
dc.identifier.other PURE FILEURL: https://research.aalto.fi/files/27812371/ELEC_das_et_al_Postfiltering_with_complexd_Interspeech.pdf
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/35335
dc.description.abstract State-of-the-art speech codecs achieve a good compromise between quality, bitrate and complexity. However, retaining performance outside the target bitrate range remains challenging. To improve performance, many codecs use pre- and post-filtering techniques to reduce the perceptual effect of quantization noise. In this paper, we propose a postfiltering method that uses the complex spectral correlations of speech signals to attenuate quantization noise. Conventional speech codecs cannot transmit information with temporal dependencies, because transmission errors could result in severe error propagation. We therefore model the correlations offline and employ them at the decoder, removing the need to transmit any side information. Objective evaluation indicates that the context-based post-filter improves the perceptual SNR by 4 dB on average relative to the noisy signal, and by 2 dB on average relative to the conventional Wiener filter. These results are confirmed by an improvement of up to 30 MUSHRA points in a subjective listening test. en
dc.format.extent 3538-3542
dc.format.mimetype application/pdf
dc.language.iso en en
dc.relation.ispartof Interspeech en
dc.relation.ispartofseries Interspeech en
dc.rights openAccess en
dc.subject.other 113 Computer and information sciences en
dc.title Postfiltering with Complex Spectral Correlations for Speech and Audio Coding en
dc.type A4 Article in a conference publication en
dc.description.version Peer reviewed en
dc.contributor.department Department of Signal Processing and Acoustics
dc.subject.keyword 113 Computer and information sciences
dc.identifier.urn URN:NBN:fi:aalto-201812106350
dc.identifier.doi 10.21437/Interspeech.2018-1026
dc.type.version publishedVersion

