Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding

Field | Value | Language
dc.contributor | Aalto-yliopisto | fi
dc.contributor | Aalto University | en
dc.contributor.author | Das, Sneha | en_US
dc.contributor.author | Bäckström, Tom | en_US
dc.contributor.department | Department of Signal Processing and Acoustics | en
dc.contributor.groupauthor | Speech Communication Technology | en
dc.date.accessioned | 2018-12-10T10:36:16Z |
dc.date.available | 2018-12-10T10:36:16Z |
dc.date.issued | 2018-09 | en_US
dc.description.abstract | Advanced coding algorithms yield high-quality signals with good coding efficiency within their target bitrate ranges, but their performance suffers outside the target range. At lower bitrates, performance degrades because the decoded signals are sparse, which gives the signal a perceptually muffled and distorted character. Standard codecs reduce such distortions by applying noise filling and post-filtering methods. In this paper, we propose a post-processing method based on modeling the inherent time-frequency correlation in the log-magnitude spectrum. The goal is to improve the perceptual SNR of the decoded signals and to reduce the distortions caused by signal sparsity. Objective measures show an average improvement of 1.5 dB for input perceptual SNR in the range 4 to 18 dB. The improvement is especially prominent in components that had been quantized to zero. | en
dc.description.version | Peer reviewed | en
dc.format.extent | 5 |
dc.format.mimetype | application/pdf | en_US
dc.identifier.citation | Das, S & Bäckström, T 2018, Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding. in Interspeech: Annual Conference of the International Speech Communication Association, 1027, Interspeech, International Speech Communication Association (ISCA), pp. 3543-3547, Interspeech, Hyderabad, India, 02/09/2018. https://doi.org/10.21437/Interspeech.2018-1027 | en
dc.identifier.doi | 10.21437/Interspeech.2018-1027 | en_US
dc.identifier.issn | 1990-9772 |
dc.identifier.other | PURE UUID: fa7536e0-8772-4617-bf68-af6f77d4df64 | en_US
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/fa7536e0-8772-4617-bf68-af6f77d4df64 | en_US
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/27812283/ELEC_das_et_al_Postfiltering_Using_Interspeech.pdf | en_US
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/35370 |
dc.identifier.urn | URN:NBN:fi:aalto-201812106385 |
dc.language.iso | en | en
dc.relation.ispartof | Interspeech | en
dc.relation.ispartofseries | Interspeech: Annual Conference of the International Speech Communication Association | en
dc.relation.ispartofseries | pp. 3543-3547 | en
dc.relation.ispartofseries | Interspeech | en
dc.rights | openAccess | en
dc.title | Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding | en
dc.type | A4 Artikkeli konferenssijulkaisussa (A4 Article in a conference publication) | fi
dc.type.version | publishedVersion |
