Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorBackstrom, Tomen_US
dc.contributor.authorFischer, Johannesen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.organizationFriedrich-Alexander University Erlangen-Nürnbergen_US
dc.date.accessioned2018-08-21T13:43:12Z
dc.date.available2018-08-21T13:43:12Z
dc.date.issued2018-01en_US
dc.description.abstractEfficient coding of speech and audio in a distributed system requires that quantization errors across nodes are uncorrelated. Yet with conventional methods at low bitrates, quantization levels become increasingly sparse, which does not correspond to the distribution of the input signal and importantly, also reduces coding efficiency in a distributed system. We have recently proposed a distributed speech and audio codec design which applies quantization in a randomized domain such that quantization errors are randomly rotated in the output domain. Similar to dithering, this ensures that quantization errors across nodes are uncorrelated and coding efficiency is retained. In this paper we improve this approach by proposing faster randomization methods, with a computational complexity O(N log N). Presented experiments demonstrate that the proposed randomizations yield uncorrelated signals, that perceptual quality is competitive and that the complexity of the proposed methods is feasible for practical applications.en
dc.description.versionPeer revieweden
dc.format.extent11
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationBackstrom, T & Fischer, J 2018, 'Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio', IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 26, no. 1, pp. 19-30. https://doi.org/10.1109/TASLP.2017.2757601en
dc.identifier.doi10.1109/TASLP.2017.2757601en_US
dc.identifier.issn2329-9290
dc.identifier.issn2329-9304
dc.identifier.otherPURE UUID: 238136e5-8df9-4d92-a9e8-4edebba32aa6en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/238136e5-8df9-4d92-a9e8-4edebba32aa6en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/27158975/ELEC_backstrom_et_al_Fast_randomization.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/33466
dc.identifier.urnURN:NBN:fi:aalto-201808214599
dc.language.isoenen
dc.publisherIEEE
dc.relation.ispartofseriesIEEE/ACM Transactions on Audio Speech and Language Processingen
dc.relation.ispartofseriesVolume 26, issue 1, pp. 19-30en
dc.rightsopenAccessen
dc.subject.keywordaudio codingen_US
dc.subject.keywordCodecsen_US
dc.subject.keywordComplexity theoryen_US
dc.subject.keyworddistributed codingen_US
dc.subject.keywordorthonormal matrixen_US
dc.subject.keywordQuantization (signal)en_US
dc.subject.keywordrandomizationen_US
dc.subject.keywordSpeechen_US
dc.subject.keywordSpeech codingen_US
dc.subject.keywordspeech codingen_US
dc.subject.keywordSpeech processingen_US
dc.subject.keywordsuperfast algorithmen_US
dc.titleFast Randomization for Distributed Low-Bitrate Coding of Speech and Audioen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionacceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ELEC_backstrom_et_al_Fast_randomization.pdf
Size:
578.89 KB
Format:
Adobe Portable Document Format