Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio

dc.contributor: Aalto-yliopisto [fi]
dc.contributor: Aalto University [en]
dc.contributor.author: Backstrom, Tom
dc.contributor.author: Fischer, Johannes
dc.contributor.department: Dept Signal Process and Acoust
dc.contributor.department: Friedrich-Alexander University Erlangen-Nürnberg
dc.date.accessioned: 2018-08-21T13:43:12Z
dc.date.available: 2018-08-21T13:43:12Z
dc.date.issued: 2018-01
dc.description.abstract: Efficient coding of speech and audio in a distributed system requires that quantization errors across nodes are uncorrelated. Yet with conventional methods at low bitrates, quantization levels become increasingly sparse, which does not correspond to the distribution of the input signal and, importantly, also reduces coding efficiency in a distributed system. We have recently proposed a distributed speech and audio codec design which applies quantization in a randomized domain such that quantization errors are randomly rotated in the output domain. Similar to dithering, this ensures that quantization errors across nodes are uncorrelated and coding efficiency is retained. In this paper we improve this approach by proposing faster randomization methods, with a computational complexity of O(N log N). Presented experiments demonstrate that the proposed randomizations yield uncorrelated signals, that perceptual quality is competitive and that the complexity of the proposed methods is feasible for practical applications. [en]
dc.description.version: Peer reviewed [en]
dc.format.extent: 11
dc.format.extent: 19-30
dc.format.mimetype: application/pdf
dc.identifier.citation: Backstrom, T & Fischer, J 2018, 'Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio', IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 26, no. 1, pp. 19-30. https://doi.org/10.1109/TASLP.2017.2757601 [en]
dc.identifier.doi: 10.1109/TASLP.2017.2757601
dc.identifier.issn: 2329-9290
dc.identifier.issn: 2329-9304
dc.identifier.other: PURE UUID: 238136e5-8df9-4d92-a9e8-4edebba32aa6
dc.identifier.other: PURE ITEMURL: https://research.aalto.fi/en/publications/238136e5-8df9-4d92-a9e8-4edebba32aa6
dc.identifier.other: PURE LINK: http://www.scopus.com/inward/record.url?scp=85030764240&partnerID=8YFLogxK
dc.identifier.other: PURE FILEURL: https://research.aalto.fi/files/27158975/ELEC_backstrom_et_al_Fast_randomization.pdf
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/33466
dc.identifier.urn: URN:NBN:fi:aalto-201808214599
dc.language.iso: en [en]
dc.relation.ispartofseries: IEEE/ACM Transactions on Audio Speech and Language Processing [en]
dc.relation.ispartofseries: Volume 26, issue 1 [en]
dc.rights: openAccess [en]
dc.subject.keyword: audio coding
dc.subject.keyword: Codecs
dc.subject.keyword: Complexity theory
dc.subject.keyword: distributed coding
dc.subject.keyword: orthonormal matrix
dc.subject.keyword: Quantization (signal)
dc.subject.keyword: randomization
dc.subject.keyword: Speech
dc.subject.keyword: Speech coding
dc.subject.keyword: speech coding
dc.subject.keyword: Speech processing
dc.subject.keyword: superfast algorithm
dc.title: Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio [en]
dc.type: A1 Original article in a scientific journal (translated from Finnish)
dc.type.version: acceptedVersion