GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorKorse, Srikanthen_US
dc.contributor.authorFuchs, Guillaumeen_US
dc.contributor.authorBäckström, Tomen_US
dc.contributor.departmentFraunhofer Institute for Integrated Circuitsen_US
dc.contributor.departmentDept Signal Process and Acousten_US
dc.date.accessioned2018-12-10T10:11:24Z
dc.date.available2018-12-10T10:11:24Z
dc.date.issued2018en_US
dc.description.abstractSpectral envelope modelling is a central part of speech and audio codecs and is traditionally based on either vector quantization or scalar quantization followed by entropy coding. To bridge the coding performance of vector quantization with the low complexity of the scalar case, we propose an iterative approach for entropy coding the spectral envelope parameters. For each parameter, a univariate probability distribution is derived from a Gaussian mixture model of the joint distribution and the previously quantized parameters used as a-priori information. Parameters are then iteratively and individually scalar quantized and entropy coded. Unlike vector quantization, the complexity of proposed method does not increase exponentially with dimension and bitrate. Moreover, the coding resolution and dimension can be adaptively modified without retraining the model. Experimental results show that these important advantages do not impair coding efficiency compared to a state-of-art vector quantization scheme.en
dc.description.versionPeer revieweden
dc.format.extent5689-5693
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationKorse , S , Fuchs , G & Bäckström , T 2018 , GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio . in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . , 8461527 , Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , IEEE , pp. 5689-5693 , IEEE International Conference on Acoustics, Speech, and Signal Processing , Calgary , Alberta , Canada , 15/04/2018 . https://doi.org/10.1109/ICASSP.2018.8461527en
dc.identifier.doi10.1109/ICASSP.2018.8461527en_US
dc.identifier.isbn978-1-5386-4658-8
dc.identifier.issn2379-190X
dc.identifier.otherPURE UUID: 1d869657-d620-4bc9-a470-848d46f00a73en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/1d869657-d620-4bc9-a470-848d46f00a73en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/27158967/ELEC_korse_et_al_Gmm_based_iterative2018gmm.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/34946
dc.identifier.urnURN:NBN:fi:aalto-201812105961
dc.language.isoenen
dc.relation.ispartofIEEE International Conference on Acoustics, Speech and Signal Processingen
dc.relation.ispartofseriesProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)en
dc.relation.ispartofseriesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processingen
dc.rightsopenAccessen
dc.subject.keywordEntropy Codingen_US
dc.subject.keywordGaussian mixture modelsen_US
dc.subject.keywordEnvelope Modellingen_US
dc.subject.keywordSpeech Codingen_US
dc.subject.keywordAudio Codingen_US
dc.titleGMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audioen
dc.typeConference article in proceedingsfi
dc.type.versionacceptedVersion
Files