GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio

Loading...
Thumbnail Image
Journal Title
Journal ISSN
Volume Title
Conference article in proceedings
This publication is imported from Aalto University research portal.
View publication in the Research portal
View/Open full text file from the Research portal
Date
2018
Major/Subject
Mcode
Degree programme
Language
en
Pages
5689-5693
Series
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
Abstract
Spectral envelope modelling is a central part of speech and audio codecs and is traditionally based on either vector quantization or scalar quantization followed by entropy coding. To bridge the coding performance of vector quantization with the low complexity of the scalar case, we propose an iterative approach for entropy coding the spectral envelope parameters. For each parameter, a univariate probability distribution is derived from a Gaussian mixture model of the joint distribution and the previously quantized parameters used as a-priori information. Parameters are then iteratively and individually scalar quantized and entropy coded. Unlike vector quantization, the complexity of proposed method does not increase exponentially with dimension and bitrate. Moreover, the coding resolution and dimension can be adaptively modified without retraining the model. Experimental results show that these important advantages do not impair coding efficiency compared to a state-of-art vector quantization scheme.
Description
Keywords
Entropy Coding, Gaussian mixture models, Envelope Modelling, Speech Coding, Audio Coding
Other note
Citation
Korse , S , Fuchs , G & Bäckström , T 2018 , GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio . in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . , 8461527 , Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , IEEE , pp. 5689-5693 , IEEE International Conference on Acoustics, Speech, and Signal Processing , Calgary , Alberta , Canada , 15/04/2018 . https://doi.org/10.1109/ICASSP.2018.8461527