Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing

Access rights
openAccess
Conference article in proceedings
This publication is imported from Aalto University research portal.
Date
2023
Language
en
Pages
5
Series
International Conference on Acoustics, Speech, and Signal Processing
Abstract
Vector quantization (VQ) methods have been used in a wide range of applications for speech, image, and video data. While classic VQ methods often use expectation maximization, in this paper we investigate the use of stochastic optimization employing our recently proposed noise substitution in vector quantization technique. We consider three variants of VQ, namely additive VQ, residual VQ, and product VQ, and evaluate their quality, complexity, and bitrate in speech coding, image compression, approximate nearest neighbor search, and a selection of toy examples. Our experimental results demonstrate the trade-offs in accuracy, complexity, and bitrate, so that, with our open-source implementations and complexity calculator, the best vector quantization method can be chosen for a particular problem.
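The abstract names the VQ variants without defining them. As a rough illustration only, the sketch below encodes one vector with product VQ and with residual VQ, using their textbook definitions. It is not the authors' implementation: the codebooks are fixed toy values rather than codebooks trained by stochastic optimization, and all function names (nearest, product_vq, residual_vq, bits_per_vector) are hypothetical. Additive VQ, which also reconstructs a vector as a sum of one codeword per codebook but searches the codebooks jointly, is left out for brevity.

```python
import math


def nearest(codebook, x):
    """Return the index of the codeword closest to x (squared Euclidean distance)."""
    def dist(c):
        return sum((ci - xi) ** 2 for ci, xi in zip(c, x))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))


def product_vq(codebooks, x):
    """Split x into equal-length chunks; quantize each chunk with its own codebook."""
    d = len(x) // len(codebooks)
    indices, recon = [], []
    for m, cb in enumerate(codebooks):
        i = nearest(cb, x[m * d:(m + 1) * d])
        indices.append(i)
        recon.extend(cb[i])  # reconstruction is the concatenation of chosen codewords
    return indices, recon


def residual_vq(codebooks, x):
    """Quantize x in stages; each stage quantizes the error left by earlier stages."""
    residual = list(x)
    recon = [0.0] * len(x)
    indices = []
    for cb in codebooks:
        i = nearest(cb, residual)
        indices.append(i)
        recon = [r + c for r, c in zip(recon, cb[i])]  # sum of chosen codewords
        residual = [r - c for r, c in zip(residual, cb[i])]
    return indices, recon


def bits_per_vector(codebooks):
    """Bitrate with fixed-length indices: log2(codebook size) summed over codebooks."""
    return sum(math.log2(len(cb)) for cb in codebooks)


# Toy data (assumed values, chosen only to make the example easy to follow).
x = [0.9, -0.2, 0.1, 1.1]
pq_codebooks = [
    [[0.0, 0.0], [1.0, 0.0]],  # codebook for the first half of x
    [[0.0, 0.0], [0.0, 1.0]],  # codebook for the second half of x
]
rvq_codebooks = [
    [[0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 1.0]],     # coarse first stage
    [[0.0, 0.0, 0.0, 0.0], [-0.1, -0.2, 0.1, 0.1]],   # refinement stage
]

pq_indices, pq_recon = product_vq(pq_codebooks, x)
rvq_indices, rvq_recon = residual_vq(rvq_codebooks, x)
```

In this toy setup both methods spend the same number of bits per vector (two codebooks of two entries each), yet they differ in search cost and reconstruction error, which is the kind of trade-off the abstract refers to.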
Keywords
Complexity, Machine learning, rate-distortion, Vector quantization
Citation
Vali, M & Bäckström, T 2023, Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing. in International Conference on Acoustics, Speech, and Signal Processing. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, IEEE International Conference on Acoustics, Speech, and Signal Processing, Rhodes Island, Greece, 04/06/2023. https://doi.org/10.1109/ICASSP49357.2023.10096204