Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing

Loading...
Thumbnail Image

Access rights

openAccess
acceptedVersion

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Date

2023

Major/Subject

Mcode

Degree programme

Language

en

Pages

5

Series

International Conference on Acoustics, Speech, and Signal Processing, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing

Abstract

Vector quantization (VQ) methods have been used in a wide range of applications for speech, image, and video data. While classic VQ methods often use expectation maximization, in this paper, we investigate the use of stochastic optimization employing our recently proposed noise substitution in vector quantization technique. We consider three variants of VQ including additive VQ, residual VQ, and product VQ, and evaluate their quality, complexity and bitrate in speech coding, image compression, approximate nearest neighbor search, and a selection of toy examples. Our experimental results demonstrate the trade-offs in accuracy, complexity, and bitrate such that using our open source implementations and complexity calculator, the best vector quantization method can be chosen for a particular problem.

Description

Keywords

Complexity, Machine learning, rate-distortion, Vector quantization

Other note

Citation

Vali, M & Bäckström, T 2023, Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing . in International Conference on Acoustics, Speech, and Signal Processing . Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, IEEE International Conference on Acoustics, Speech, and Signal Processing, Rhodes Island, Greece, 04/06/2023 . https://doi.org/10.1109/ICASSP49357.2023.10096204