Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Oct 14, 2024

Mingyuan Yan, Jiawei Wu, Rushi Shah, Dianbo Liu

Figure 1 for Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Figure 2 for Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Figure 3 for Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Figure 4 for Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Share this with someone who'll enjoy it:

Abstract:The vector quantization is a widely used method to map continuous representation to discrete space and has important application in tokenization for generative mode, bottlenecking information and many other tasks in machine learning. Vector Quantized Variational Autoencoder (VQ-VAE) is a type of variational autoencoder using discrete embedding as latent. We generalize the technique further, enriching the probabilistic framework with a Gaussian mixture as the underlying generative model. This framework leverages a codebook of latent means and adaptive variances to capture complex data distributions. This principled framework avoids various heuristics and strong assumptions that are needed with the VQ-VAE to address training instability and to improve codebook utilization. This approach integrates the benefits of both discrete and continuous representations within a variational Bayesian framework. Furthermore, by introducing the \textit{Aggregated Categorical Posterior Evidence Lower Bound} (ALBO), we offer a principled alternative optimization objective that aligns variational distributions with the generative model. Our experiments demonstrate that GM-VQ improves codebook utilization and reduces information loss without relying on handcrafted heuristics.

View paper on

Share this with someone who'll enjoy it:

Title:Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

Paper and Code