VQ VAE


A vector-quantized variational autoencoder (VQ VAE) is a generative model that uses vector quantization to learn discrete latent representations: each continuous encoder output is replaced by its nearest entry in a learned codebook.
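
As a rough illustration of the quantization step only, the sketch below maps each continuous vector to its nearest codebook entry by squared Euclidean distance. The function name `quantize`, the toy codebook, and the sample vectors are hypothetical and chosen for illustration; a real model learns the codebook jointly with the encoder and decoder, and that training procedure is not shown here.

```python
def quantize(z_e, codebook):
    """Map each continuous vector in z_e to its nearest codebook entry.

    z_e:      list of D-dimensional vectors (stand-ins for encoder outputs)
    codebook: list of K D-dimensional embedding vectors (toy values here)
    Returns (indices, z_q): the discrete codes and the quantized vectors.
    """
    indices = []
    for z in z_e:
        # Squared Euclidean distance from z to every codebook entry
        dists = [sum((a - b) ** 2 for a, b in zip(z, e)) for e in codebook]
        # The index of the closest entry is the discrete latent code
        indices.append(dists.index(min(dists)))
    # Look up the chosen entries; these would be passed on to a decoder
    z_q = [codebook[i] for i in indices]
    return indices, z_q


# Hypothetical 3-entry codebook and three 2-D "encoder outputs"
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
z_e = [[0.9, 1.2], [-0.1, 0.2], [-0.8, 0.7]]
indices, z_q = quantize(z_e, codebook)
print(indices)  # [1, 0, 2]
```

The integer indices are the discrete representation; the looked-up vectors are what a downstream decoder would consume.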

Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition

Jan 08, 2025

PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing

Dec 26, 2024

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

Dec 22, 2024

Jet: A Modern Transformer-Based Normalizing Flow

Dec 19, 2024

SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization

Dec 17, 2024

CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls

Dec 13, 2024

Adaptive$^2$: Adaptive Domain Mining for Fine-grained Domain Adaptation Modeling

Dec 11, 2024

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Dec 02, 2024

Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates

Dec 02, 2024

JetFormer: An Autoregressive Generative Model of Raw Images and Text

Nov 29, 2024