Picture for Fabian Mentzer

Fabian Mentzer

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

GIVT: Generative Infinite-Vocabulary Transformers

Add code
Dec 04, 2023
Viaarxiv icon

Finite Scalar Quantization: VQ-VAE Made Simple

Add code
Oct 12, 2023
Viaarxiv icon

High-Fidelity Image Compression with Score-based Generative Models

Add code
May 26, 2023
Viaarxiv icon

M2T: Masking Transformers Twice for Faster Decoding

Add code
Apr 14, 2023
Viaarxiv icon

Multi-Realism Image Compression with a Conditional Generator

Add code
Dec 28, 2022
Viaarxiv icon

Lossy Compression with Gaussian Diffusion

Add code
Jun 17, 2022
Figure 1 for Lossy Compression with Gaussian Diffusion
Figure 2 for Lossy Compression with Gaussian Diffusion
Figure 3 for Lossy Compression with Gaussian Diffusion
Figure 4 for Lossy Compression with Gaussian Diffusion
Viaarxiv icon

VCT: A Video Compression Transformer

Add code
Jun 15, 2022
Figure 1 for VCT: A Video Compression Transformer
Figure 2 for VCT: A Video Compression Transformer
Figure 3 for VCT: A Video Compression Transformer
Figure 4 for VCT: A Video Compression Transformer
Viaarxiv icon

Towards Generative Video Compression

Add code
Jul 26, 2021
Figure 1 for Towards Generative Video Compression
Figure 2 for Towards Generative Video Compression
Figure 3 for Towards Generative Video Compression
Figure 4 for Towards Generative Video Compression
Viaarxiv icon

High-Fidelity Generative Image Compression

Add code
Jul 10, 2020
Figure 1 for High-Fidelity Generative Image Compression
Figure 2 for High-Fidelity Generative Image Compression
Figure 3 for High-Fidelity Generative Image Compression
Figure 4 for High-Fidelity Generative Image Compression
Viaarxiv icon