Kai Qiu

ImageFolder: Autoregressive Image Generation with Folded Tokens

Oct 02, 2024
Via arXiv

Efficient Autoregressive Audio Modeling via Next-Scale Prediction

Aug 16, 2024
Via arXiv

ControlVAR: Exploring Controllable Visual Autoregressive Modeling

Jun 14, 2024
Via arXiv

$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

Mar 07, 2024
Via arXiv

Exploring Transferability for Randomized Smoothing

Dec 14, 2023
Via arXiv

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Nov 30, 2023
Via arXiv

ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Nov 30, 2023
Via arXiv

Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge

Nov 22, 2022
Via arXiv

Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting

Aug 08, 2019
Via arXiv