Picture for Guande He

Guande He

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Add code
Mar 03, 2025
Viaarxiv icon

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Add code
Feb 21, 2025
Viaarxiv icon

Elucidating the Preconditioning in Consistency Distillation

Add code
Feb 05, 2025
Viaarxiv icon

Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models

Add code
Dec 19, 2024
Figure 1 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 2 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 3 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 4 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Viaarxiv icon

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

Add code
Nov 26, 2024
Figure 1 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 2 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 3 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 4 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Viaarxiv icon

Consistency Diffusion Bridge Models

Add code
Oct 31, 2024
Figure 1 for Consistency Diffusion Bridge Models
Figure 2 for Consistency Diffusion Bridge Models
Figure 3 for Consistency Diffusion Bridge Models
Figure 4 for Consistency Diffusion Bridge Models
Viaarxiv icon

Diffusion Bridge Implicit Models

Add code
May 24, 2024
Figure 1 for Diffusion Bridge Implicit Models
Figure 2 for Diffusion Bridge Implicit Models
Figure 3 for Diffusion Bridge Implicit Models
Figure 4 for Diffusion Bridge Implicit Models
Viaarxiv icon

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

Add code
May 07, 2024
Figure 1 for Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Figure 2 for Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Figure 3 for Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Figure 4 for Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Viaarxiv icon

Noise Contrastive Alignment of Language Models with Explicit Rewards

Add code
Feb 08, 2024
Figure 1 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 2 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 3 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 4 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Viaarxiv icon

Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis

Add code
Dec 06, 2023
Viaarxiv icon