Picture for Yehao Li

Yehao Li

Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning

Add code
Dec 31, 2024
Viaarxiv icon

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Add code
Sep 12, 2024
Viaarxiv icon

Improving Virtual Try-On with Garment-focused Diffusion Models

Add code
Sep 12, 2024
Viaarxiv icon

Boosting Diffusion Models with Moving Average Sampling in Frequency Domain

Add code
Mar 26, 2024
Figure 1 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 2 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 3 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 4 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Viaarxiv icon

SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer

Add code
Mar 25, 2024
Viaarxiv icon

HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs

Add code
Mar 18, 2024
Viaarxiv icon

Control3D: Towards Controllable Text-to-3D Generation

Add code
Nov 09, 2023
Viaarxiv icon

Semantic-Conditional Diffusion Networks for Image Captioning

Add code
Dec 06, 2022
Viaarxiv icon

SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement

Add code
Nov 15, 2022
Viaarxiv icon

Dual Vision Transformer

Add code
Jul 12, 2022
Figure 1 for Dual Vision Transformer
Figure 2 for Dual Vision Transformer
Figure 3 for Dual Vision Transformer
Figure 4 for Dual Vision Transformer
Viaarxiv icon