Picture for Vincent Tao Hu

Vincent Tao Hu

TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training

Add code
Jan 08, 2025
Viaarxiv icon

Does VLM Classification Benefit from LLM Description Semantics?

Add code
Dec 16, 2024
Figure 1 for Does VLM Classification Benefit from LLM Description Semantics?
Figure 2 for Does VLM Classification Benefit from LLM Description Semantics?
Figure 3 for Does VLM Classification Benefit from LLM Description Semantics?
Figure 4 for Does VLM Classification Benefit from LLM Description Semantics?
Viaarxiv icon

[MASK] is All You Need

Add code
Dec 10, 2024
Viaarxiv icon

Distillation of Diffusion Features for Semantic Correspondence

Add code
Dec 04, 2024
Viaarxiv icon

Scaling Image Tokenizers with Grouped Spherical Quantization

Add code
Dec 03, 2024
Viaarxiv icon

Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory

Add code
Jul 29, 2024
Viaarxiv icon

Diffusion Models and Representation Learning: A Survey

Add code
Jun 30, 2024
Figure 1 for Diffusion Models and Representation Learning: A Survey
Figure 2 for Diffusion Models and Representation Learning: A Survey
Figure 3 for Diffusion Models and Representation Learning: A Survey
Figure 4 for Diffusion Models and Representation Learning: A Survey
Viaarxiv icon

Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions

Add code
Mar 25, 2024
Viaarxiv icon

DepthFM: Fast Monocular Depth Estimation with Flow Matching

Add code
Mar 20, 2024
Figure 1 for DepthFM: Fast Monocular Depth Estimation with Flow Matching
Figure 2 for DepthFM: Fast Monocular Depth Estimation with Flow Matching
Figure 3 for DepthFM: Fast Monocular Depth Estimation with Flow Matching
Figure 4 for DepthFM: Fast Monocular Depth Estimation with Flow Matching
Viaarxiv icon

ZigMa: Zigzag Mamba Diffusion Model

Add code
Mar 20, 2024
Viaarxiv icon