Picture for Huanzhang Dou

Huanzhang Dou

In-Context LoRA for Diffusion Transformers

Add code
Oct 31, 2024
Viaarxiv icon

Group Diffusion Transformers are Unsupervised Multitask Learners

Add code
Oct 19, 2024
Figure 1 for Group Diffusion Transformers are Unsupervised Multitask Learners
Viaarxiv icon

CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition

Add code
Jul 04, 2024
Viaarxiv icon

GVDIFF: Grounded Text-to-Video Generation with Diffusion Models

Add code
Jul 02, 2024
Figure 1 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 2 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 3 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 4 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Viaarxiv icon

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Add code
Jun 26, 2024
Viaarxiv icon

SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation

Add code
Jun 15, 2024
Viaarxiv icon

GaitMPL: Gait Recognition with Memory-Augmented Progressive Learning

Add code
Jun 06, 2023
Viaarxiv icon

GaitGCI: Generative Counterfactual Intervention for Gait Recognition

Add code
Jun 06, 2023
Viaarxiv icon

Referring Expression Comprehension Using Language Adaptive Inference

Add code
Jun 06, 2023
Viaarxiv icon

MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition

Add code
Jun 06, 2023
Viaarxiv icon