Yucheng Han

EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts

Jun 13, 2024
Via arXiv

Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

Jan 10, 2024
Via arXiv

ICD-LM: Configuring Vision-Language In-Context Demonstrations by Language Modeling

Dec 22, 2023
Via arXiv

AppAgent: Multimodal Agents as Smartphone Users

Dec 22, 2023
Via arXiv

ChartLlama: A Multimodal LLM for Chart Understanding and Generation

Nov 27, 2023
Via arXiv

Prompt-aligned Gradient for Prompt Tuning

May 30, 2022
Via arXiv

Fast AdvProp

Apr 21, 2022
Via arXiv