Picture for Dongchen Han

Dongchen Han

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Add code
Aug 11, 2024
Figure 1 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 2 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 3 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 4 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Viaarxiv icon

Demystify Mamba in Vision: A Linear Attention Perspective

Add code
May 26, 2024
Viaarxiv icon

VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models

Add code
Feb 21, 2024
Viaarxiv icon

Agent Attention: On the Integration of Softmax and Linear Attention

Add code
Dec 22, 2023
Viaarxiv icon

GSVA: Generalized Segmentation via Multimodal Large Language Models

Add code
Dec 15, 2023
Viaarxiv icon

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

Add code
Dec 07, 2023
Figure 1 for OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization
Figure 2 for OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization
Figure 3 for OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization
Figure 4 for OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization
Viaarxiv icon

FLatten Transformer: Vision Transformer using Focused Linear Attention

Add code
Aug 01, 2023
Viaarxiv icon

Dynamic Perceiver for Efficient Visual Recognition

Add code
Jun 20, 2023
Viaarxiv icon

Contrastive Language-Image Pre-Training with Knowledge Graphs

Add code
Oct 17, 2022
Figure 1 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 2 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 3 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 4 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Viaarxiv icon

Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

Add code
Mar 22, 2022
Figure 1 for Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Figure 2 for Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Figure 3 for Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Figure 4 for Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Viaarxiv icon