Dongchen Han

Bridging the Divide: Reconsidering Softmax and Linear Attention

Dec 09, 2024
Via arXiv

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Aug 11, 2024
Via arXiv

Demystify Mamba in Vision: A Linear Attention Perspective

May 26, 2024
Via arXiv

VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models

Feb 21, 2024
Via arXiv

Agent Attention: On the Integration of Softmax and Linear Attention

Dec 22, 2023
Via arXiv

GSVA: Generalized Segmentation via Multimodal Large Language Models

Dec 15, 2023
Via arXiv

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

Dec 07, 2023
Via arXiv

FLatten Transformer: Vision Transformer using Focused Linear Attention

Aug 01, 2023
Via arXiv

Dynamic Perceiver for Efficient Visual Recognition

Jun 20, 2023
Via arXiv

Contrastive Language-Image Pre-Training with Knowledge Graphs

Oct 17, 2022
Via arXiv