Picture for Xuran Pan

Xuran Pan

Bridging the Divide: Reconsidering Softmax and Linear Attention

Add code
Dec 09, 2024
Viaarxiv icon

GSVA: Generalized Segmentation via Multimodal Large Language Models

Add code
Dec 15, 2023
Figure 1 for GSVA: Generalized Segmentation via Multimodal Large Language Models
Figure 2 for GSVA: Generalized Segmentation via Multimodal Large Language Models
Figure 3 for GSVA: Generalized Segmentation via Multimodal Large Language Models
Figure 4 for GSVA: Generalized Segmentation via Multimodal Large Language Models
Viaarxiv icon

DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Add code
Sep 04, 2023
Figure 1 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 2 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 3 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 4 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Viaarxiv icon

FLatten Transformer: Vision Transformer using Focused Linear Attention

Add code
Aug 01, 2023
Viaarxiv icon

Dynamic Perceiver for Efficient Visual Recognition

Add code
Jun 20, 2023
Viaarxiv icon

Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention

Add code
Apr 09, 2023
Figure 1 for Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Figure 2 for Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Figure 3 for Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Figure 4 for Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Viaarxiv icon

Joint Representation Learning for Text and 3D Point Cloud

Add code
Jan 18, 2023
Viaarxiv icon

Contrastive Language-Image Pre-Training with Knowledge Graphs

Add code
Oct 17, 2022
Figure 1 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 2 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 3 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Figure 4 for Contrastive Language-Image Pre-Training with Knowledge Graphs
Viaarxiv icon

ActiveNeRF: Learning where to See with Uncertainty Estimation

Add code
Sep 18, 2022
Figure 1 for ActiveNeRF: Learning where to See with Uncertainty Estimation
Figure 2 for ActiveNeRF: Learning where to See with Uncertainty Estimation
Figure 3 for ActiveNeRF: Learning where to See with Uncertainty Estimation
Figure 4 for ActiveNeRF: Learning where to See with Uncertainty Estimation
Viaarxiv icon

Vision Transformer with Deformable Attention

Add code
Jan 03, 2022
Figure 1 for Vision Transformer with Deformable Attention
Figure 2 for Vision Transformer with Deformable Attention
Figure 3 for Vision Transformer with Deformable Attention
Figure 4 for Vision Transformer with Deformable Attention
Viaarxiv icon