Xiangyuan Lan

Transferable Adversarial Face Attack with Text Controlled Attribute

Dec 16, 2024

AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment

Dec 01, 2024

ClickTrack: Towards Real-time Interactive Single Object Tracking

Nov 24, 2024

Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction

Nov 20, 2024

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Oct 08, 2024

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Jul 10, 2024

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Feb 29, 2024

Deep Homography Estimation for Visual Place Recognition

Feb 25, 2024

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition

Feb 22, 2024

Strip-MLP: Efficient Token Interaction for Vision MLP

Jul 21, 2023