Picture for Xiangyuan Lan

Xiangyuan Lan

Towards Visual Grounding: A Survey

Add code
Dec 28, 2024
Viaarxiv icon

Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model

Add code
Dec 22, 2024
Viaarxiv icon

Transferable Adversarial Face Attack with Text Controlled Attribute

Add code
Dec 16, 2024
Figure 1 for Transferable Adversarial Face Attack with Text Controlled Attribute
Figure 2 for Transferable Adversarial Face Attack with Text Controlled Attribute
Figure 3 for Transferable Adversarial Face Attack with Text Controlled Attribute
Figure 4 for Transferable Adversarial Face Attack with Text Controlled Attribute
Viaarxiv icon

AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment

Add code
Dec 01, 2024
Viaarxiv icon

ClickTrack: Towards Real-time Interactive Single Object Tracking

Add code
Nov 24, 2024
Figure 1 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 2 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 3 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 4 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Viaarxiv icon

Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction

Add code
Nov 20, 2024
Figure 1 for Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction
Figure 2 for Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction
Figure 3 for Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction
Figure 4 for Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction
Viaarxiv icon

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Add code
Oct 08, 2024
Figure 1 for EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Figure 2 for EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Figure 3 for EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Figure 4 for EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Viaarxiv icon

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Add code
Jul 10, 2024
Figure 1 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 2 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 3 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 4 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Viaarxiv icon

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Add code
Feb 29, 2024
Viaarxiv icon

Deep Homography Estimation for Visual Place Recognition

Add code
Feb 25, 2024
Viaarxiv icon