Picture for Dongmei Jiang

Dongmei Jiang

AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment

Add code
Dec 01, 2024
Viaarxiv icon

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Add code
Nov 19, 2024
Viaarxiv icon

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Add code
Oct 08, 2024
Viaarxiv icon

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment

Add code
Sep 10, 2024
Figure 1 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 2 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 3 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 4 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Viaarxiv icon

ExpLLM: Towards Chain of Thought for Facial Expression Recognition

Add code
Sep 04, 2024
Viaarxiv icon

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Add code
Aug 07, 2024
Viaarxiv icon

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Add code
Jul 10, 2024
Figure 1 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 2 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 3 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 4 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Viaarxiv icon

Prompt Customization for Continual Learning

Add code
Apr 28, 2024
Viaarxiv icon

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Add code
Feb 29, 2024
Viaarxiv icon

Deep Homography Estimation for Visual Place Recognition

Add code
Feb 25, 2024
Viaarxiv icon