Picture for Dongmei Jiang

Dongmei Jiang

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Add code
Oct 08, 2024
Viaarxiv icon

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment

Add code
Sep 10, 2024
Figure 1 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 2 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 3 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Figure 4 for Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment
Viaarxiv icon

ExpLLM: Towards Chain of Thought for Facial Expression Recognition

Add code
Sep 04, 2024
Viaarxiv icon

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Add code
Aug 07, 2024
Viaarxiv icon

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Add code
Jul 10, 2024
Figure 1 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 2 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 3 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Figure 4 for OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Viaarxiv icon

Prompt Customization for Continual Learning

Add code
Apr 28, 2024
Viaarxiv icon

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Add code
Feb 29, 2024
Viaarxiv icon

Deep Homography Estimation for Visual Place Recognition

Add code
Feb 25, 2024
Viaarxiv icon

Enhancing the Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought

Add code
Jan 12, 2024
Viaarxiv icon

Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features

Add code
Aug 13, 2023
Viaarxiv icon