Dongmei Jiang

Transferable Adversarial Face Attack with Text Controlled Attribute

Dec 16, 2024
Via arXiv

AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment

Dec 01, 2024
Via arXiv

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Nov 19, 2024
Via arXiv

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Oct 08, 2024
Via arXiv

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment

Sep 10, 2024
Via arXiv

ExpLLM: Towards Chain of Thought for Facial Expression Recognition

Sep 04, 2024
Via arXiv

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Aug 07, 2024
Via arXiv

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Jul 10, 2024
Via arXiv

Prompt Customization for Continual Learning

Apr 28, 2024
Via arXiv

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Feb 29, 2024
Via arXiv