Yuxuan Ding

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models

Oct 30, 2024

EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance

Sep 12, 2024

The CLIP Model is Secretly an Image-to-Prompt Converter

May 22, 2023

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

Jan 18, 2023

Position-Aware Relation Learning for RGB-Thermal Salient Object Detection

Sep 21, 2022

Seeking Subjectivity in Visual Emotion Distribution Learning

Jul 25, 2022

Don't Stop Learning: Towards Continual Learning for the CLIP Model

Jul 20, 2022

Stimuli-Aware Visual Emotion Analysis

Sep 04, 2021