Kejie Wang

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Oct 14, 2024

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

May 27, 2024

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

Feb 19, 2023

Joint Answering and Explanation for Visual Commonsense Reasoning

Feb 25, 2022