Dong Jing

Say Cheese! Detail-Preserving Portrait Collection Generation via Natural Language Edits

Jan 28, 2026
Via arXiv

Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation

May 19, 2025
Via arXiv

CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval

Feb 28, 2025
Via arXiv

Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval

Dec 15, 2024
Via arXiv

CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning

Mar 07, 2024
Via arXiv

Light Field Raindrop Removal via 4D Re-sampling

May 26, 2022
Via arXiv