Picture for Yu-Gang Jiang

Yu-Gang Jiang

Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue

Add code
Sep 18, 2025
Viaarxiv icon

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Add code
Sep 10, 2025
Viaarxiv icon

Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives

Add code
Aug 20, 2025
Viaarxiv icon

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Add code
Aug 11, 2025
Viaarxiv icon

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Add code
Aug 07, 2025
Viaarxiv icon

Multimodal Referring Segmentation: A Survey

Add code
Aug 01, 2025
Viaarxiv icon

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Add code
Jul 30, 2025
Viaarxiv icon

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Add code
Jun 23, 2025
Viaarxiv icon

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

Reasoning Models Are More Easily Gaslighted Than You Think

Add code
Jun 11, 2025
Figure 1 for Reasoning Models Are More Easily Gaslighted Than You Think
Figure 2 for Reasoning Models Are More Easily Gaslighted Than You Think
Figure 3 for Reasoning Models Are More Easily Gaslighted Than You Think
Figure 4 for Reasoning Models Are More Easily Gaslighted Than You Think
Viaarxiv icon