Picture for Yu-Gang Jiang

Yu-Gang Jiang

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Add code
Sep 10, 2025
Viaarxiv icon

Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives

Add code
Aug 20, 2025
Viaarxiv icon

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Add code
Aug 11, 2025
Viaarxiv icon

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Add code
Aug 07, 2025
Viaarxiv icon

Multimodal Referring Segmentation: A Survey

Add code
Aug 01, 2025
Viaarxiv icon

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Add code
Jul 30, 2025
Viaarxiv icon

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Add code
Jun 23, 2025
Viaarxiv icon

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Add code
Jun 11, 2025
Viaarxiv icon

Reasoning Models Are More Easily Gaslighted Than You Think

Add code
Jun 11, 2025
Viaarxiv icon