Picture for Yu-Gang Jiang

Yu-Gang Jiang

Fudan University

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Add code
Jun 09, 2026
Viaarxiv icon

IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder

Add code
Jun 09, 2026
Viaarxiv icon

UniDexTok: A Unified Dexterous Hand Tokenizer from Real Data

Add code
Jun 09, 2026
Viaarxiv icon

OmniGen-AR: AutoRegressive Any-to-Image Generation

Add code
Jun 08, 2026
Viaarxiv icon

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

Add code
Jun 07, 2026
Viaarxiv icon

DisCo: World Models with Discrete Camera Motion Control

Add code
Jun 06, 2026
Viaarxiv icon

Coarse-to-Control: Action-Token Planning for Vision-Language-Action Models

Add code
Jun 05, 2026
Viaarxiv icon

ActiveMimic: Egocentric Video Pretraining with Active Perception

Add code
Jun 04, 2026
Viaarxiv icon

Constitutional On-Policy Safe Distillation

Add code
Jun 02, 2026
Viaarxiv icon

EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation

Add code
Jun 02, 2026
Viaarxiv icon