Picture for Xiaodan Liang

Xiaodan Liang

AnyCrowd: Instance-Isolated Identity-Pose Binding for Arbitrary Multi-Character Animation

Add code
Mar 16, 2026
Viaarxiv icon

World2Act: Latent Action Post-Training via Skill-Compositional World Models

Add code
Mar 11, 2026
Viaarxiv icon

Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos

Add code
Mar 10, 2026
Viaarxiv icon

Choose What to Observe: Task-Aware Semantic-Geometric Representations for Visuomotor Policy

Add code
Mar 09, 2026
Viaarxiv icon

AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots

Add code
Mar 08, 2026
Viaarxiv icon

Suppressing Prior-Comparison Hallucinations in Radiology Report Generation via Semantically Decoupled Latent Steering

Add code
Feb 27, 2026
Viaarxiv icon

WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos

Add code
Feb 24, 2026
Viaarxiv icon

RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

Add code
Feb 13, 2026
Viaarxiv icon

SimuScene: Training and Benchmarking Code Generation to Simulate Physical Scenarios

Add code
Feb 11, 2026
Viaarxiv icon

ERGO: Excess-Risk-Guided Optimization for High-Fidelity Monocular 3D Gaussian Splatting

Add code
Feb 10, 2026
Viaarxiv icon