Picture for Xiaokang Yang

Xiaokang Yang

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Add code
Mar 16, 2026
Viaarxiv icon

R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation

Add code
Mar 15, 2026
Viaarxiv icon

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

Add code
Mar 11, 2026
Viaarxiv icon

Dynamic Training-Free Fusion of Subject and Style LoRAs

Add code
Feb 17, 2026
Viaarxiv icon

CLOT: Closed-Loop Global Motion Tracking for Whole-Body Humanoid Teleoperation

Add code
Feb 13, 2026
Viaarxiv icon

Thinking Like a Radiologist: A Dataset for Anatomy-Guided Interleaved Vision Language Reasoning in Chest X-ray Interpretation

Add code
Feb 13, 2026
Viaarxiv icon

UniVTAC: A Unified Simulation Platform for Visuo-Tactile Manipulation Data Generation, Learning, and Benchmarking

Add code
Feb 10, 2026
Viaarxiv icon

Light Up Your Face: A Physically Consistent Dataset and Diffusion Model for Face Fill-Light Enhancement

Add code
Feb 04, 2026
Viaarxiv icon

Stabilizing Diffusion Posterior Sampling by Noise--Frequency Continuation

Add code
Jan 30, 2026
Viaarxiv icon

Learning Domain Knowledge in Multimodal Large Language Models through Reinforcement Fine-Tuning

Add code
Jan 23, 2026
Viaarxiv icon