Xuxin Cheng

DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding

Jan 30, 2026

Reflecting Twice before Speaking with Empathy: Self-Reflective Alternating Inference for Empathy-Aware End-to-End Spoken Dialogue

Jan 26, 2026

In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data

Nov 19, 2025

HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation

Nov 18, 2025

GMT: General Motion Tracking for Humanoid Whole-Body Control

Jun 17, 2025

AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control

May 06, 2025

Kongzi: A Historical Large Language Model with Fact Enhancement

Apr 13, 2025

Humanoid Policy ~ Human Policy

Mar 17, 2025

ExBody2: Advanced Expressive Humanoid Whole-Body Control

Dec 17, 2024

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Dec 13, 2024