Picture for Zekun Qi

Zekun Qi

DeformGen: Dynamics-Based Topology Augmentation for Deformable Manipulation Policy Learning

Add code
Jun 24, 2026
Viaarxiv icon

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Add code
Jun 17, 2026
Viaarxiv icon

LIMMT: Less is More for Motion Tracking

Add code
Jun 05, 2026
Viaarxiv icon

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Add code
Jun 02, 2026
Viaarxiv icon

Learning Athletic Humanoid Tennis Skills from Imperfect Human Motion Data

Add code
Mar 13, 2026
Viaarxiv icon

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Add code
Feb 14, 2026
Viaarxiv icon

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Add code
Jan 18, 2026
Viaarxiv icon

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Add code
Jul 03, 2025
Viaarxiv icon

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Add code
Feb 18, 2025
Figure 1 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 2 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 3 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 4 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Viaarxiv icon

Positional Prompt Tuning for Efficient 3D Representation Learning

Add code
Aug 21, 2024
Figure 1 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 2 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 3 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 4 for Positional Prompt Tuning for Efficient 3D Representation Learning
Viaarxiv icon