Picture for Zekun Qi

Zekun Qi

Learning Athletic Humanoid Tennis Skills from Imperfect Human Motion Data

Add code
Mar 13, 2026
Viaarxiv icon

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Add code
Feb 14, 2026
Viaarxiv icon

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Add code
Jan 18, 2026
Viaarxiv icon

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Add code
Jul 03, 2025
Viaarxiv icon

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Add code
Feb 18, 2025
Figure 1 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 2 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 3 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Figure 4 for SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Viaarxiv icon

Positional Prompt Tuning for Efficient 3D Representation Learning

Add code
Aug 21, 2024
Figure 1 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 2 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 3 for Positional Prompt Tuning for Efficient 3D Representation Learning
Figure 4 for Positional Prompt Tuning for Efficient 3D Representation Learning
Viaarxiv icon

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Add code
Jun 24, 2024
Figure 1 for DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Figure 2 for DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Figure 3 for DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Figure 4 for DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Viaarxiv icon

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Add code
Mar 06, 2024
Figure 1 for ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Figure 2 for ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Figure 3 for ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Figure 4 for ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Viaarxiv icon

DreamLLM: Synergistic Multimodal Comprehension and Creation

Add code
Sep 20, 2023
Viaarxiv icon

VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation

Add code
Jul 28, 2023
Figure 1 for VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Figure 2 for VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Figure 3 for VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Figure 4 for VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Viaarxiv icon