Yuke Zhu

LEGATO: Cross-Embodiment Imitation Using a Grasping Tool

Nov 06, 2024

RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation

Nov 05, 2024

SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation

Nov 01, 2024

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

Oct 31, 2024

Multi-Task Interactive Robot Fleet Learning with Visual World Models

Oct 30, 2024

HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots

Oct 28, 2024

One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation

Oct 28, 2024

Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions

Oct 16, 2024

OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation

Oct 15, 2024

BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation

Oct 08, 2024