Picture for Zhefei Gong

Zhefei Gong

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation

Add code
Feb 19, 2025
Viaarxiv icon

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Add code
Dec 09, 2024
Figure 1 for CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Figure 2 for CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Figure 3 for CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Figure 4 for CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Viaarxiv icon