Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shugao Liu

ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Feb 08, 2025

Yuhui Chen, Shuai Tian, Shugao Liu, Yingting Zhou, Haoran Li, Dongbin Zhao

Figure 1 for ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Figure 2 for ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Figure 3 for ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Figure 4 for ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Abstract:Vision-Language-Action (VLA) models have shown substantial potential in real-world robotic manipulation. However, fine-tuning these models through supervised learning struggles to achieve robust performance due to limited, inconsistent demonstrations, especially in contact-rich environments. In this paper, we propose a reinforced fine-tuning approach for VLA models, named ConRFT, which consists of offline and online fine-tuning with a unified consistency-based training objective, to address these challenges. In the offline stage, our method integrates behavior cloning and Q-learning to effectively extract policy from a small set of demonstrations and stabilize value estimating. In the online stage, the VLA model is further fine-tuned via consistency policy, with human interventions to ensure safe exploration and high sample efficiency. We evaluate our approach on eight diverse real-world manipulation tasks. It achieves an average success rate of 96.3% within 45-90 minutes of online fine-tuning, outperforming prior supervised methods with a 144% improvement in success rate and 1.9x shorter episode length. This work highlights the potential of integrating reinforcement learning to enhance the performance of VLA models for real-world robotic applications.

Via

Access Paper or Ask Questions