Picture for Zhiyuan Zhou

Zhiyuan Zhou

Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging

Add code
Dec 18, 2025
Viaarxiv icon

$π^{*}_{0.6}$: a VLA That Learns From Experience

Add code
Nov 19, 2025
Viaarxiv icon

Compute-Optimal Scaling for Value-Based Deep RL

Add code
Aug 20, 2025
Viaarxiv icon

Reinforcement Learning with Action Chunking

Add code
Jul 10, 2025
Figure 1 for Reinforcement Learning with Action Chunking
Figure 2 for Reinforcement Learning with Action Chunking
Figure 3 for Reinforcement Learning with Action Chunking
Figure 4 for Reinforcement Learning with Action Chunking
Viaarxiv icon

REACT: Runtime-Enabled Active Collision-avoidance Technique for Autonomous Driving

Add code
May 16, 2025
Viaarxiv icon

On Path to Multimodal Generalist: General-Level and General-Bench

Add code
May 07, 2025
Viaarxiv icon

AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World

Add code
Mar 31, 2025
Viaarxiv icon

SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models

Add code
Dec 19, 2024
Figure 1 for SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models
Figure 2 for SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models
Figure 3 for SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models
Figure 4 for SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models
Viaarxiv icon

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Add code
Dec 10, 2024
Figure 1 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 2 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 3 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 4 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Viaarxiv icon

LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling

Add code
Sep 13, 2024
Figure 1 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Figure 2 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Figure 3 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Viaarxiv icon