Picture for Zhiyuan Zhou

Zhiyuan Zhou

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Add code
Dec 10, 2024
Viaarxiv icon

LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling

Add code
Sep 13, 2024
Viaarxiv icon

Autonomous Improvement of Instruction Following Skills via Foundation Models

Add code
Jul 30, 2024
Viaarxiv icon

Proprioceptive State Estimation for Amphibious Tactile Sensing

Add code
Dec 15, 2023
Viaarxiv icon

Autoencoding a Soft Touch to Learn Grasping from On-land to Underwater

Add code
Aug 16, 2023
Viaarxiv icon

Specifying Behavior Preference with Tiered Reward Functions

Add code
Dec 07, 2022
Viaarxiv icon

Designing Rewards for Fast Learning

Add code
May 30, 2022
Figure 1 for Designing Rewards for Fast Learning
Figure 2 for Designing Rewards for Fast Learning
Figure 3 for Designing Rewards for Fast Learning
Figure 4 for Designing Rewards for Fast Learning
Viaarxiv icon

Characterizing the Action-Generalization Gap in Deep Q-Learning

Add code
May 11, 2022
Figure 1 for Characterizing the Action-Generalization Gap in Deep Q-Learning
Figure 2 for Characterizing the Action-Generalization Gap in Deep Q-Learning
Figure 3 for Characterizing the Action-Generalization Gap in Deep Q-Learning
Viaarxiv icon

Generalized-TODIM Method for Multi-criteria Decision Making with Basic Uncertain Information and its Application

Add code
Apr 27, 2021
Figure 1 for Generalized-TODIM Method for Multi-criteria Decision Making with Basic Uncertain Information and its Application
Figure 2 for Generalized-TODIM Method for Multi-criteria Decision Making with Basic Uncertain Information and its Application
Viaarxiv icon