Picture for Shuyu Yin

Shuyu Yin

Analyzing and Bridging the Gap between Maximizing Total Reward and Discounted Reward in Deep Reinforcement Learning

Add code
Jul 18, 2024
Viaarxiv icon

Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation

Add code
Jun 12, 2024
Viaarxiv icon

A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning

Add code
Mar 07, 2024
Figure 1 for A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Figure 2 for A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Viaarxiv icon

An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation

Add code
May 25, 2022
Figure 1 for An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Figure 2 for An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Figure 3 for An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Figure 4 for An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Viaarxiv icon

FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model

Add code
Dec 03, 2018
Figure 1 for FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model
Figure 2 for FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model
Figure 3 for FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model
Figure 4 for FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model
Viaarxiv icon