Shiqing Gao

Extreme Value Policy Optimization for Safe Reinforcement Learning

Jan 17, 2026
Via arXiv

Controlling Underestimation Bias in Constrained Reinforcement Learning for Safe Exploration

Jan 17, 2026
Via arXiv

Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints

Jul 22, 2024
Via arXiv