Yuhao Ding

Max

Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation

May 31, 2024
Via arXiv

A CMDP-within-online framework for Meta-Safe Reinforcement Learning

May 26, 2024
Via arXiv

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning

May 26, 2024
Via arXiv

Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation

May 02, 2024
Via arXiv

Tempo Adaptation in Non-stationary Reinforcement Learning

Sep 26, 2023
Via arXiv

Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities

May 27, 2023
Via arXiv

DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference

Feb 24, 2023
Via arXiv

Scalable Multi-Agent Reinforcement Learning with General Utilities

Feb 15, 2023
Via arXiv

Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation Design

Nov 19, 2022
Via arXiv

Policy-based Primal-Dual Methods for Convex Constrained Markov Decision Processes

May 22, 2022
Via arXiv