Shangqin Mao

Off-Policy Primal-Dual Safe Reinforcement Learning

Jan 26, 2024
Via arXiv

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Dec 29, 2023
Via arXiv

Safe Offline Reinforcement Learning with Real-Time Budget Constraints

Jun 01, 2023
Via arXiv