Picture for Qing-Shan Jia

Qing-Shan Jia

Query-Policy Misalignment in Preference-Based Reinforcement Learning

Add code
May 27, 2023
Viaarxiv icon

Mind the Gap: Offline Policy Optimization for Imperfect Rewards

Add code
Feb 03, 2023
Viaarxiv icon

Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method

Add code
Oct 31, 2021
Figure 1 for Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Figure 2 for Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Viaarxiv icon

An Actor-Critic Method for Simulation-Based Optimization

Add code
Oct 31, 2021
Figure 1 for An Actor-Critic Method for Simulation-Based Optimization
Figure 2 for An Actor-Critic Method for Simulation-Based Optimization
Figure 3 for An Actor-Critic Method for Simulation-Based Optimization
Figure 4 for An Actor-Critic Method for Simulation-Based Optimization
Viaarxiv icon