Paniz Behboudian

Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning

Apr 14, 2023
Via arXiv

Useful Policy Invariant Shaping from Arbitrary Advice

Nov 02, 2020
Via arXiv