EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Jul 21, 2020


View paper on arXiv
