Picture for Steve Marcus

Steve Marcus

Weighted bandits or: How bandits learn distorted values that are not expected

Add code
Nov 30, 2016
Figure 1 for Weighted bandits or: How bandits learn distorted values that are not expected
Figure 2 for Weighted bandits or: How bandits learn distorted values that are not expected
Figure 3 for Weighted bandits or: How bandits learn distorted values that are not expected
Viaarxiv icon

Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control

Add code
Feb 26, 2016
Figure 1 for Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control
Figure 2 for Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control
Figure 3 for Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control
Figure 4 for Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control
Viaarxiv icon

Adaptive system optimization using random directions stochastic approximation

Add code
Aug 08, 2015
Figure 1 for Adaptive system optimization using random directions stochastic approximation
Figure 2 for Adaptive system optimization using random directions stochastic approximation
Figure 3 for Adaptive system optimization using random directions stochastic approximation
Figure 4 for Adaptive system optimization using random directions stochastic approximation
Viaarxiv icon