Picture for Mario Bravo

Mario Bravo

ISCI

Mixing Times and Privacy Analysis for the Projected Langevin Algorithm under a Modulus of Continuity

Add code
Jan 07, 2025
Viaarxiv icon

Stochastic Halpern iteration in normed spaces and applications to reinforcement learning

Add code
Mar 19, 2024
Viaarxiv icon

Bandit learning in concave $N$-person games

Add code
Oct 03, 2018
Figure 1 for Bandit learning in concave $N$-person games
Viaarxiv icon

On the robustness of learning in games with stochastically perturbed payoff observations

Add code
Jun 02, 2016
Figure 1 for On the robustness of learning in games with stochastically perturbed payoff observations
Figure 2 for On the robustness of learning in games with stochastically perturbed payoff observations
Viaarxiv icon

Reinforcement learning with restrictions on the action set

Add code
Jun 12, 2013
Figure 1 for Reinforcement learning with restrictions on the action set
Figure 2 for Reinforcement learning with restrictions on the action set
Figure 3 for Reinforcement learning with restrictions on the action set
Figure 4 for Reinforcement learning with restrictions on the action set
Viaarxiv icon