Picture for Mehdi Jafarnia-Jahromi

Mehdi Jafarnia-Jahromi

Learning Zero-sum Stochastic Games with Posterior Sampling

Add code
Sep 08, 2021
Viaarxiv icon

Online Learning for Cooperative Multi-Player Multi-Armed Bandits

Add code
Sep 07, 2021
Figure 1 for Online Learning for Cooperative Multi-Player Multi-Armed Bandits
Figure 2 for Online Learning for Cooperative Multi-Player Multi-Armed Bandits
Viaarxiv icon

Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path

Add code
Jun 15, 2021
Figure 1 for Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
Figure 2 for Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
Figure 3 for Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
Figure 4 for Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
Viaarxiv icon

Online Learning for Stochastic Shortest Path Model via Posterior Sampling

Add code
Jun 09, 2021
Figure 1 for Online Learning for Stochastic Shortest Path Model via Posterior Sampling
Viaarxiv icon

Online Learning for Unknown Partially Observable MDPs

Add code
Feb 25, 2021
Viaarxiv icon

Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

Add code
Jul 23, 2020
Figure 1 for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Viaarxiv icon

A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret

Add code
Jun 08, 2020
Figure 1 for A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Figure 2 for A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Figure 3 for A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Viaarxiv icon

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

Add code
Oct 15, 2019
Figure 1 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Figure 2 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Viaarxiv icon

PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning

Add code
Dec 25, 2018
Figure 1 for PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
Figure 2 for PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
Figure 3 for PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
Figure 4 for PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
Viaarxiv icon