Ahmadreza Moradipari

Cooperative Multi-Agent Constrained Stochastic Linear Bandits

Oct 22, 2024
Via arXiv

Convex Methods for Constrained Linear Bandits

Nov 10, 2023
Via arXiv

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

Oct 30, 2023
Via arXiv

Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation

Jul 26, 2023
Via arXiv

Predicting Parameters for Modeling Traffic Participants

Jan 26, 2023
Via arXiv

Collaborative Multi-agent Stochastic Linear Bandits

May 12, 2022
Via arXiv

Multi-Environment Meta-Learning in Stochastic Linear Bandits

May 12, 2022
Via arXiv

Parameter and Feature Selection in Stochastic Linear Bandits

Jun 09, 2021
Via arXiv

Stage-wise Conservative Linear Bandits

Sep 30, 2020
Via arXiv

Safe Linear Thompson Sampling

Nov 06, 2019
Via arXiv