Sanae Amani

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

Dec 04, 2024

Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing

Jul 11, 2023

Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation

Jun 01, 2022

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

May 26, 2022

Safe Reinforcement Learning with Linear Function Approximation

Jun 11, 2021

UCB-based Algorithms for Multinomial Logistic Regression Bandits

Mar 21, 2021

Decentralized Multi-Agent Linear Bandits with Safety Constraints

Dec 01, 2020

Regret Bounds for Safe Gaussian Process Bandit Optimization

May 05, 2020

Safe Linear Thompson Sampling

Nov 06, 2019

Linear Stochastic Bandits Under Safety Constraints

Aug 16, 2019