Aida Afshar

Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

Aug 07, 2024

Aida Afshar, Aldo Pacchiano

Abstract: The performance of reinforcement learning (RL) algorithms is sensitive to the choice of hyperparameters, with the learning rate being particularly influential. RL algorithms can fail to converge or demand a large number of samples when the learning rate is not set well. In this work, we show that model selection can help mitigate the failure modes of RL that are due to suboptimal choices of learning rate. We present a framework for Learning Rate-Free Reinforcement Learning that employs model selection methods to select the optimal learning rate on the fly. This approach to adaptive learning rate tuning depends on neither the underlying RL algorithm nor the optimizer and uses only the reward feedback to select the learning rate; hence, the framework can take any RL algorithm as input and produce a learning rate-free version of it. We conduct experiments for policy optimization methods and evaluate various model selection strategies within our framework. Our results indicate that data-driven model selection algorithms are better alternatives to standard bandit algorithms when the optimal choice of hyperparameter is time-dependent and non-stationary.

* RLC 2024 Workshop on Failure Modes of Sequential Decision-Making in Practice
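
As a rough illustration of the idea in this abstract, the sketch below picks a learning rate each round from reward feedback alone, using plain UCB1 over a fixed grid. This is not the authors' method: UCB1 stands in for the "standard bandit algorithm" baseline that the abstract says is outperformed by data-driven model selection in non-stationary settings. The names `UCBSelector`, `toy_episode_reward`, and `run`, the grid of learning rates, and the toy reward function are all hypothetical. In a real setup, each candidate learning rate would belong to its own base learner that keeps training state between rounds.

```python
import math
import random


class UCBSelector:
    """UCB1 over a fixed grid of candidate learning rates (illustrative baseline)."""

    def __init__(self, learning_rates):
        self.learning_rates = list(learning_rates)
        self.counts = [0] * len(self.learning_rates)
        self.reward_sums = [0.0] * len(self.learning_rates)
        self.total = 0

    def select(self):
        # Try every candidate once before relying on confidence bounds.
        for i, count in enumerate(self.counts):
            if count == 0:
                return i
        scores = [
            self.reward_sums[i] / self.counts[i]
            + math.sqrt(2.0 * math.log(self.total) / self.counts[i])
            for i in range(len(self.learning_rates))
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, index, reward):
        # The reward is assumed to be scaled to [0, 1].
        self.counts[index] += 1
        self.reward_sums[index] += reward
        self.total += 1


def toy_episode_reward(lr, rng):
    """Hypothetical stand-in for the normalized return of one training episode."""
    # Assumption: the toy reward peaks at lr = 1e-3 and decays on a log scale.
    quality = math.exp(-(math.log10(lr) + 3.0) ** 2)
    return min(1.0, max(0.0, quality + rng.gauss(0.0, 0.05)))


def run(num_rounds=500, seed=0):
    rng = random.Random(seed)
    selector = UCBSelector([1e-5, 1e-4, 1e-3, 1e-2, 1e-1])
    for _ in range(num_rounds):
        i = selector.select()
        reward = toy_episode_reward(selector.learning_rates[i], rng)
        selector.update(i, reward)
    return selector
```

Calling `run()` returns the selector, whose `counts` list shows how often each candidate learning rate was chosen. Because the toy reward here is stationary, a standard bandit is adequate; the abstract's point is that this stops being true when the best learning rate changes over the course of training.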

DeLF: Designing Learning Environments with Foundation Models

Jan 17, 2024

Aida Afshar, Wenchao Li

Abstract: Reinforcement learning (RL) offers a capable and intuitive structure for the fundamental sequential decision-making problem. Despite impressive breakthroughs, it can still be difficult to employ RL in practice, even in many simple applications. In this paper, we try to address this issue by introducing a method for designing the components of the RL environment for a given, user-intended application. We provide an initial formalization of the problem of RL component design, which concentrates on designing a good representation for the observation and action spaces. We propose a method named DeLF: Designing Learning Environments with Foundation Models, which employs large language models to design and codify the user's intended learning scenario. By testing our method on four different learning environments, we demonstrate that DeLF can obtain executable environment code for the corresponding RL problems.

* AAAI 2024 Workshop on Synergy of Reinforcement Learning and Large Language Models
