Title: OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds

Feb 05, 2025

Fan Wang, Pengtao Shao, Yiming Zhang, Bo Yu, Shaoshan Liu, Ning Ding, Yang Cao, Yu Kang, Haifeng Wang

Abstract: We introduce OmniRL, a highly generalizable in-context reinforcement learning (ICRL) model that is meta-trained on hundreds of thousands of diverse tasks. These tasks are procedurally generated by randomizing state transitions and rewards within Markov Decision Processes. To facilitate this extensive meta-training, we propose two key innovations: (1) an efficient data synthesis pipeline for ICRL, which leverages the interaction histories of diverse behavior policies; and (2) a novel modeling framework that integrates both imitation learning and reinforcement learning (RL) within the context by incorporating prior knowledge. For the first time, we demonstrate that in-context learning (ICL) alone, without any gradient-based fine-tuning, can successfully tackle unseen Gymnasium tasks through imitation learning, online RL, or offline RL. Additionally, we show that achieving generalized ICRL capabilities, unlike task-identification-oriented few-shot learning, critically depends on long trajectories generated by variant tasks and diverse behavior policies. By emphasizing the potential of ICL and departing from pre-training focused on acquiring specific skills, we further underscore the significance of meta-training aimed at cultivating the ability of ICL itself.
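
To make the idea of "randomizing state transitions and rewards within Markov Decision Processes" concrete, here is a minimal sketch of what one randomly generated tabular task and one interaction history from a behavior policy could look like. It is an illustration under stated assumptions, not the authors' task generator or data pipeline: the function names (`make_random_mdp`, `rollout`, `uniform_policy`), the sampling distributions, and the sizes are all hypothetical.

```python
import random


def make_random_mdp(n_states, n_actions, seed):
    """Sample a toy tabular MDP with random transitions and rewards.

    Hypothetical helper: the distributions used here (normalized uniform
    weights, Gaussian rewards) are assumptions, not taken from the paper.
    """
    rng = random.Random(seed)
    transitions = []  # transitions[s][a] is a distribution over next states
    rewards = []      # rewards[s][a] is a scalar reward
    for _ in range(n_states):
        t_row, r_row = [], []
        for _ in range(n_actions):
            weights = [rng.random() for _ in range(n_states)]
            total = sum(weights)
            t_row.append([w / total for w in weights])
            r_row.append(rng.gauss(0.0, 1.0))
        transitions.append(t_row)
        rewards.append(r_row)
    return transitions, rewards


def uniform_policy(n_actions):
    """A deliberately simple behavior policy: pick actions uniformly at random."""
    def policy(state, rng):
        return rng.randrange(n_actions)
    return policy


def rollout(transitions, rewards, policy, steps, seed):
    """Collect an interaction history of (state, action, reward, next_state)."""
    rng = random.Random(seed)
    n_states = len(transitions)
    state = 0
    history = []
    for _ in range(steps):
        action = policy(state, rng)
        reward = rewards[state][action]
        next_state = rng.choices(
            range(n_states), weights=transitions[state][action]
        )[0]
        history.append((state, action, reward, next_state))
        state = next_state
    return history


# One randomized "world" and one trajectory from one behavior policy.
transitions, rewards = make_random_mdp(n_states=5, n_actions=3, seed=0)
history = rollout(transitions, rewards, uniform_policy(3), steps=20, seed=1)
```

Varying `seed` in `make_random_mdp` yields a different task each time, and swapping `uniform_policy` for other policies yields different interaction histories for the same task. The abstract attributes generalized ICRL to long trajectories from many such tasks and behavior policies; this toy example only shows the shape of the data, not that scale.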

* Preprint
