Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Mar 21, 2025

Chan Kim, Seung-Woo Seo, Seong-Woo Kim

Figure 1 for Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Figure 2 for Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Figure 3 for Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Figure 4 for Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Deep Reinforcement Learning (DRL) has demonstrated strong performance in robotic control but remains susceptible to out-of-distribution (OOD) states, often resulting in unreliable actions and task failure. While previous methods have focused on minimizing or preventing OOD occurrences, they largely neglect recovery once an agent encounters such states. Although the latest research has attempted to address this by guiding agents back to in-distribution states, their reliance on uncertainty estimation hinders scalability in complex environments. To overcome this limitation, we introduce Language Models for Out-of-Distribution Recovery (LaMOuR), which enables recovery learning without relying on uncertainty estimation. LaMOuR generates dense reward codes that guide the agent back to a state where it can successfully perform its original task, leveraging the capabilities of LVLMs in image description, logical reasoning, and code generation. Experimental results show that LaMOuR substantially enhances recovery efficiency across diverse locomotion tasks and even generalizes effectively to complex environments, including humanoid locomotion and mobile manipulation, where existing methods struggle. The code and supplementary materials are available at \href{https://lamour-rl.github.io/}{https://lamour-rl.github.io/}.

* 14 pages, 17 figures

View paper on

Share this with someone who'll enjoy it:

Title:Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning

Paper and Code