Multimodal large language models (MLLMs) have recently demonstrated strong visual understanding and decision-making capabilities, opening up the possibility of autonomously improving MLLMs in unknown environments. However, external feedback, such as human supervision or environmental signals, is not always available. To address this challenge, existing methods primarily focus on enhancing the decision-making capabilities of MLLMs through voting and scoring mechanisms, while little attention has been paid to improving their environmental comprehension in unknown environments. To fully unleash the self-learning potential of MLLMs, we propose SELU, a novel self-learning paradigm inspired by the actor-critic framework in reinforcement learning. The critic employs self-asking and hindsight relabeling to extract knowledge from the interaction trajectories collected by the actor, thereby improving the critic's environmental comprehension. Simultaneously, the actor is refined by the self-feedback provided by the critic, enhancing its decision-making. We evaluate our method in the AI2-THOR and VirtualHome environments, where SELU achieves critic improvements of approximately 28% and 30%, respectively, and actor improvements of about 20% and 24%, through self-learning.
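
As an illustration, the following minimal Python sketch captures the structure of one self-learning round as the abstract describes it: the actor collects trajectories, the critic improves itself via self-asking and hindsight relabeling, and the actor is then fine-tuned on the critic's self-feedback. All names here (Actor, Critic, rollout, self_ask, hindsight_relabel, judge, finetune) are hypothetical placeholders standing in for MLLM prompting and fine-tuning calls; this is a sketch of the loop structure under those assumptions, not the actual SELU implementation.

```python
# Minimal sketch of a SELU-style actor-critic self-learning round.
# All classes and methods are hypothetical placeholders for MLLM calls.
import random
from dataclasses import dataclass, field


@dataclass
class Trajectory:
    task: str                                    # instructed task
    steps: list[tuple[str, str]] = field(default_factory=list)  # (obs, action)
    success: bool = False                        # environment outcome


class Actor:
    """MLLM policy: maps observations to actions (mocked here)."""

    def rollout(self, task: str) -> Trajectory:
        traj = Trajectory(task=task)
        traj.steps = [("obs_0", "move"), ("obs_1", "pick")]  # placeholder steps
        traj.success = random.random() > 0.5                 # placeholder outcome
        return traj

    def finetune(self, labeled: list[tuple[Trajectory, bool]]) -> None:
        pass  # placeholder: fine-tune the actor on critic-labeled trajectories


class Critic:
    """MLLM evaluator: builds environment knowledge, then judges the actor."""

    def self_ask(self, traj: Trajectory) -> list[tuple[str, str]]:
        # Self-asking: generate QA pairs grounded in the trajectory.
        return [(f"Did the agent finish '{traj.task}'?", str(traj.success))]

    def hindsight_relabel(self, traj: Trajectory) -> Trajectory:
        # Relabel a failed trajectory with the goal it actually achieved,
        # so failures still yield correct (task, trajectory) training data.
        if not traj.success:
            return Trajectory(task="goal actually achieved",
                              steps=traj.steps, success=True)
        return traj

    def finetune(self, qa_pairs, relabeled) -> None:
        pass  # placeholder: fine-tune the critic on self-generated data

    def judge(self, traj: Trajectory) -> bool:
        return traj.success  # placeholder for the critic's self-feedback


def self_learning_round(actor: Actor, critic: Critic, tasks: list[str]) -> None:
    trajs = [actor.rollout(t) for t in tasks]              # 1. collect experience
    qa = [p for tr in trajs for p in critic.self_ask(tr)]  # 2. self-asking
    relabeled = [critic.hindsight_relabel(tr) for tr in trajs]
    critic.finetune(qa, relabeled)                         # 3. improve the critic
    labels = [(tr, critic.judge(tr)) for tr in trajs]      # 4. self-feedback
    actor.finetune(labels)                                 # 5. improve the actor


if __name__ == "__main__":
    self_learning_round(Actor(), Critic(), ["find the mug", "turn on the lamp"])
```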