Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Matthias Hutsebaut-Buysse

Structured Exploration Through Instruction Enhancement for Object Navigation

Nov 15, 2022

Matthias Hutsebaut-Buysse, Kevin Mets, Tom De Schepper, Steven Latré

Figure 1 for Structured Exploration Through Instruction Enhancement for Object Navigation

Figure 2 for Structured Exploration Through Instruction Enhancement for Object Navigation

Figure 3 for Structured Exploration Through Instruction Enhancement for Object Navigation

Figure 4 for Structured Exploration Through Instruction Enhancement for Object Navigation

Abstract:Finding an object of a specific class in an unseen environment remains an unsolved navigation problem. Hence, we propose a hierarchical learning-based method for object navigation. The top-level is capable of high-level planning, and building a memory on a floorplan-level (e.g., which room makes the most sense for the agent to visit next, where has the agent already been?). While the lower-level is tasked with efficiently navigating between rooms and looking for objects in them. Instructions can be provided to the agent using a simple synthetic language. The top-level intelligently enhances the instructions in order to make the overall task more tractable. Language grounding, mapping instructions to visual observations, is performed by utilizing an additional separate supervised trained goal assessment module. We demonstrate the effectiveness of our method on a dynamic configurable domestic environment.

* Paper accepted to the BNAIC/BeNeLearn 2022 conference

Via

Access Paper or Ask Questions

Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Jul 10, 2020

Matthias Hutsebaut-Buysse, Kevin Mets, Steven Latré

Figure 1 for Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Figure 2 for Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Figure 3 for Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Figure 4 for Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Abstract:Reinforcement learning (RL) algorithms typically start tabula rasa, without any prior knowledge of the environment, and without any prior skills. This however often leads to low sample efficiency, requiring a large amount of interaction with the environment. This is especially true in a lifelong learning setting, in which the agent needs to continually extend its capabilities. In this paper, we examine how a pre-trained task-independent language model can make a goal-conditional RL agent more sample efficient. We do this by facilitating transfer learning between different related tasks. We experimentally demonstrate our approach on a set of object navigation tasks.

* Paper accepted to the ICML 2020 Language in Reinforcement Learning (LaReL) Workshop

Via

Access Paper or Ask Questions

Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Oct 09, 2019

Matthias Hutsebaut-Buysse, Kevin Mets, Steven Latré

Figure 1 for Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Figure 2 for Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Figure 3 for Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Figure 4 for Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Abstract:Over its lifetime, a reinforcement learning agent is often tasked with different tasks. How to efficiently adapt a previously learned control policy from one task to another, remains an open research question. In this paper, we investigate how instructions formulated in natural language can enable faster and more effective task adaptation. This can serve as the basis for developing language instructed skills, which can be used in a lifelong learning setting. Our method is capable of assessing, given a set of developed base control policies, which policy will adapt best to a new unseen task.

Via

Access Paper or Ask Questions