Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cheolhong Min

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Jul 26, 2024

Taewoong Kim, Cheolhong Min, Byeonghwi Kim, Jinyeon Kim, Wonje Jeung, Jonghyun Choi

Figure 1 for ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Figure 2 for ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Figure 3 for ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Figure 4 for ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Abstract:Simulated virtual environments have been widely used to learn robotic agents that perform daily household tasks. These environments encourage research progress by far, but often provide limited object interactability, visual appearance different from real-world environments, or relatively smaller environment sizes. This prevents the learned models in the virtual scenes from being readily deployable. To bridge the gap between these learning environments and deploying (i.e., real) environments, we propose the ReALFRED benchmark that employs real-world scenes, objects, and room layouts to learn agents to complete household tasks by understanding free-form language instructions and interacting with objects in large, multi-room and 3D-captured scenes. Specifically, we extend the ALFRED benchmark with updates for larger environmental spaces with smaller visual domain gaps. With ReALFRED, we analyze previously crafted methods for the ALFRED benchmark and observe that they consistently yield lower performance in all metrics, encouraging the community to develop methods in more realistic environments. Our code and data are publicly available.

* ECCV 2024 (Project page: https://twoongg.github.io/projects/realfred)

Via

Access Paper or Ask Questions

Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Aug 22, 2023

Byeonghwi Kim, Jinyeon Kim, Yuyeong Kim, Cheolhong Min, Jonghyun Choi

Figure 1 for Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Figure 2 for Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Figure 3 for Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Figure 4 for Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Abstract:Accomplishing household tasks requires to plan step-by-step actions considering the consequences of previous actions. However, the state-of-the-art embodied agents often make mistakes in navigating the environment and interacting with proper objects due to imperfect learning by imitating experts or algorithmic planners without such knowledge. To improve both visual navigation and object interaction, we propose to consider the consequence of taken actions by CAPEAM (Context-Aware Planning and Environment-Aware Memory) that incorporates semantic context (e.g., appropriate objects to interact with) in a sequence of actions, and the changed spatial arrangement and states of interacted objects (e.g., location that the object has been moved to) in inferring the subsequent actions. We empirically show that the agent with the proposed CAPEAM achieves state-of-the-art performance in various metrics using a challenging interactive instruction following benchmark in both seen and unseen environments by large margins (up to +10.70% in unseen env.).

* ICCV 2023

Via

Access Paper or Ask Questions