Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chaitanya Kharyal

GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Jan 03, 2024

Chaitanya Kharyal, Sai Krishna Gottipati, Tanmay Kumar Sinha, Srijita Das, Matthew E. Taylor

Figure 1 for GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Figure 2 for GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Figure 3 for GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Figure 4 for GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Abstract:One of the final frontiers in the development of complex human - AI collaborative systems is the ability of AI agents to comprehend the natural language and perform tasks accordingly. However, training efficient Reinforcement Learning (RL) agents grounded in natural language has been a long-standing challenge due to the complexity and ambiguity of the language and sparsity of the rewards, among other factors. Several advances in reinforcement learning, curriculum learning, continual learning, language models have independently contributed to effective training of grounded agents in various environments. Leveraging these developments, we present a novel algorithm, Grounded Language Instruction through DEmonstration in RL (GLIDE-RL) that introduces a teacher-instructor-student curriculum learning framework for training an RL agent capable of following natural language instructions that can generalize to previously unseen language instructions. In this multi-agent framework, the teacher and the student agents learn simultaneously based on the student's current skill level. We further demonstrate the necessity for training the student agent with not just one, but multiple teacher agents. Experiments on a complex sparse reward environment validates the effectiveness of our proposed approach.

* 12 pages, 6 figures, to be presented at AAMAS 2024

Via

Access Paper or Ask Questions

Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

Aug 27, 2022

D. A. Sasi Kiran, Kritika Anand, Chaitanya Kharyal, Gulshan Kumar, Nandiraju Gireesh, Snehasis Banerjee, Ruddra dev Roychoudhury, Mohan Sridharan, Brojeshwar Bhowmick, Madhava Krishna

Figure 1 for Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

Figure 2 for Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

Figure 3 for Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

Figure 4 for Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

Abstract:This paper describes a framework for the object-goal navigation task, which requires a robot to find and move to the closest instance of a target object class from a random starting position. The framework uses a history of robot trajectories to learn a Spatial Relational Graph (SRG) and Graph Convolutional Network (GCN)-based embeddings for the likelihood of proximity of different semantically-labeled regions and the occurrence of different object classes in these regions. To locate a target object instance during evaluation, the robot uses Bayesian inference and the SRG to estimate the visible regions, and uses the learned GCN embeddings to rank visible regions and select the region to explore next.

* CASE 2022 paper

Via

Access Paper or Ask Questions

RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

Mar 18, 2021

Karnik Ram, Chaitanya Kharyal, Sudarshan S. Harithas, K. Madhava Krishna

Figure 1 for RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

Figure 2 for RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

Figure 3 for RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

Figure 4 for RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

Abstract:Modern visual-inertial navigation systems (VINS) are faced with a critical challenge in real-world deployment: they need to operate reliably and robustly in highly dynamic environments. Current best solutions merely filter dynamic objects as outliers based on the semantics of the object category. Such an approach does not scale as it requires semantic classifiers to encompass all possibly-moving object classes; this is hard to define, let alone deploy. On the other hand, many real-world environments exhibit strong structural regularities in the form of planes such as walls and ground surfaces, which are also crucially static. We present RP-VIO, a monocular visual-inertial odometry system that leverages the simple geometry of these planes for improved robustness and accuracy in challenging dynamic environments. Since existing datasets have a limited number of dynamic elements, we also present a highly-dynamic, photorealistic synthetic dataset for a more effective evaluation of the capabilities of modern VINS systems. We evaluate our approach on this dataset, and three diverse sequences from standard datasets including two real-world dynamic sequences and show a significant improvement in robustness and accuracy over a state-of-the-art monocular visual-inertial odometry system. We also show in simulation an improvement over a simple dynamic-features masking approach. Our code and dataset are publicly available.

* Submitted to IROS 21, code and dataset available at https://github.com/karnikram/rp-vio

Via

Access Paper or Ask Questions