Abstract: Autonomous robot exploration requires a robot to efficiently explore and map unknown environments. Compared to conventional methods that can only optimize paths based on the robot's current belief, learning-based methods show the potential to achieve improved performance by drawing on past experiences to reason about unknown areas. In this paper, we propose DARE, a novel generative approach that leverages diffusion models trained on expert demonstrations and can explicitly generate an exploration path through one-time inference. We build DARE upon an attention-based encoder and a diffusion policy model, and introduce ground-truth optimal demonstrations for training to learn better exploration patterns. The trained planner can reason about the robot's partial belief to recognize potential structure in unknown areas and account for these areas during path planning. Our experiments demonstrate that DARE achieves performance on par with both conventional and learning-based state-of-the-art exploration planners, and generalizes well in both simulations and real-life scenarios.
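The core mechanism this abstract describes, denoising a whole waypoint sequence conditioned on an attention-encoded belief, can be pictured with a minimal PyTorch sketch. This is not DARE's actual code: the module sizes, the belief representation (a set of graph-node features), and the DDPM noise schedule below are all assumptions made for illustration.

```python
# Illustrative sketch only: module names, sizes, belief representation, and the
# DDPM schedule below are assumptions, not DARE's released code.
import torch
import torch.nn as nn

class BeliefEncoder(nn.Module):
    """Hypothetical attention-based encoder over belief-graph node features."""
    def __init__(self, node_dim=8, d_model=128, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Linear(node_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)

    def forward(self, nodes):                    # nodes: (B, N, node_dim)
        return self.encoder(self.embed(nodes))   # belief tokens: (B, N, d_model)

class PathDenoiser(nn.Module):
    """Predicts the noise added to a 2D waypoint sequence, given the belief."""
    def __init__(self, d_model=128, steps=1000):
        super().__init__()
        self.wp_in = nn.Linear(2, d_model)
        self.t_emb = nn.Embedding(steps, d_model)
        self.attn = nn.MultiheadAttention(d_model, 4, batch_first=True)
        self.out = nn.Linear(d_model, 2)

    def forward(self, noisy_path, t, belief):    # noisy_path: (B, H, 2)
        h = self.wp_in(noisy_path) + self.t_emb(t)[:, None, :]
        h, _ = self.attn(h, belief, belief)      # cross-attend to belief tokens
        return self.out(h)                       # predicted noise: (B, H, 2)

@torch.no_grad()
def sample_path(encoder, denoiser, nodes, horizon=16, steps=1000):
    """DDPM-style reverse diffusion: from Gaussian noise to a full path."""
    B = nodes.shape[0]
    belief = encoder(nodes)
    betas = torch.linspace(1e-4, 0.02, steps)
    alphas = 1.0 - betas
    abar = torch.cumprod(alphas, dim=0)
    path = torch.randn(B, horizon, 2)            # start from pure noise
    for t in reversed(range(steps)):
        eps = denoiser(path, torch.full((B,), t, dtype=torch.long), belief)
        mean = (path - betas[t] / torch.sqrt(1.0 - abar[t]) * eps) / torch.sqrt(alphas[t])
        path = mean + (torch.sqrt(betas[t]) * torch.randn_like(path) if t > 0 else 0.0)
    return path                                  # (B, horizon, 2) waypoints
```

The key property this sketch shows is the "one-time inference" claim: the full path is produced by a single reverse-diffusion pass, rather than being planned waypoint-by-waypoint.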
Abstract: Information sharing is critical in time-sensitive and realistic multi-robot exploration, especially for smaller robotic teams in large-scale environments where connectivity may be sparse and intermittent. Existing methods often sidestep such communication constraints by assuming unrealistic global connectivity. Other works do account for communication constraints (by maintaining close proximity or line of sight during information exchange), but are often inefficient: preplanned rendezvous approaches typically involve unnecessary detours resulting from poorly timed rendezvous, while pursuit-based approaches often make short-sighted decisions due to their greedy nature. We present IR2, a deep reinforcement learning approach to information sharing for multi-robot exploration. Leveraging attention-based neural networks trained via reinforcement and curriculum learning, IR2 allows robots to effectively reason about the longer-term trade-offs between disconnecting for solo exploration and reconnecting for information sharing. In addition, we propose a hierarchical graph formulation that maintains a sparse yet informative graph, enabling our approach to scale to large environments. Simulation results in three large-scale Gazebo environments show that our approach yields 6.6-34.1% shorter exploration paths and significantly improved mapped-area consistency among robots compared to state-of-the-art baselines. Our simulation training and testing code is available at https://github.com/marmotlab/IR2.
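One way to picture the hierarchical graph formulation is a two-level graph that keeps fine-grained nodes near the robot and merges distant nodes into coarse cells. The sketch below (using networkx) is an illustrative construction under assumed conventions, a `pos` node attribute, a fixed local radius, and grid-cell merging, not IR2's exact algorithm.

```python
# Illustrative two-level graph in the spirit of IR2's hierarchical formulation;
# the "pos" attribute, local radius, and grid-cell merging rule are assumptions.
import math
import networkx as nx

def build_hierarchical_graph(dense, robot_pos, local_radius=10.0, cell=8.0):
    """Keep fine-grained nodes near the robot; merge far nodes into coarse cells."""
    sparse = nx.Graph()
    coarse_of = {}                      # far dense node -> coarse cell it joins
    for n, data in dense.nodes(data=True):
        x, y = data["pos"]
        if math.hypot(x - robot_pos[0], y - robot_pos[1]) <= local_radius:
            sparse.add_node(n, pos=(x, y), level="fine")
        else:
            key = ("cell", int(x // cell), int(y // cell))
            coarse_of[n] = key
            if key not in sparse:       # one coarse node per occupied cell
                centre = ((key[1] + 0.5) * cell, (key[2] + 0.5) * cell)
                sparse.add_node(key, pos=centre, level="coarse")
    for u, v in dense.edges():
        uu, vv = coarse_of.get(u, u), coarse_of.get(v, v)
        if uu != vv:
            sparse.add_edge(uu, vv)     # inherit connectivity across levels
    return sparse
```

The payoff of such a construction is that the graph the attention network reasons over stays small and roughly bounded in size as the explored area grows, which is what allows the approach to scale to large environments.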
Abstract: In this paper, we introduce HDPlanner, a deep reinforcement learning (DRL) based framework designed to tackle two core and challenging tasks for mobile robots: autonomous exploration and navigation, where the robot must adaptively optimize its trajectory to achieve the task objective through continuous interactions in unknown environments. Specifically, HDPlanner relies on novel hierarchical attention networks that empower the robot to reason about its belief across multiple spatial scales and to sequence collaborative decisions, decomposing long-term objectives into short-term informative task assignments and informative path plans. We further propose a contrastive learning-based joint optimization to enhance the robustness of HDPlanner. We empirically demonstrate that HDPlanner significantly outperforms state-of-the-art conventional and learning-based baselines on an extensive set of simulations, including hundreds of test maps and large-scale, complex Gazebo environments. Notably, HDPlanner achieves real-time planning with travel distances reduced by up to 35.7% compared to exploration benchmarks and by up to 16.5% compared to navigation benchmarks. Furthermore, we validate our approach on hardware, where it generates high-quality, adaptive trajectories in both indoor and outdoor environments, highlighting its real-world applicability without additional training.
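The abstract does not detail the contrastive objective, but a standard form such a "contrastive learning-based joint optimization" could take is an InfoNCE loss that pulls matched embedding pairs together (for instance, representations of the same belief produced by the task-assignment and path-planning branches). The pairing scheme and temperature below are assumptions, not HDPlanner's published loss.

```python
# A standard InfoNCE loss as one plausible instantiation of the abstract's
# contrastive objective; the pairing scheme and temperature are assumptions.
import torch
import torch.nn.functional as F

def info_nce(anchor, positive, temperature=0.1):
    """anchor, positive: (B, D) embeddings; row i of each forms a positive pair."""
    a = F.normalize(anchor, dim=-1)
    p = F.normalize(positive, dim=-1)
    logits = a @ p.t() / temperature             # (B, B) cosine-similarity matrix
    labels = torch.arange(a.shape[0], device=a.device)
    return F.cross_entropy(logits, labels)       # diagonal pairs are the targets
```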
Abstract: Communication bandwidth is an important consideration in multi-robot exploration, where information exchange among robots is critical. While existing methods typically aim to reduce communication throughput, they either require significant computation or severely compromise exploration efficiency. In this work, we propose a deep reinforcement learning framework based on communication learning and privileged reinforcement learning that achieves a significant reduction in bandwidth consumption while minimally sacrificing exploration efficiency. Specifically, our approach allows robots to learn to embed the most salient information from their individual belief (partial map) over the environment into fixed-sized messages. Robots then reason about their own belief as well as received messages to explore the environment in a distributed manner while avoiding redundant work. To this end, we employ privileged learning and learned attention mechanisms to endow the critic (i.e., teacher) network with ground-truth map knowledge, effectively guiding the policy (i.e., student) network during training. Compared to relevant baselines, our model allows the team to reduce communication by up to two orders of magnitude while sacrificing only a marginal 2.4% in total travel distance, paving the way for efficient, distributed multi-robot exploration in bandwidth-limited scenarios.
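A minimal sketch of the fixed-size message idea: a small CNN that compresses a partial occupancy map of any resolution into a constant-length vector. The network layout and the 64-dimensional message width are illustrative assumptions, not the paper's architecture.

```python
# Sketch of compressing a partial occupancy map into a fixed-size message;
# the CNN layout and 64-dim message width are illustrative assumptions.
import torch
import torch.nn as nn

class MessageEncoder(nn.Module):
    """Maps a (1, H, W) partial map to a fixed-length message vector."""
    def __init__(self, msg_dim=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),             # any map size -> (B, 32, 1, 1)
        )
        self.head = nn.Linear(32, msg_dim)

    def forward(self, partial_map):              # (B, 1, H, W), occupancy in [0, 1]
        z = self.conv(partial_map).flatten(1)    # (B, 32)
        return self.head(z)                      # (B, msg_dim) fixed-size message
```

A 64-dimensional float32 message costs 256 bytes per exchange regardless of map resolution, which illustrates how fixed-size embeddings can cut bandwidth by orders of magnitude relative to transmitting raw partial maps.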