Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yiduo Wang

DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM

Jan 21, 2025

Jesse Morris, Yiduo Wang, Mikolaj Kliniewski, Viorela Ila

Abstract:Traditional Visual Simultaneous Localization and Mapping (vSLAM) systems focus solely on static scene structures, overlooking dynamic elements in the environment. Although effective for accurate visual odometry in complex scenarios, these methods discard crucial information about moving objects. By incorporating this information into a Dynamic SLAM framework, the motion of dynamic entities can be estimated, enhancing navigation whilst ensuring accurate localization. However, the fundamental formulation of Dynamic SLAM remains an open challenge, with no consensus on the optimal approach for accurate motion estimation within a SLAM pipeline. Therefore, we developed DynoSAM, an open-source framework for Dynamic SLAM that enables the efficient implementation, testing, and comparison of various Dynamic SLAM optimization formulations. DynoSAM integrates static and dynamic measurements into a unified optimization problem solved using factor graphs, simultaneously estimating camera poses, static scene, object motion or poses, and object structures. We evaluate DynoSAM across diverse simulated and real-world datasets, achieving state-of-the-art motion estimation in indoor and outdoor environments, with substantial improvements over existing systems. Additionally, we demonstrate DynoSAM utility in downstream applications, including 3D reconstruction of dynamic scenes and trajectory prediction, thereby showcasing potential for advancing dynamic object-aware SLAM systems. DynoSAM is open-sourced at https://github.com/ACFR-RPG/DynOSAM.

* 20 pages, 10 figures. Submitted to T-RO Visual SLAM SI 2025

Via

Access Paper or Ask Questions

DynORecon: Dynamic Object Reconstruction for Navigation

Sep 30, 2024

Yiduo Wang, Jesse Morris, Lan Wu, Teresa Vidal-Calleja, Viorela Ila

Abstract:This paper presents DynORecon, a Dynamic Object Reconstruction system that leverages the information provided by Dynamic SLAM to simultaneously generate a volumetric map of observed moving entities while estimating free space to support navigation. By capitalising on the motion estimations provided by Dynamic SLAM, DynORecon continuously refines the representation of dynamic objects to eliminate residual artefacts from past observations and incrementally reconstructs each object, seamlessly integrating new observations to capture previously unseen structures. Our system is highly efficient (~20 FPS) and produces accurate (~10 cm) reconstructions of dynamic objects using simulated and real-world outdoor datasets.

* 7 pages, 6 figures, submitted to ICRA 2025

Via

Access Paper or Ask Questions

The Importance of Coordinate Frames in Dynamic SLAM

Dec 07, 2023

Jesse Morris, Yiduo Wang, Viorela Ila

Abstract:Most Simultaneous localisation and mapping (SLAM) systems have traditionally assumed a static world, which does not align with real-world scenarios. To enable robots to safely navigate and plan in dynamic environments, it is essential to employ representations capable of handling moving objects. Dynamic SLAM is an emerging field in SLAM research as it improves the overall system accuracy while providing additional estimation of object motions. State-of-the-art literature informs two main formulations for Dynamic SLAM, representing dynamic object points in either the world or object coordinate frame. While expressing object points in a local reference frame may seem intuitive, it may not necessarily lead to the most accurate and robust solutions. This paper conducts and presents a thorough analysis of various Dynamic SLAM formulations, identifying the best approach to address the problem. To this end, we introduce a front-end agnostic framework using GTSAM that can be used to evaluate various Dynamic SLAM formulations.

* 7 pages, 4 figures, submitted to ICRA 2024

Via

Access Paper or Ask Questions

3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Jul 25, 2022

Yifu Tao, Marija Popović, Yiduo Wang, Sundara Tejaswi Digumarti, Nived Chebrolu, Maurice Fallon

Figure 1 for 3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Figure 2 for 3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Figure 3 for 3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Figure 4 for 3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Abstract:Safe motion planning in robotics requires planning into space which has been verified to be free of obstacles. However, obtaining such environment representations using lidars is challenging by virtue of the sparsity of their depth measurements. We present a learning-aided 3D lidar reconstruction framework that upsamples sparse lidar depth measurements with the aid of overlapping camera images so as to generate denser reconstructions with more definitively free space than can be achieved with the raw lidar measurements alone. We use a neural network with an encoder-decoder structure to predict dense depth images along with depth uncertainty estimates which are fused using a volumetric mapping system. We conduct experiments on real-world outdoor datasets captured using a handheld sensing device and a legged robot. Using input data from a 16-beam lidar mapping a building network, our experiments showed that the amount of estimated free space was increased by more than 40% with our approach. We also show that our approach trained on a synthetic dataset generalises well to real-world outdoor scenes without additional fine-tuning. Finally, we demonstrate how motion planning tasks can benefit from these denser reconstructions.

* Accepted at IROS 2022; Video at https://www.youtube.com/watch?v=db3HetMq5h4

Via

Access Paper or Ask Questions

Scalable and Elastic LiDAR Reconstruction in Complex Environments Through Spatial Analysis

Jun 29, 2021

Yiduo Wang, Milad Ramezani, Matias Mattamala, Maurice Fallon

Figure 1 for Scalable and Elastic LiDAR Reconstruction in Complex Environments Through Spatial Analysis

Figure 2 for Scalable and Elastic LiDAR Reconstruction in Complex Environments Through Spatial Analysis

Figure 3 for Scalable and Elastic LiDAR Reconstruction in Complex Environments Through Spatial Analysis

Figure 4 for Scalable and Elastic LiDAR Reconstruction in Complex Environments Through Spatial Analysis

Abstract:This paper presents novel strategies for spawning and fusing submaps within an elastic dense 3D reconstruction system. The proposed system uses spatial understanding of the scanned environment to control memory usage growth by fusing overlapping submaps in different ways. This allows the number of submaps and memory consumption to scale with the size of the environment rather than the duration of exploration. By analysing spatial overlap, our system segments distinct spaces, such as rooms and stairwells on the fly during exploration. Additionally, we present a new mathematical formulation of relative uncertainty between poses to improve the global consistency of the reconstruction. Performance is demonstrated using a multi-floor multi-room indoor experiment, a large-scale outdoor experiment and a simulated dataset. Relative to our baseline, the presented approach demonstrates improved scalability and accuracy.

* 8 pages, 9 figures

Via

Access Paper or Ask Questions

Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Oct 19, 2020

Yiduo Wang, Nils Funk, Milad Ramezani, Sotiris Papatheodorou, Marija Popovic, Marco Camurri, Stefan Leutenegger, Maurice Fallon

Figure 1 for Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Figure 2 for Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Figure 3 for Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Figure 4 for Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Abstract:We present an efficient, elastic 3D LiDAR reconstruction framework which can reconstruct up to maximum LiDAR ranges (60 m) at multiple frames per second, thus enabling robot exploration in large-scale environments. Our approach only requires a CPU. We focus on three main challenges of large-scale reconstruction: integration of long-range LiDAR scans at high frequency, the capacity to deform the reconstruction after loop closures are detected, and scalability for long-duration exploration. Our system extends upon a state-of-the-art efficient RGB-D volumetric reconstruction technique, called supereight, to support LiDAR scans and a newly developed submapping technique to allow for dynamic correction of the 3D reconstruction. We then introduce a novel pose graph sparsification and submap fusion feature to make our system more scalable for large environments. We evaluate the performance using a published dataset captured by a handheld mapping device scanning a set of buildings, and with a mobile robot exploring an underground room network. Experimental results demonstrate that our system can reconstruct at 3 Hz with 60 m sensor range and ~5 cm resolution, while state-of-the-art approaches can only reconstruct to 25 cm resolution or 20 m range at the same frequency.

* 8 pages, 8 figures

Via

Access Paper or Ask Questions

The Newer College Dataset: Handheld LiDAR, Inertial and Vision with Ground Truth

Mar 12, 2020

Milad Ramezani, Yiduo Wang, Marco Camurri, David Wisth, Matias Mattamala, Maurice Fallon

Figure 1 for The Newer College Dataset: Handheld LiDAR, Inertial and Vision with Ground Truth

Figure 2 for The Newer College Dataset: Handheld LiDAR, Inertial and Vision with Ground Truth

Figure 3 for The Newer College Dataset: Handheld LiDAR, Inertial and Vision with Ground Truth

Figure 4 for The Newer College Dataset: Handheld LiDAR, Inertial and Vision with Ground Truth

Abstract:In this paper we present a large dataset with a variety of mobile mapping sensors collected using a handheld device carried at typical walking speeds for nearly 2.2 km through New College, Oxford. The dataset includes data from two commercially available devices - a stereoscopic-inertial camera and a multi-beam 3D LiDAR, which also provides inertial measurements. Additionally, we used a tripod-mounted survey grade LiDAR scanner to capture a detailed millimeter-accurate 3D map of the test location (containing $\sim$290 million points). Using the map we inferred centimeter-accurate 6 Degree of Freedom (DoF) ground truth for the position of the device for each LiDAR scan to enable better evaluation of LiDAR and vision localisation, mapping and reconstruction systems. This ground truth is the particular novel contribution of this dataset and we believe that it will enable systematic evaluation which many similar datasets have lacked. The dataset combines both built environments, open spaces and vegetated areas so as to test localization and mapping systems such as vision-based navigation, visual and LiDAR SLAM, 3D LIDAR reconstruction and appearance-based place recognition. The dataset is available at: ori.ox.ac.uk/datasets/newer-college-dataset

Via

Access Paper or Ask Questions

Actively Mapping Industrial Structures with Information Gain-Based Planning on a Quadruped Robot

Feb 22, 2020

Yiduo Wang, Milad Ramezani, Maurice Fallon

Figure 1 for Actively Mapping Industrial Structures with Information Gain-Based Planning on a Quadruped Robot

Figure 2 for Actively Mapping Industrial Structures with Information Gain-Based Planning on a Quadruped Robot

Figure 3 for Actively Mapping Industrial Structures with Information Gain-Based Planning on a Quadruped Robot

Figure 4 for Actively Mapping Industrial Structures with Information Gain-Based Planning on a Quadruped Robot

Abstract:In this paper, we develop an online active mapping system to enable a quadruped robot to autonomously survey large physical structures. We describe the perception, planning and control modules needed to scan and reconstruct an object of interest, without requiring a prior model. The system builds a voxel representation of the object, and iteratively determines the Next-Best-View (NBV) to extend the representation, according to both the reconstruction itself and to avoid collisions with the environment. By computing the expected information gain of a set of candidate scan locations sampled on the as-sensed terrain map, as well as the cost of reaching these candidates, the robot decides the NBV for further exploration. The robot plans an optimal path towards the NBV, avoiding obstacles and un-traversable terrain. Experimental results on both simulated and real-world environments show the capability and efficiency of our system. Finally we present a full system demonstration on the real robot, the ANYbotics ANYmal, autonomously reconstructing a building facade and an industrial structure.

Via

Access Paper or Ask Questions