Abstract:4D radars are increasingly favored for odometry and mapping of autonomous systems due to their robustness in harsh weather and dynamic environments. Existing datasets, however, often cover limited areas and are typically captured using a single platform. To address this gap, we present a diverse large-scale dataset specifically designed for 4D radar-based localization and mapping. This dataset was gathered using three different platforms: a handheld device, an e-bike, and an SUV, under a variety of environmental conditions, including clear days, nighttime, and heavy rain. The data collection occurred from September 2023 to February 2024, encompassing diverse settings such as roads in a vegetated campus and tunnels on highways. Each route was traversed multiple times to facilitate place recognition evaluations. The sensor suite included a 3D lidar, 4D radars, stereo cameras, consumer-grade IMUs, and a GNSS/INS system. Sensor data packets were synchronized to GNSS time using a two-step process: a convex hull algorithm was applied to smooth host time jitter, and then odometry and correlation algorithms were used to correct constant time offsets. Extrinsic calibration between sensors was achieved through manual measurements and subsequent nonlinear optimization. The reference motion for the platforms was generated by registering lidar scans to a terrestrial laser scanner (TLS) point cloud map using a lidar inertial odometry (LIO) method in localization mode. Additionally, a data reversion technique was introduced to enable backward LIO processing. We believe this dataset will boost research in radar-based point cloud registration, odometry, mapping, and place recognition.
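The constant-offset correction mentioned above lends itself to a simple illustration. Below is a minimal sketch, not the dataset's actual tooling, of estimating a constant time offset between two sensors by cross-correlating their angular-rate magnitudes; it assumes both signals have already been resampled to a common rate, and all names and values are made up for the example.

```python
import numpy as np

def delay_of_b_relative_to_a(sig_a, sig_b, rate_hz):
    """Positive result: sig_b lags sig_a by that many seconds."""
    a = (sig_a - sig_a.mean()) / sig_a.std()
    b = (sig_b - sig_b.mean()) / sig_b.std()
    corr = np.correlate(a, b, mode="full")        # score every relative shift
    k = np.argmax(corr) - (len(b) - 1)            # best shift in samples
    return -k / rate_hz

# Synthetic check: sig_b lags sig_a by 0.05 s, both sampled at 200 Hz.
t = np.arange(0.0, 10.0, 1.0 / 200)
rng = np.random.default_rng(0)
sig_a = np.sin(2 * np.pi * 0.7 * t) + 0.01 * rng.standard_normal(t.size)
sig_b = np.sin(2 * np.pi * 0.7 * (t - 0.05)) + 0.01 * rng.standard_normal(t.size)
print(delay_of_b_relative_to_a(sig_a, sig_b, 200.0))   # ~0.05
```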
Abstract:Visible Light Positioning (VLP) has emerged as a promising technology capable of delivering indoor localization with high accuracy. In VLP systems that use photodiodes (PDs) as light receivers, the Received Signal Strength (RSS) is affected by the incidence angle of light, making the inclination of the PDs a critical parameter in the positioning model. Most studies assume the inclination to be constant, which limits applications and positioning accuracy. Additionally, light blockages can severely corrupt RSS measurements, yet blockage detection has not been explored in real-world experiments. To address these problems, we propose a tightly coupled VLP/INS (Inertial Navigation System) integrated navigation system that uses graph optimization to account for varying PD inclinations and VLP blockages. We also discuss the possibility of simultaneously estimating the robot's pose and the locations of some unknown LEDs. Simulations and two groups of real-world experiments demonstrate the effectiveness of our approach, achieving an average positioning accuracy of 10 cm during movement and inclination accuracy within 1 degree despite inclination changes and blockages.
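As background for how a varying PD inclination enters the measurement model, the following minimal sketch evaluates the standard Lambertian RSS model, in which the incidence angle depends on the PD's normal vector; all parameters and poses are hypothetical, and this is not the paper's implementation.

```python
import numpy as np

def rss_lambertian(led_pos, led_normal, pd_pos, pd_normal,
                   tx_power=1.0, lambertian_order=1.0,
                   pd_area=1e-4, responsivity=0.4):
    """Received signal strength under the Lambertian channel model."""
    d_vec = pd_pos - led_pos
    d = np.linalg.norm(d_vec)
    cos_irradiance = np.dot(led_normal, d_vec) / d     # angle at the LED
    cos_incidence = np.dot(pd_normal, -d_vec) / d      # angle at the tilted PD
    if cos_irradiance <= 0 or cos_incidence <= 0:
        return 0.0                                     # outside the field of view
    gain = (lambertian_order + 1) * pd_area / (2 * np.pi * d**2)
    return responsivity * tx_power * gain * cos_irradiance**lambertian_order * cos_incidence

# Tilting the PD by 20 degrees noticeably changes the predicted RSS.
led, led_n = np.array([2.0, 2.0, 3.0]), np.array([0.0, 0.0, -1.0])
pd = np.array([2.5, 1.5, 0.8])
flat = np.array([0.0, 0.0, 1.0])
tilted = np.array([np.sin(np.radians(20)), 0.0, np.cos(np.radians(20))])
print(rss_lambertian(led, led_n, pd, flat), rss_lambertian(led, led_n, pd, tilted))
```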
Abstract:For SLAM systems in robotics and autonomous driving, the accuracy of front-end odometry and back-end loop-closure detection determines overall system performance. However, LiDAR SLAM can be disturbed by moving objects in the scene, resulting in drift errors and even loop-closure failure. The ability to detect and segment moving objects is therefore essential for high-precision positioning and for building a consistent map. In this paper, we address the problem of moving object segmentation (MOS) from 3D LiDAR scans to improve the odometry and loop-closure accuracy of SLAM. We propose a novel 3D Sequential Moving-Object-Segmentation (3D-SeqMOS) method that accurately segments the scene into moving and static objects, such as moving and static cars. Unlike existing projected-image methods, we process the raw 3D point cloud and build a 3D convolutional neural network for the MOS task. In addition, to make full use of the spatio-temporal information in the point cloud, we propose a point cloud residual mechanism that uses the spatial features of the current scan and the temporal features of previous residual scans. We also build a complete SLAM framework to verify the effectiveness and accuracy of 3D-SeqMOS. Experiments on the SemanticKITTI dataset show that 3D-SeqMOS effectively detects moving objects and improves the accuracy of LiDAR odometry and loop-closure detection, outperforming the state-of-the-art method by 12.4%. We further submitted the proposed method to the SemanticKITTI Moving Object Segmentation competition and ranked 2nd on the leaderboard, showing its effectiveness.
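To illustrate the flavor of a point cloud residual, the sketch below computes a simple per-point residual between the current scan and a previous scan transformed into the current frame; large residuals tend to flag moving objects. This is an illustrative stand-in, not the 3D-SeqMOS residual mechanism, and the function and variable names are hypothetical.

```python
import numpy as np
from scipy.spatial import cKDTree

def point_residuals(curr_pts, prev_pts, T_curr_prev):
    """curr_pts (N,3), prev_pts (M,3); T_curr_prev: 4x4 pose of the previous frame in the current frame."""
    prev_h = np.hstack([prev_pts, np.ones((prev_pts.shape[0], 1))])
    prev_in_curr = (T_curr_prev @ prev_h.T).T[:, :3]   # align the previous scan
    tree = cKDTree(prev_in_curr)
    dist, _ = tree.query(curr_pts, k=1)                # nearest-neighbor distance
    return dist                                        # one residual per current point
```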
Abstract:In many camera-based applications, it is necessary to find the geometric relationship between incoming rays and image pixels, i.e., the projection model, through geometric camera calibration (GCC). Aiming to provide practical calibration guidelines, this work surveys and evaluates existing GCC tools. The survey covers camera models, calibration targets, and algorithms used in these tools, highlighting their properties and the trends in GCC development. The evaluation compares six target-based GCC tools, namely BabelCalib, Basalt, Camodocal, Kalibr, the MATLAB calibrator, and the OpenCV-based ROS calibrator, with simulated and real data from cameras with wide-angle and fisheye lenses described by three traditional projection models. These tests reveal the strengths and weaknesses of the camera models, as well as the repeatability of the GCC tools. In view of the survey and evaluation, future research directions for GCC are also discussed.
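For readers unfamiliar with the traditional projection models compared in these tools, the following minimal sketch projects a 3D point with the common pinhole plus radial-tangential (plumb-bob) model; the parameter values are assumed, and the snippet is illustrative rather than taken from any of the evaluated tools.

```python
import numpy as np

def project_pinhole_radtan(p_cam, fx, fy, cx, cy, k1, k2, p1, p2):
    """Project a point in the camera frame (Z forward) to pixel coordinates."""
    x, y = p_cam[0] / p_cam[2], p_cam[1] / p_cam[2]    # normalized image coordinates
    r2 = x * x + y * y
    radial = 1.0 + k1 * r2 + k2 * r2 * r2              # radial distortion
    x_d = x * radial + 2 * p1 * x * y + p2 * (r2 + 2 * x * x)
    y_d = y * radial + p1 * (r2 + 2 * y * y) + 2 * p2 * x * y
    return np.array([fx * x_d + cx, fy * y_d + cy])

print(project_pinhole_radtan(np.array([0.1, -0.05, 2.0]),
                             458.0, 457.0, 367.0, 248.0,
                             -0.28, 0.07, 1e-4, -2e-5))
```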
Abstract:Millimeter-wave radar can measure distances, directions, and Doppler velocities of objects in harsh conditions such as fog. A 4D imaging radar, whose dense vertical and horizontal returns resemble an image, can also measure object height. Previous studies have used 3D radars for ego-motion estimation, but few methods leverage the rich data of imaging radars, and they usually omit the mapping aspect, leading to inferior odometry accuracy. This paper presents iRIOM, a real-time imaging radar inertial odometry and mapping method based on the submap concept. To deal with moving objects and multipath reflections, we use the graduated non-convexity method to robustly and efficiently estimate ego-velocity from a single scan. To measure the agreement between sparse, non-repetitive radar scan points and submap points, a distribution-to-multi-distribution distance is adopted for matches. The ego-velocity and scan-to-submap matches are fused with 6D inertial data by an iterated extended Kalman filter to obtain the platform's 3D position and orientation. A loop closure module is also developed to curb the odometry module's drift. To our knowledge, iRIOM, built on these two modules, is the first 4D radar inertial SLAM system. On our own and third-party data, we show iRIOM's favorable odometry accuracy and mapping consistency against FastLIO-SLAM and EKFRIO. An ablation study also reveals the benefit of inertial data over a constant-velocity model, and of scan-to-submap matching over scan-to-scan matching.
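The ego-velocity step can be sketched as a robust linear least-squares problem: each static point with unit ray direction d_i and a Doppler measurement contributes one equation in the 3D ego-velocity. The snippet below uses graduated non-convexity with a Geman-McClure loss as a simplified stand-in for the paper's estimator; the sign convention, noise bound, and annealing schedule are assumptions, not iRIOM's actual settings.

```python
import numpy as np

def ego_velocity_gnc(directions, dopplers, noise_bound=0.2, n_iters=20):
    """directions: (N,3) unit rays; dopplers: (N,) radial velocities (static world assumed)."""
    A, b = directions, -dopplers                   # model: d_i . v = -doppler_i
    v = np.linalg.lstsq(A, b, rcond=None)[0]       # non-robust initialization
    w = np.ones(len(b))
    mu = 2.0 * np.max((A @ v - b) ** 2) / noise_bound**2
    for _ in range(n_iters):
        Aw = A * w[:, None]                        # weighted least squares
        v = np.linalg.solve(Aw.T @ A, Aw.T @ b)
        r2 = (A @ v - b) ** 2
        w = (mu * noise_bound**2 / (r2 + mu * noise_bound**2)) ** 2
        mu = max(1.0, mu / 1.4)                    # gradually recover the GM loss
    return v, w                                    # ego-velocity and inlier weights
```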
Abstract:Camera-IMU (Inertial Measurement Unit) sensor fusion has been extensively studied in recent decades, and numerous observability analyses and fusion schemes for motion estimation with self-calibration have been presented. However, it has remained uncertain whether both camera and IMU intrinsic parameters are observable under general motion. To answer this question, we first prove that for a global-shutter camera-IMU system, all intrinsic and extrinsic parameters are observable with an unknown landmark. Given this, the time offset and readout time of a rolling shutter (RS) camera are also shown to be observable. Next, to validate this analysis and to solve the drift issue of a structureless filter during standstill, we develop a Keyframe-based Sliding Window Filter (KSWF) for odometry and self-calibration, which works with a monocular RS camera or stereo RS cameras. Though the keyframe concept is widely used in vision-based sensor fusion, to our knowledge KSWF is the first of its kind to support self-calibration. Simulation and real-data tests validate that it is possible to fully calibrate the camera-IMU system using observations of opportunistic landmarks under diverse motion. Real-data tests confirm previous allusions that keeping landmarks in the state vector can remedy drift during standstill, and show that the keyframe-based scheme is an alternative cure.
Abstract:The rolling shutter (RS) mechanism is widely used in consumer-grade cameras, which are essential components of smartphones and autonomous vehicles. The RS effect leads to image distortion upon relative motion between the camera and the scene. This effect needs to be considered in video stabilization, structure from motion, and vision-aided odometry, for which recent studies have improved earlier global shutter (GS) methods by accounting for the RS effect. However, it is still unclear how RS affects the spatiotemporal calibration of a camera in a sensor assembly, which is crucial to good performance in the aforementioned applications. This work takes the camera-IMU system as an example and examines the RS effect on its spatiotemporal calibration. To this end, we develop a calibration method for an RS-camera-IMU system with continuous-time B-splines by using a calibration target. Unlike in calibrating GS cameras, every observation of a landmark on the target has a unique camera pose fitted by the continuous-time B-splines. With simulated data generated from four sets of public calibration data, we show that RS can noticeably affect the extrinsic parameters, causing errors of about 1$^\circ$ in orientation and 2 cm in translation under an RS setting typical of smartphone cameras. With real data collected by two industrial camera-IMU systems, we find that considering the RS effect yields more accurate and consistent spatiotemporal calibration. Moreover, our method accurately calibrates the inter-line delay of the RS camera. The code for simulation and calibration is publicly available.
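A central detail is that each landmark observation receives its own timestamp from the image row it falls on, and the continuous-time trajectory is queried at that time. The sketch below shows only this timestamp bookkeeping with assumed numbers; the pose itself would come from the fitted B-splines (represented here by a placeholder comment), not from this snippet.

```python
def observation_time(frame_start_time, row, line_delay):
    """Exposure time of an image row under a rolling shutter (row 0 at frame start)."""
    return frame_start_time + row * line_delay

# Hypothetical 480-row image read out over 30 ms.
line_delay = 0.030 / 480
t_obs = observation_time(frame_start_time=12.000, row=300, line_delay=line_delay)
# trajectory.pose_at(t_obs) would then give this observation's unique camera pose.
print(t_obs)
```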
Abstract:State estimation problems that use relative observations routinely arise in the navigation of unmanned aerial vehicles, autonomous ground vehicles, etc., whose proper operation relies on accurate state estimates and reliable covariances. These problems have inherent unobservable directions. Traditional causal estimators, however, usually gain spurious information along the unobservable directions, leading to overconfident covariances inconsistent with the actual estimation errors. The consistency problem of fixed-lag smoothers (FLSs) has only been attacked by the first-estimates Jacobian (FEJ) technique, because of the complexity of analyzing their observability properties, but FEJ has several drawbacks hampering its wide adoption. To ensure the consistency of an FLS, this paper introduces the right invariant error formulation into the FLS framework. To our knowledge, we are the first to analyze the observability of an FLS with the right invariant error. Our main contributions are twofold. First, to bypass the complexity of analysis with the classic observability matrix, we show that the observability analysis of FLSs can be done equivalently on the linearized system. Second, we prove that the inconsistency issue of traditional FLSs can be elegantly solved by the right invariant error formulation without artificially correcting Jacobians. By applying the proposed FLS to the monocular visual inertial simultaneous localization and mapping (SLAM) problem, we confirm in simulation that the method estimates covariance consistently, similarly to a batch smoother, and that it achieves accuracy comparable to traditional FLSs on real data.
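For reference, the right invariant error used here is the standard Lie-group error definition sketched below (for example on the extended pose group common in visual-inertial estimation); this is only the textbook definition, not the paper's full derivation, and its appeal is that the resulting linearized Jacobians do not depend on the estimated state along the unobservable directions.

```latex
% Right-invariant error for a matrix Lie group state X, e.g., the extended pose
% (R, v, p) \in SE_2(3) common in visual-inertial estimation.
\eta^{r} = \hat{X}\,X^{-1} \approx I + \xi^{\wedge}, \qquad
X = \begin{bmatrix} R & v & p \\ \mathbf{0} & 1 & 0 \\ \mathbf{0} & 0 & 1 \end{bmatrix} \in SE_{2}(3)
```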
Abstract:Motion estimation by fusing data from at least a camera and an Inertial Measurement Unit (IMU) enables many applications in robotics. However, among the multitude of Visual Inertial Odometry (VIO) methods, few efficiently estimate device motion with consistent covariance and calibrate sensor parameters online to handle data from consumer sensors. This paper addresses the gap with a Keyframe-based Structureless Filter (KSF). For efficiency, landmarks are not included in the filter's state vector. For robustness, KSF associates feature observations and manages state variables using the concept of keyframes. For flexibility, KSF supports anytime calibration of IMU systematic errors, as well as extrinsic, intrinsic, and temporal parameters of each camera. Estimator consistency and observability of sensor parameters were analyzed by simulation. Sensitivity to design options, e.g., the feature matching method and camera count, was studied with the EuRoC benchmark. Sensor parameter estimation was evaluated on raw TUM VI sequences and smartphone data. Moreover, pose estimation accuracy was evaluated on the EuRoC and TUM VI sequences against recent VIO methods. These tests confirm that KSF reliably calibrates sensor parameters when the data contain adequate motion, and consistently estimates motion with accuracy rivaling recent VIO methods. Our implementation runs at 42 Hz with stereo camera images on a consumer laptop.
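The "structureless" aspect, i.e., keeping landmarks out of the state vector, is commonly realized by projecting each feature's residual onto the left null space of its landmark Jacobian, as popularized by the MSCKF. The sketch below shows that generic trick; it is not necessarily KSF's exact formulation, and the names are hypothetical.

```python
import numpy as np

def structureless_update_terms(r, H_x, H_f):
    """r: stacked residuals; H_x: Jacobian w.r.t. states; H_f: Jacobian w.r.t. the landmark."""
    q, _ = np.linalg.qr(H_f, mode="complete")     # full QR of the landmark Jacobian
    null_basis = q[:, H_f.shape[1]:]              # columns orthogonal to range(H_f)
    r0 = null_basis.T @ r                         # landmark-free residual
    H0 = null_basis.T @ H_x                       # landmark-free Jacobian
    return r0, H0                                 # feed these to the filter update
```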
Abstract:Visual place recognition and simultaneous localization and mapping (SLAM) have recently begun to be used in real-world autonomous navigation tasks such as food delivery. Existing datasets for SLAM research are often not representative of in situ operations, leaving a gap between academic research and real-world deployment. In response, this paper presents the Segway DRIVE benchmark, a novel and challenging dataset suite collected by a fleet of Segway delivery robots. Each robot is equipped with a global-shutter fisheye camera, a consumer-grade IMU synced to the camera on chip, two low-cost wheel encoders, and a removable high-precision lidar for generating reference solutions. Because the robots routinely carry out delivery tasks in office buildings and shopping malls while collecting data, the year-long dataset is characterized by planar motion, moving pedestrians, and changing environments and lighting. Such factors typically pose severe challenges and may lead to failures of SLAM algorithms. Moreover, several metrics are proposed for evaluating metric place recognition algorithms, and sample SLAM and metric place recognition methods were evaluated on the benchmark with these metrics. The first release of our benchmark contains hundreds of sequences covering more than 50 km of indoor floors, and more data will be added as the robot fleet continues to operate in real life. The benchmark is available at http://drive.segwayrobotics.com/#/dataset/download.