Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joshua Knights

REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

Mar 05, 2025

Débora N. P. Oliveira, Joshua Knights, Sebastián Barbas Laina, Simon Boche, Wolfram Burgard, Stefan Leutenegger

Figure 1 for REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

Figure 2 for REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

Figure 3 for REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

Figure 4 for REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

Abstract:Loop closures are essential for correcting odometry drift and creating consistent maps, especially in the context of large-scale navigation. Current methods using dense point clouds for accurate place recognition do not scale well due to computationally expensive scan-to-scan comparisons. Alternative object-centric approaches are more efficient but often struggle with sensitivity to viewpoint variation. In this work, we introduce REGRACE, a novel approach that addresses these challenges of scalability and perspective difference in re-localization by using LiDAR-based submaps. We introduce rotation-invariant features for each labeled object and enhance them with neighborhood context through a graph neural network. To identify potential revisits, we employ a scalable bag-of-words approach, pooling one learned global feature per submap. Additionally, we define a revisit with geometrical consistency cues rather than embedding distance, allowing us to recognize far-away loop closures. Our evaluations demonstrate that REGRACE achieves similar results compared to state-of-the-art place recognition and registration baselines while being twice as fast.

* Submitted to IROS2025

Via

Access Paper or Ask Questions

SOLVR: Submap Oriented LiDAR-Visual Re-Localisation

Sep 16, 2024

Joshua Knights, Sebastián Barbas Laina, Peyman Moghadam, Stefan Leutenegger

Abstract:This paper proposes SOLVR, a unified pipeline for learning based LiDAR-Visual re-localisation which performs place recognition and 6-DoF registration across sensor modalities. We propose a strategy to align the input sensor modalities by leveraging stereo image streams to produce metric depth predictions with pose information, followed by fusing multiple scene views from a local window using a probabilistic occupancy framework to expand the limited field-of-view of the camera. Additionally, SOLVR adopts a flexible definition of what constitutes positive examples for different training losses, allowing us to simultaneously optimise place recognition and registration performance. Furthermore, we replace RANSAC with a registration function that weights a simple least-squares fitting with the estimated inlier likelihood of sparse keypoint correspondences, improving performance in scenarios with a low inlier ratio between the query and retrieved place. Our experiments on the KITTI and KITTI360 datasets show that SOLVR achieves state-of-the-art performance for LiDAR-Visual place recognition and registration, particularly improving registration accuracy over larger distances between the query and retrieved place.

* Submitted to ICRA2025

Via

Access Paper or Ask Questions

WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Dec 23, 2023

Kavisha Vidanapathirana, Joshua Knights, Stephen Hausler, Mark Cox, Milad Ramezani, Jason Jooste, Ethan Griffiths, Shaheer Mohamed, Sridha Sridharan, Clinton Fookes(+1 more)

Figure 1 for WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Figure 2 for WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Figure 3 for WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Figure 4 for WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

Abstract:Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and lidar) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural automation. Therefore, we introduce WildScenes, a bi-modal benchmark dataset consisting of multiple large-scale traversals in natural environments, including semantic annotations in high-resolution 2D images and dense 3D lidar point clouds, and accurate 6-DoF pose information. The data is (1) trajectory-centric with accurate localization and globally aligned point clouds, (2) calibrated and synchronized to support bi-modal inference, and (3) containing different natural environments over 6 months to support research on domain adaptation. Our 3D semantic labels are obtained via an efficient automated process that transfers the human-annotated 2D labels from multiple views into 3D point clouds, thus circumventing the need for expensive and time-consuming human annotation in 3D. We introduce benchmarks on 2D and 3D semantic segmentation and evaluate a variety of recent deep-learning techniques to demonstrate the challenges in semantic segmentation in natural environments. We propose train-val-test splits for standard benchmarks as well as domain adaptation benchmarks and utilize an automated split generation technique to ensure the balance of class label distributions. The data, evaluation scripts and pretrained models will be released upon acceptance at https://csiro-robotics.github.io/WildScenes.

* Under review. The first 3 authors contributed equally

Via

Access Paper or Ask Questions

Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition

Aug 31, 2023

Milad Ramezani, Liang Wang, Joshua Knights, Zhibin Li, Pauline Pounds, Peyman Moghadam

Figure 1 for Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition

Figure 2 for Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition

Figure 3 for Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition

Figure 4 for Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition

Abstract:This paper proposes a lidar place recognition approach, called P-GAT, to increase the receptive field between point clouds captured over time. Instead of comparing pairs of point clouds, we compare the similarity between sets of point clouds to use the maximum spatial and temporal information between neighbour clouds utilising the concept of pose-graph SLAM. Leveraging intra- and inter-attention and graph neural network, P-GAT relates point clouds captured in nearby locations in Euclidean space and their embeddings in feature space. Experimental results on the large-scale publically available datasets demonstrate the effectiveness of our approach in recognising scenes lacking distinct features and when training and testing environments have different distributions (domain adaptation). Further, an exhaustive comparison with the state-of-the-art shows improvements in performance gains. Code will be available upon acceptance.

* 8 pages, 3 figures, 5 tables

Via

Access Paper or Ask Questions

GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors

Aug 09, 2023

Joshua Knights, Stephen Hausler, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

Figure 1 for GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors

Figure 2 for GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors

Figure 3 for GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors

Figure 4 for GeoAdapt: Self-Supervised Test-Time Adaption in LiDAR Place Recognition Using Geometric Priors

Abstract:LiDAR place recognition approaches based on deep learning suffer a significant degradation in performance when there is a shift between the distribution of the training and testing datasets, with re-training often required to achieve top performance. However, obtaining accurate ground truth on new environments can be prohibitively expensive, especially in complex or GPS-deprived environments. To address this issue we propose GeoAdapt, which introduces a novel auxiliary classification head to generate pseudo-labels for re-training on unseen environments in a self-supervised manner. GeoAdapt uses geometric consistency as a prior to improve the robustness of our generated pseudo-labels against domain shift, improving the performance and reliability of our Test-Time Adaptation approach. Comprehensive experiments show that GeoAdapt significantly boosts place recognition performance across moderate to severe domain shifts, and is competitive with fully supervised test-time adaptation approaches. Our code will be available at https://github.com/csiro-robotics/GeoAdapt.

* Submitted to RA-L

Via

Access Paper or Ask Questions

Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments

Nov 29, 2022

Joshua Knights, Kavisha Vidanapathirana, Milad Ramezani, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

Abstract:Many existing datasets for lidar place recognition are solely representative of structured urban environments, and have recently been saturated in performance by deep learning based approaches. Natural and unstructured environments present many additional challenges for the tasks of long-term localisation but these environments are not represented in currently available datasets. To address this we introduce Wild-Places, a challenging large-scale dataset for lidar place recognition in unstructured, natural environments. Wild-Places contains eight lidar sequences collected with a handheld sensor payload over the course of fourteen months, containing a total of 67K undistorted lidar submaps along with accurate 6DoF ground truth. Our dataset contains multiple revisits both within and between sequences, allowing for both intra-sequence (i.e. loop closure detection) and inter-sequence (i.e. re-localisation) place recognition. We also benchmark several state-of-the-art approaches to demonstrate the challenges that this dataset introduces, particularly the case of long-term place recognition due to natural environments changing over time. Our dataset and code will be available at https://csiro-robotics.github.io/Wild-Places.

* Equal Contribution from first two authors Under Review Website link: https://csiro-robotics.github.io/Wild-Places/

Via

Access Paper or Ask Questions

Uncertainty-Aware Lidar Place Recognition in Novel Environments

Oct 04, 2022

Keita Mason, Joshua Knights, Milad Ramezani, Peyman Moghadam, Dimity Miller

Figure 1 for Uncertainty-Aware Lidar Place Recognition in Novel Environments

Figure 2 for Uncertainty-Aware Lidar Place Recognition in Novel Environments

Figure 3 for Uncertainty-Aware Lidar Place Recognition in Novel Environments

Figure 4 for Uncertainty-Aware Lidar Place Recognition in Novel Environments

Abstract:State-of-the-art approaches to lidar place recognition degrade significantly when tested on novel environments that are not present in their training dataset. To improve their reliability, we propose uncertainty-aware lidar place recognition, where each predicted place match must have an associated uncertainty that can be used to identify and reject potentially incorrect matches. We introduce a novel evaluation protocol designed to benchmark uncertainty-aware lidar place recognition, and present Deep Ensembles as the first uncertainty-aware approach for this task. Testing across three large-scale datasets and three state-of-the-art architectures, we show that Deep Ensembles consistently improves the performance of lidar place recognition in novel environments. Compared to a standard network, our results show that Deep Ensembles improves the Recall@1 by more than 5% and AuPR by more than 3% on average when tested on previously unseen environments. Our code repository will be made publicly available upon paper acceptance at https://github.com/csiro-robotics/Uncertainty-LPR.

* 7 pages, 4 figures. Under Review

Via

Access Paper or Ask Questions

InCloud: Incremental Learning for Point Cloud Place Recognition

Mar 02, 2022

Joshua Knights, Peyman Moghadam, Milad Ramezani, Sridha Sridharan, Clinton Fookes

Figure 1 for InCloud: Incremental Learning for Point Cloud Place Recognition

Figure 2 for InCloud: Incremental Learning for Point Cloud Place Recognition

Figure 3 for InCloud: Incremental Learning for Point Cloud Place Recognition

Figure 4 for InCloud: Incremental Learning for Point Cloud Place Recognition

Abstract:Place recognition is a fundamental component of robotics, and has seen tremendous improvements through the use of deep learning models in recent years. Networks can experience significant drops in performance when deployed in unseen or highly dynamic environments, and require additional training on the collected data. However naively fine-tuning on new training distributions can cause severe degradation of performance on previously visited domains, a phenomenon known as catastrophic forgetting. In this paper we address the problem of incremental learning for point cloud place recognition and introduce InCloud, a structure-aware distillation-based approach which preserves the higher-order structure of the network's embedding space. We introduce several challenging new benchmarks on four popular and large-scale LiDAR datasets (Oxford, MulRan, In-house and KITTI) showing broad improvements in point cloud place recognition performance over a variety of network architectures. To the best of our knowledge, this work is the first to effectively apply incremental learning for point cloud place recognition.

* Under Review

Via

Access Paper or Ask Questions

Point Cloud Segmentation Using Sparse Temporal Local Attention

Dec 02, 2021

Joshua Knights, Peyman Moghadam, Clinton Fookes, Sridha Sridharan

Figure 1 for Point Cloud Segmentation Using Sparse Temporal Local Attention

Figure 2 for Point Cloud Segmentation Using Sparse Temporal Local Attention

Figure 3 for Point Cloud Segmentation Using Sparse Temporal Local Attention

Figure 4 for Point Cloud Segmentation Using Sparse Temporal Local Attention

Abstract:Point clouds are a key modality used for perception in autonomous vehicles, providing the means for a robust geometric understanding of the surrounding environment. However despite the sensor outputs from autonomous vehicles being naturally temporal in nature, there is still limited exploration of exploiting point cloud sequences for 3D seman-tic segmentation. In this paper we propose a novel Sparse Temporal Local Attention (STELA) module which aggregates intermediate features from a local neighbourhood in previous point cloud frames to provide a rich temporal context to the decoder. Using the sparse local neighbourhood enables our approach to gather features more flexibly than those which directly match point features, and more efficiently than those which perform expensive global attention over the whole point cloud frame. We achieve a competitive mIoU of 64.3% on the SemanticKitti dataset, and demonstrate significant improvement over the single-frame baseline in our ablation studies.

* 8 pages, 3 figures Published at the Australasian Conference on Robotics and Automation (ACRA) 2021

Via

Access Paper or Ask Questions

Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

May 01, 2020

Joshua Knights, Anthony Vanderkop, Daniel Ward, Olivia Mackenzie-Ross, Peyman Moghadam

Figure 1 for Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Figure 2 for Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Figure 3 for Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Figure 4 for Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Abstract:This paper presents TCE: Temporally Coherent Embeddings for self-supervised video representation learning. The proposed method exploits inherent structure of unlabeled video data to explicitly enforce temporal coherency in the embedding space, rather than indirectly learning it through ranking or predictive pretext tasks. In the same way that high-level visual information in the world changes smoothly, we believe that nearby frames in learned representations should demonstrate similar properties. Using this assumption, we train the TCE model to encode videos such that adjacent frames exist close to each other and videos are separated from one another. Using TCE we learn robust representations from large quantities of unlabeled video data. We evaluate our self-supervised trained TCE model by adding a classification layer and finetuning the learned representation on the downstream task of video action recognition on the UCF101 dataset. We obtain 67.01% accuracy and outperform the state-of-the-art self-supervised methods trained on UCF101 despite using a significantly smaller dataset for pre-training. Notably, we demonstrate results competitive with more complex 3D-CNN based networks while training with a 2D-CNN network backbone on action recognition tasks. Our training code and pretrained models are available at https://github.com/csiro-robotics/TCE

* Under review! Project page: https://csiro-robotics.github.io/TCE_Webpage/

Via

Access Paper or Ask Questions