Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andrew B. Barron

Are Transformers Truly Foundational for Robotics?

Nov 25, 2024

James A. R. Marshall, Andrew B. Barron

Figure 1 for Are Transformers Truly Foundational for Robotics?

Figure 2 for Are Transformers Truly Foundational for Robotics?

Abstract:Generative Pre-Trained Transformers (GPTs) are hyped to revolutionize robotics. Here we question their utility. GPTs for autonomous robotics demand enormous and costly compute, excessive training times and (often) offboard wireless control. We contrast GPT state of the art with how tiny insect brains have achieved robust autonomy with none of these constraints. We highlight lessons that can be learned from biology to enhance the utility of GPTs in robotics.

Via

Access Paper or Ask Questions

EchoVPR: Echo State Networks for Visual Place Recognition

Oct 11, 2021

Anil Ozdemir, Andrew B. Barron, Andrew Philippides, Michael Mangan, Eleni Vasilaki, Luca Manneschi

Figure 1 for EchoVPR: Echo State Networks for Visual Place Recognition

Figure 2 for EchoVPR: Echo State Networks for Visual Place Recognition

Figure 3 for EchoVPR: Echo State Networks for Visual Place Recognition

Figure 4 for EchoVPR: Echo State Networks for Visual Place Recognition

Abstract:Recognising previously visited locations is an important, but unsolved, task in autonomous navigation. Current visual place recognition (VPR) benchmarks typically challenge models to recover the position of a query image (or images) from sequential datasets that include both spatial and temporal components. Recently, Echo State Network (ESN) varieties have proven particularly powerful at solving machine learning tasks that require spatio-temporal modelling. These networks are simple, yet powerful neural architectures that -- exhibiting memory over multiple time-scales and non-linear high-dimensional representations -- can discover temporal relations in the data while still maintaining linearity in the learning. In this paper, we present a series of ESNs and analyse their applicability to the VPR problem. We report that the addition of ESNs to pre-processed convolutional neural networks led to a dramatic boost in performance in comparison to non-recurrent networks in four standard benchmarks (GardensPoint, SPEDTest, ESSEX3IN1, Nordland) demonstrating that ESNs are able to capture the temporal structure inherent in VPR problems. Moreover, we show that ESNs can outperform class-leading VPR models which also exploit the sequential dynamics of the data. Finally, our results demonstrate that ESNs also improve generalisation abilities, robustness, and accuracy further supporting their suitability to VPR applications.

* 8 pages, 7 figures, submitted to ICRA 2022 (under review)

Via

Access Paper or Ask Questions

A Compact Neural Architecture for Visual Place Recognition

Oct 15, 2019

Marvin Chancán, Luis Hernandez-Nunez, Ajay Narendra, Andrew B. Barron, Michael Milford

Figure 1 for A Compact Neural Architecture for Visual Place Recognition

Figure 2 for A Compact Neural Architecture for Visual Place Recognition

Figure 3 for A Compact Neural Architecture for Visual Place Recognition

Figure 4 for A Compact Neural Architecture for Visual Place Recognition

Abstract:State-of-the-art algorithms for visual place recognition can be broadly split into two categories: computationally expensive deep-learning/image retrieval based techniques with minimal biological plausibility, and computationally cheap, biologically inspired models that yield poor performance in real-world environments. In this paper we present a new compact and high-performing system that bridges this divide for the first time. Our approach comprises two key components: FlyNet, a compact, sparse two-layer neural network inspired by fruit fly brain architectures, and a one-dimensional continuous attractor neural network (CANN). Our FlyNet+CANN network combines the compact pattern recognition capabilities of the FlyNet model with the powerful temporal filtering capabilities of an equally compact CANN, replicating entirely in a neural network implementation the functionality that yields high performance in algorithmic localization approaches like SeqSLAM. We evaluate our approach and compare it to three state-of-the-art methods on two benchmark real-world datasets with small viewpoint changes and extreme appearance variations including different times of day (afternoon to night) where it achieves an AUC performance of 87%, compared to 60% for Multi-Process Fusion, 46% for LoST-X and 1% for SeqSLAM, while being 6.5, 310, and 1.5 times faster respectively.

* Submitted to RA-L with ICRA 2020 presentation option, 8 pages, 13 figures

Via

Access Paper or Ask Questions