Vasileios Belagiannis

Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection

Aug 01, 2025

Marc Hölle, Walter Kellermann, Vasileios Belagiannis

Abstract: Semantic segmentation models trained on known object classes often fail in real-world autonomous driving scenarios by confidently misclassifying unknown objects. While pixel-wise out-of-distribution detection can identify unknown objects, existing methods struggle in complex scenes where rare object classes are often confused with truly unknown objects. We introduce an uncertainty-aware likelihood ratio estimation method that addresses these limitations. Our approach uses an evidential classifier within a likelihood ratio test to distinguish between known and unknown pixel features from a semantic segmentation model, while explicitly accounting for uncertainty. Instead of producing point estimates, our method outputs probability distributions that capture uncertainty from both rare training examples and imperfect synthetic outliers. We show that by incorporating uncertainty in this way, outlier exposure can be leveraged more effectively. Evaluated on five standard benchmark datasets, our method achieves the lowest average false positive rate (2.5%) among state-of-the-art methods while maintaining high average precision (90.91%) and incurring only negligible computational overhead. Code is available at https://github.com/glasbruch/ULRE.

* Accepted at ICCVW 2025, 11 pages, 4 figures
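
The abstract does not spell out how the evidential classifier is parameterised, so the following is only a minimal sketch of the general idea of outputting a distribution instead of a point estimate. It assumes the common evidential formulation in which two non-negative evidence values define a Beta distribution over the probability that a pixel is "known"; the function names and the +1 prior are assumptions, not taken from the paper.

```python
import math

def beta_from_evidence(e_known: float, e_unknown: float):
    """Map non-negative evidence to Beta parameters (assumed uniform +1 prior)."""
    if e_known < 0 or e_unknown < 0:
        raise ValueError("evidence must be non-negative")
    return e_known + 1.0, e_unknown + 1.0

def known_probability_stats(e_known: float, e_unknown: float):
    """Return mean, variance and a log-ratio score of the Beta distribution."""
    a, b = beta_from_evidence(e_known, e_unknown)
    total = a + b
    mean = a / total                               # expected P(pixel is known)
    var = a * b / (total ** 2 * (total + 1.0))     # shrinks as evidence grows
    log_ratio = math.log(a) - math.log(b)          # > 0 favours "known"
    return mean, var, log_ratio

# Same mean, very different uncertainty: little evidence vs. a lot of evidence.
print(known_probability_stats(1.0, 1.0))
print(known_probability_stats(99.0, 99.0))
```

Both calls give a mean of 0.5, but the variance is much larger when little evidence is available, which is the kind of signal a point estimate cannot express.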

Revisiting Gradient-based Uncertainty for Monocular Depth Estimation

Feb 09, 2025

Julia Hornauer, Amir El-Ghoussani, Vasileios Belagiannis

Abstract: Monocular depth estimation, similar to other image-based tasks, is prone to erroneous predictions due to ambiguities in the image, for example, caused by dynamic objects or shadows. For this reason, pixel-wise uncertainty assessment is required for safety-critical applications to highlight the areas where the prediction is unreliable. We address this in a post hoc manner and introduce gradient-based uncertainty estimation for already trained depth estimation models. To extract gradients without depending on the ground truth depth, we introduce an auxiliary loss function based on the consistency of the predicted depth and a reference depth. The reference depth, which acts as pseudo ground truth, is in fact generated using a simple image or feature augmentation, making our approach simple and effective. To obtain the final uncertainty score, the derivatives w.r.t. the feature maps from single or multiple layers are calculated using back-propagation. We demonstrate that our gradient-based approach is effective in determining the uncertainty without re-training using the two standard depth estimation benchmarks KITTI and NYU. In particular, for models trained with monocular sequences and therefore most prone to uncertainty, our method outperforms related approaches. In addition, we publicly provide our code and models: https://github.com/jhornauer/GrUMoDepth

* Accepted to TPAMI
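
The pipeline described in the abstract (reference depth from an augmentation, auxiliary consistency loss, gradients with respect to features) can be illustrated with a toy one-dimensional example. Everything here is an assumption made for illustration: the "decoder" is a hypothetical two-tap filter, the augmentation is a horizontal flip, the loss is a squared error, and the gradient is written out by hand where a real model would use back-propagation.

```python
def decode(features, w0=1.0, w1=0.5):
    """Toy 'depth decoder': an asymmetric two-tap filter over the features."""
    return [w0 * f + w1 * (features[i - 1] if i > 0 else 0.0)
            for i, f in enumerate(features)]

def reference_depth(features, w0=1.0, w1=0.5):
    """Pseudo ground truth: predict on the flipped input, then flip back."""
    return decode(features[::-1], w0, w1)[::-1]

def aux_loss(features, reference, w0=1.0, w1=0.5):
    """Squared-error consistency between predicted and reference depth."""
    depth = decode(features, w0, w1)
    return sum((d - r) ** 2 for d, r in zip(depth, reference))

def gradient_uncertainty(features, w0=1.0, w1=0.5):
    """Per-position |d loss / d feature|, with the reference held constant."""
    reference = reference_depth(features, w0, w1)
    depth = decode(features, w0, w1)
    residual = [d - r for d, r in zip(depth, reference)]
    n = len(features)
    scores = []
    for j in range(n):
        # Feature j feeds output j (weight w0) and output j + 1 (weight w1).
        g = 2.0 * w0 * residual[j]
        if j + 1 < n:
            g += 2.0 * w1 * residual[j + 1]
        scores.append(abs(g))
    return scores

print(gradient_uncertainty([0.0, 0.0, 1.0, 1.0]))
```

With a flip-symmetric decoder (`w1=0.0`) the prediction and the reference agree and every score is zero; the asymmetric filter makes them disagree, and the gradient magnitude marks where.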

Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation

Nov 29, 2024

Michele De Vita, Vasileios Belagiannis

Abstract: Despite the remarkable progress in generative modelling, current diffusion models lack a quantitative approach to assess image quality. To address this limitation, we propose to estimate the pixel-wise aleatoric uncertainty during the sampling phase of diffusion models and utilise the uncertainty to improve the sample generation quality. The uncertainty is computed as the variance of the denoising scores with a perturbation scheme that is specifically designed for diffusion models. We then show that the aleatoric uncertainty estimates are related to the second-order derivative of the diffusion noise distribution. We evaluate our uncertainty estimation algorithm and the uncertainty-guided sampling on the ImageNet and CIFAR-10 datasets. In our comparisons with the related work, we demonstrate promising results in filtering out low quality samples. Furthermore, we show that our guided approach leads to better sample generation in terms of FID scores.

* Accepted at WACV 2025
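
As a rough illustration of "variance of the scores under perturbation", the sketch below replaces the trained denoising network with the analytic score of a one-dimensional two-mode Gaussian mixture and uses a fixed set of small offsets as the perturbation. Both choices are assumptions for illustration; the paper's perturbation scheme is not described in the abstract.

```python
import math

def mixture_score(x, mean=2.0, std=0.5):
    """Score d/dx log p(x) of an equal-weight mixture N(-mean, std^2), N(+mean, std^2)."""
    var = std * std
    return (-x + mean * math.tanh(mean * x / var)) / var

def score_variance(x, offsets=(-0.1, -0.05, 0.0, 0.05, 0.1)):
    """Variance of the score over small deterministic perturbations of x."""
    scores = [mixture_score(x + o) for o in offsets]
    avg = sum(scores) / len(scores)
    return sum((s - avg) ** 2 for s in scores) / len(scores)

# The score changes fastest between the two modes, so the variance is largest there.
print(score_variance(0.0), score_variance(2.0))
```

For small offsets the variance grows with the squared derivative of the score, that is, with the second derivative of the log-density, which is in the spirit of the second-order relation mentioned in the abstract.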

ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation

Jul 09, 2024

Yuyuan Liu, Yuanhong Chen, Hu Wang, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Abstract: The costly and time-consuming annotation process to produce large training sets for modelling semantic LiDAR segmentation methods has motivated the development of semi-supervised learning (SSL) methods. However, such SSL approaches often concentrate on employing consistency learning only for individual LiDAR representations. This narrow focus results in limited perturbations that generally fail to enable effective consistency learning. Additionally, these SSL approaches employ contrastive learning based on the sampling from a limited set of positive and negative embedding samples. This paper introduces a novel semi-supervised LiDAR semantic segmentation framework called ItTakesTwo (IT2). IT2 is designed to ensure consistent predictions from peer LiDAR representations, thereby improving the perturbation effectiveness in consistency learning. Furthermore, our contrastive learning employs informative samples drawn from a distribution of positive and negative embeddings learned from the entire training set. Results on public benchmarks show that our approach achieves remarkable improvements over the previous state-of-the-art (SOTA) methods in the field. The code is available at: https://github.com/yyliu01/IT2.

* 27 pages (15 pages main paper and 12 pages supplementary with references), ECCV 2024 accepted

Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation

May 27, 2024

Amir El-Ghoussani, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis

Abstract: In monocular depth estimation, unsupervised domain adaptation has recently been explored to relax the dependence on large annotated image-based depth datasets. However, this comes at the cost of training multiple models or requiring complex training protocols. We formulate unsupervised domain adaptation for monocular depth estimation as a consistency-based semi-supervised learning problem by assuming access only to the source domain ground truth labels. To this end, we introduce a pairwise loss function that regularises predictions on the source domain while enforcing perturbation consistency across multiple augmented views of the unlabelled target samples. Importantly, our approach is simple and effective, requiring only training of a single model in contrast to the prior work. In our experiments, we rely on the standard depth estimation benchmarks KITTI and NYUv2 to demonstrate state-of-the-art results compared to related approaches. Furthermore, we analyse the simplicity and effectiveness of our approach in a series of ablation studies. The code is available at https://github.com/AmirMaEl/SemiSupMDE.

* Accepted to Conference on Lifelong Learning Agents (CoLLAs) 2024
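
One plausible reading of the objective in the abstract is a supervised term on the labelled source domain plus a pairwise consistency term over augmented target views. The sketch below assumes an L1 distance for both terms and a single weighting factor; these choices, and the function names, are illustrative assumptions and not the paper's exact loss.

```python
def l1(a, b):
    """Mean absolute difference between two equally long depth vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

def uda_loss(source_pred, source_gt, target_view_preds, weight=1.0):
    """Supervised source loss plus pairwise consistency across target views."""
    supervised = l1(source_pred, source_gt)
    n = len(target_view_preds)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    if pairs:
        consistency = sum(l1(target_view_preds[i], target_view_preds[j])
                          for i, j in pairs) / len(pairs)
    else:
        consistency = 0.0  # a single view has nothing to be consistent with
    return supervised + weight * consistency

loss = uda_loss(
    source_pred=[1.0, 2.0], source_gt=[1.0, 3.0],
    target_view_preds=[[1.0, 1.0], [1.0, 2.0], [1.0, 1.0]],
)
print(loss)
```

Only one model produces all predictions here, which matches the single-model training the abstract emphasises; no target labels enter the loss.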

Multi-conditioned Graph Diffusion for Neural Architecture Search

Mar 09, 2024

Rohan Asthana, Joschua Conrad, Youssef Dawoud, Maurits Ortmanns, Vasileios Belagiannis

Abstract: Neural architecture search automates the design of neural network architectures usually by exploring a large and thus complex architecture search space. To advance the architecture search, we present a graph diffusion-based NAS approach that uses discrete conditional graph diffusion processes to generate high-performing neural network architectures. We then propose a multi-conditioned classifier-free guidance approach applied to graph diffusion networks to jointly impose constraints such as high accuracy and low hardware latency. Unlike the related work, our method is completely differentiable and requires only a single model training. In our evaluations, we show promising results on six standard benchmarks, yielding novel and unique architectures at a fast speed, i.e. less than 0.2 seconds per architecture. Furthermore, we demonstrate the generalisability and efficiency of our method through experiments on the ImageNet dataset.

* Transactions on Machine Learning Research (TMLR)

Pedestrian Environment Model for Automated Driving

Aug 17, 2023

Adrian Holzbock, Alexander Tsaregorodtsev, Vasileios Belagiannis

Abstract: Besides interacting correctly with other vehicles, automated vehicles should also be able to react in a safe manner to vulnerable road users like pedestrians or cyclists. For a safe interaction between pedestrians and automated vehicles, the vehicle must be able to interpret the pedestrian's behavior. Common environment models do not contain information like body poses used to understand the pedestrian's intent. In this work, we propose an environment model that includes the position of the pedestrians as well as their pose information. We only use images from a monocular camera and the vehicle's localization data as input to our pedestrian environment model. We extract the skeletal information with a neural network human pose estimator from the image. Furthermore, we track the skeletons with a simple tracking algorithm based on the Hungarian algorithm and an ego-motion compensation. To obtain the 3D information of the position, we aggregate the data from consecutive frames in conjunction with the vehicle position. We demonstrate our pedestrian environment model on data generated with the CARLA simulator and the nuScenes dataset. Overall, we reach a relative position error of around 16% on both datasets.

* Accepted for presentation at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023), 24-28 September 2023, Bilbao, Bizkaia, Spain
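
The tracking step in the abstract (ego-motion compensation followed by an optimal assignment) can be sketched as follows. To stay dependency-free, the assignment is solved by brute force over permutations, which returns the same optimal matching as the Hungarian algorithm for the tiny inputs used here but does not scale. Reducing ego-motion to a 2D shift and each skeleton to a single centre point are simplifying assumptions, and the function names are hypothetical.

```python
import math
from itertools import permutations

def compensate_ego_motion(prev_positions, ego_shift):
    """Move previous-frame skeleton centres by the shift caused by ego-motion."""
    dx, dy = ego_shift
    return [(x + dx, y + dy) for x, y in prev_positions]

def match_tracks(tracks, detections):
    """Minimum-cost one-to-one matching (brute-force stand-in for Hungarian)."""
    if len(tracks) > len(detections):
        raise ValueError("this sketch assumes at least as many detections as tracks")
    best_cost, best_perm = math.inf, ()
    for perm in permutations(range(len(detections)), len(tracks)):
        cost = sum(math.dist(tracks[i], detections[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost, best_perm = cost, perm
    return list(enumerate(best_perm))  # (track index, detection index) pairs

tracks = compensate_ego_motion([(0.0, 0.0), (10.0, 0.0)], ego_shift=(5.0, 0.0))
print(match_tracks(tracks, [(15.2, 0.0), (4.9, 0.1)]))
```

In practice an assignment solver such as `scipy.optimize.linear_sum_assignment` would replace the permutation loop.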

Out-of-Distribution Detection for Monocular Depth Estimation

Aug 11, 2023

Julia Hornauer, Adrian Holzbock, Vasileios Belagiannis

Abstract: In monocular depth estimation, uncertainty estimation approaches mainly target the data uncertainty introduced by image noise. In contrast to prior work, we address the uncertainty due to lack of knowledge, which is relevant for the detection of data not represented by the training distribution, the so-called out-of-distribution (OOD) data. Motivated by anomaly detection, we propose to detect OOD images from an encoder-decoder depth estimation model based on the reconstruction error. Given the features extracted with the fixed depth encoder, we train an image decoder for image reconstruction using only in-distribution data. Consequently, OOD images result in a high reconstruction error, which we use to distinguish between in- and out-of-distribution samples. We build our experiments on the standard NYU Depth V2 and KITTI benchmarks as in-distribution data. Our post hoc method performs astonishingly well on different models and outperforms existing uncertainty estimation approaches without modifying the trained encoder-decoder depth estimation model.

* Accepted to ICCV 2023
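
The scoring step described in the abstract can be sketched without any network: given an image and its reconstruction, compute the error and compare it with a threshold. The mean squared error, the percentile-based threshold calibration on in-distribution errors, and all names below are assumptions for illustration; the abstract only states that the reconstruction error is used as the OOD signal.

```python
def reconstruction_error(image, reconstruction):
    """Mean squared error between flattened image and reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(image, reconstruction)) / len(image)

def calibrate_threshold(id_errors, keep_percent=95):
    """Pick a threshold from in-distribution errors (assumed percentile rule)."""
    ordered = sorted(id_errors)
    index = min(len(ordered) - 1, (len(ordered) * keep_percent) // 100)
    return ordered[index]

def is_ood(image, reconstruction, threshold):
    """Flag an image as OOD when its reconstruction error exceeds the threshold."""
    return reconstruction_error(image, reconstruction) > threshold

threshold = calibrate_threshold([0.1, 0.3, 0.2, 0.05, 0.25])
print(is_ood([0.0, 1.0], [1.0, 1.0], threshold))   # poorly reconstructed image
print(is_ood([0.0, 1.0], [0.1, 0.9], threshold))   # well reconstructed image
```

Because the depth encoder stays fixed and only an extra image decoder is trained, this scoring runs alongside the depth model without changing its predictions.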

SelectNAdapt: Support Set Selection for Few-Shot Domain Adaptation

Aug 09, 2023

Youssef Dawoud, Gustavo Carneiro, Vasileios Belagiannis

Abstract: Generalisation of deep neural networks becomes vulnerable when distribution shifts are encountered between train (source) and test (target) domain data. Few-shot domain adaptation mitigates this issue by adapting deep neural networks pre-trained on the source domain to the target domain using a randomly selected and annotated support set from the target domain. This paper argues that randomly selecting the support set can be further improved for effectively adapting the pre-trained source models to the target domain. Instead, we propose SelectNAdapt, an algorithm to curate the selection of the target domain samples, which are then annotated and included in the support set. In particular, for the K-shot adaptation problem, we first leverage self-supervision to learn features of the target domain data. Then, we propose a per-class clustering scheme of the learned target domain features and select K representative target samples using a distance-based scoring function. Finally, we bring our selection setup towards a practical ground by relying on pseudo-labels for clustering semantically similar target domain samples. Our experiments show promising results on three few-shot domain adaptation benchmarks for image recognition compared to related approaches and the standard random selection.

* Accepted to ICCV Workshop
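
The selection step can be approximated by a very small sketch: group target features by pseudo-label, then keep the K samples closest to each group's centroid. Using a single centroid per class and plain Euclidean distance is a simplification chosen here for illustration; the paper's clustering scheme and scoring function may differ, and the function name is hypothetical.

```python
import math
from collections import defaultdict

def select_support_set(features, pseudo_labels, k):
    """Return, per pseudo-class, the indices of the k samples nearest the centroid."""
    by_class = defaultdict(list)
    for index, label in enumerate(pseudo_labels):
        by_class[label].append(index)

    selected = {}
    for label, indices in by_class.items():
        dim = len(features[indices[0]])
        centroid = [sum(features[i][d] for i in indices) / len(indices)
                    for d in range(dim)]
        # Distance-based score: smaller distance means more representative.
        ranked = sorted(indices, key=lambda i: math.dist(features[i], centroid))
        selected[label] = ranked[:k]
    return selected

features = [(0.0, 0.0), (1.0, 0.0), (0.4, 0.0),
            (10.0, 10.0), (10.0, 12.0), (10.0, 11.0)]
print(select_support_set(features, [0, 0, 0, 1, 1, 1], k=1))
```

The selected indices would then be sent for annotation and used as the K-shot support set, in place of a random draw.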

Joint Out-of-Distribution Detection and Uncertainty Estimation for Trajectory Prediction

Aug 04, 2023

Julian Wiederer, Julian Schmidt, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis

Abstract: Despite the significant research efforts on trajectory prediction for automated driving, limited work exists on assessing the prediction reliability. To address this limitation, we propose an approach that covers two sources of error, namely novel situations with out-of-distribution (OOD) detection and the complexity in in-distribution (ID) situations with uncertainty estimation. We introduce two modules next to an encoder-decoder network for trajectory prediction. Firstly, a Gaussian mixture model learns the probability density function of the ID encoder features during training, and then it is used to detect the OOD samples in regions of the feature space with low likelihood. Secondly, an error regression network is applied to the encoder, which learns to estimate the trajectory prediction error in supervised training. During inference, the estimated prediction error is used as the uncertainty. In our experiments, the combination of both modules outperforms the prior work in OOD detection and uncertainty estimation on the Shifts robust trajectory prediction dataset by 2.8% and 10.1%, respectively. The code is publicly available.

* Accepted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)
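
The first module in the abstract, likelihood-based OOD detection with a Gaussian mixture model, can be illustrated on one-dimensional features. The mixture parameters are fixed by hand here instead of being learned from encoder features, and the threshold value is arbitrary; both are assumptions made to keep the sketch short.

```python
import math

def gmm_log_likelihood(x, weights, means, stds):
    """Log-density of a 1-D Gaussian mixture at x, via log-sum-exp for stability."""
    terms = [
        math.log(w) - math.log(s) - 0.5 * math.log(2.0 * math.pi)
        - 0.5 * ((x - m) / s) ** 2
        for w, m, s in zip(weights, means, stds)
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))

def is_ood(x, weights, means, stds, threshold):
    """Flag a feature as OOD when it falls in a low-likelihood region."""
    return gmm_log_likelihood(x, weights, means, stds) < threshold

params = ([0.5, 0.5], [-1.0, 1.0], [1.0, 1.0])
print(is_ood(0.0, *params, threshold=-10.0))  # near the training features
print(is_ood(8.0, *params, threshold=-10.0))  # far from both components
```

The second module, the error regression network, is a separately trained predictor and is not covered by this sketch.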