Abstract: In recent years, research on out-of-distribution (OoD) detection for semantic segmentation has mainly focused on road scenes -- a domain with a constrained amount of semantic diversity. In this work, we challenge this constraint and extend the domain of this task to general natural images. To this end, we introduce: (1) the ADE-OoD benchmark, which is based on the ADE20k dataset and includes images from diverse domains with high semantic diversity, and (2) a novel approach that uses diffusion score matching for OoD detection (DOoD) and is robust to the increased semantic diversity. ADE-OoD features indoor and outdoor images, defines 150 semantic categories as in-distribution, and contains a variety of OoD objects. For DOoD, we train a diffusion model with an MLP architecture on semantic in-distribution embeddings and build on the score matching interpretation to compute pixel-wise OoD scores at inference time. On common road scene OoD benchmarks, DOoD performs on par with or better than the state of the art, without using outliers for training or making assumptions about the data domain. On ADE-OoD, DOoD outperforms previous approaches, but leaves much room for future improvements.
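The score-matching mechanism can be sketched in a few lines. Below is a minimal illustration, assuming a toy MLP in place of the trained diffusion model; the names (`ScoreMLP`, `ood_scores`, `sigma`) are ours for illustration and not the paper's API.

```python
import torch
import torch.nn as nn

class ScoreMLP(nn.Module):
    """Toy stand-in for a diffusion score model trained on in-distribution embeddings."""
    def __init__(self, dim=64, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.SiLU(),
            nn.Linear(hidden, hidden), nn.SiLU(),
            nn.Linear(hidden, dim),
        )

    def forward(self, x):
        return self.net(x)

@torch.no_grad()
def ood_scores(score_mlp, embeddings, sigma=0.1):
    """embeddings: (H*W, dim) per-pixel features from a segmentation backbone.
    Perturb with Gaussian noise and read off the magnitude of the predicted
    score, which approximates grad log p(x): in-distribution pixels lie near
    high-density regions where this gradient is small, so large norms flag
    likely OoD pixels."""
    noisy = embeddings + sigma * torch.randn_like(embeddings)
    score = score_mlp(noisy)
    return score.norm(dim=-1)  # (H*W,) pixel-wise OoD scores

# Usage with random features standing in for real backbone embeddings:
feats = torch.randn(128 * 128, 64)
print(ood_scores(ScoreMLP(dim=64), feats).shape)
```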
Abstract: Manipulation policies learned from image inputs often show weak task transfer capabilities. In contrast, visual servoing methods allow efficient task transfer in high-precision scenarios while requiring only a few demonstrations. In this work, we present a framework that formulates the visual servoing task as graph traversal. Our method not only extends the robustness of visual servoing, but also enables multitask capability based on a few task-specific demonstrations. We construct demonstration graphs by splitting existing demonstrations and recombining them. To traverse the demonstration graph at inference time, we utilize a similarity function that helps select the best demonstration for a specific task, which enables us to compute the shortest path through the graph. Ultimately, we show that recombining demonstrations leads to higher task-specific success rates. We present extensive simulation and real-world experimental results that demonstrate the efficacy of our approach.
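As a concrete sketch of the graph-traversal idea, the snippet below runs a plain Dijkstra search over recombined demonstration segments, turning the similarity function into an edge cost so that well-matching segments are preferred. The data layout and names (`shortest_demo_path`, `segments`) are our assumptions, not the paper's implementation.

```python
import heapq

def shortest_demo_path(segments, similarity, start, goal):
    """segments: dict node -> list of (neighbor, segment_feature) edges,
    obtained by splitting demonstrations and recombining them at matching
    states. similarity: scores how well a segment fits the current task
    (higher is better); we invert it into a cost for Dijkstra."""
    dist = {start: 0.0}
    prev = {}
    heap = [(0.0, start)]
    while heap:
        d, node = heapq.heappop(heap)
        if node == goal:
            break
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry
        for nbr, feat in segments.get(node, []):
            cost = 1.0 / (1e-6 + similarity(feat))  # good match -> cheap edge
            nd = d + cost
            if nd < dist.get(nbr, float("inf")):
                dist[nbr] = nd
                prev[nbr] = node
                heapq.heappush(heap, (nd, nbr))
    # Reconstruct the node sequence to execute with visual servoing.
    path, node = [goal], goal
    while node != start:
        node = prev[node]
        path.append(node)
    return path[::-1]

# Toy graph: nodes are servoing states, segment features are scalars for brevity.
g = {"start": [("a", 0.9), ("b", 0.2)], "a": [("goal", 0.8)], "b": [("goal", 0.9)]}
print(shortest_demo_path(g, similarity=lambda f: f, start="start", goal="goal"))
```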
Abstract: The key to out-of-distribution detection is density estimation of the in-distribution data or of its feature representations. While good parametric solutions to this problem exist for well-curated classification data, they are less suitable for complex domains, such as semantic segmentation. In this paper, we show that a k-Nearest-Neighbors approach can achieve surprisingly good results with small reference datasets and runtimes, and that it is robust with respect to hyperparameters, such as the number of neighbors and the choice of the support set size. Moreover, we show that it combines well with anomaly scores from standard parametric approaches, and we find that transformer features are particularly well suited for detecting novel objects in combination with k-Nearest-Neighbors. Ultimately, the approach is simple and non-invasive, i.e., it does not affect the primary segmentation performance, avoids training on examples of anomalies, and achieves state-of-the-art results on the common benchmarks, with +23% and +16% AP improvements on RoadAnomaly and StreetHazards, respectively.
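The core of the approach is the generic kNN recipe, sketched below under our own naming: the anomaly score of a test feature is its distance to the k nearest in-distribution reference features (here computed on L2-normalized, i.e. cosine, features).

```python
import numpy as np

def knn_ood_scores(ref_feats, test_feats, k=5):
    """ref_feats: (N, d) in-distribution reference features (e.g. from a
    transformer backbone); test_feats: (M, d) per-pixel features to score.
    Returns one anomaly score per test feature; this is the generic kNN
    recipe, not the paper's exact code."""
    ref = ref_feats / np.linalg.norm(ref_feats, axis=1, keepdims=True)
    test = test_feats / np.linalg.norm(test_feats, axis=1, keepdims=True)
    sim = test @ ref.T                   # (M, N) cosine similarities
    topk = np.sort(sim, axis=1)[:, -k:]  # k most similar references
    return 1.0 - topk.mean(axis=1)       # larger = more anomalous

ref = np.random.randn(1000, 64)
test = np.random.randn(16, 64)
print(knn_ood_scores(ref, test).shape)
```

Combining this with a parametric score is then as simple as summing it per pixel with, e.g., a normalized max-logit score.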
Abstract: Detection of out-of-distribution (OoD) samples in the context of image classification has recently become an area of interest and active study, along with the closely related topic of uncertainty estimation. In this paper, we explore the task of OoD segmentation, which has been studied less than its classification counterpart and presents additional challenges. Segmentation is a dense prediction task for which the model's outcome at each pixel depends on its surroundings. The receptive field and the reliance on context play a role in distinguishing different classes and, correspondingly, in spotting OoD entities. We introduce MOoSe, an efficient strategy to leverage the various levels of context represented within semantic segmentation models, and show that even a simple aggregation of multi-scale representations has consistently positive effects on OoD detection and uncertainty estimation.
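A simple aggregation of this kind can be sketched as follows: upsample per-scale logits to a common resolution, score each scale with per-pixel entropy, and average. This illustrates the general multi-scale idea from the abstract, not MOoSe's exact architecture.

```python
import torch
import torch.nn.functional as F

def multiscale_ood(logit_maps):
    """logit_maps: list of (1, C, h_i, w_i) logits taken at different scales
    of a segmentation model, with the finest map first. Each map is upsampled
    to the finest resolution and scored with per-pixel entropy; the scores
    are averaged across scales into a single OoD map."""
    target = logit_maps[0].shape[-2:]
    scores = []
    for logits in logit_maps:
        logits = F.interpolate(logits, size=target, mode="bilinear",
                               align_corners=False)
        p = logits.softmax(dim=1)
        entropy = -(p * p.clamp_min(1e-8).log()).sum(dim=1)  # (1, H, W)
        scores.append(entropy)
    return torch.stack(scores).mean(dim=0)  # aggregated per-pixel OoD map

# Toy logits at three scales, 19 classes:
maps = [torch.randn(1, 19, s, s) for s in (128, 64, 32)]
print(multiscale_ood(maps).shape)
```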
Abstract: Contemporary neural networks are limited in their ability to learn from evolving streams of training data. When trained sequentially on new or evolving tasks, their accuracy drops sharply, making them unsuitable for many real-world applications. In this work, we shed light on the causes of this well-known yet unsolved phenomenon -- often referred to as catastrophic forgetting -- in a class-incremental setup. We show that a combination of simple components and a loss that balances intra-task and inter-task learning can already resolve forgetting to the same extent as more complex measures proposed in the literature. Moreover, we identify the poor quality of the learned representation as another cause of catastrophic forgetting in class-IL. We show that performance is correlated with the secondary class information (dark knowledge) learned by the model and that it can be improved by an appropriate regularizer. With these lessons learned, class-incremental learning results on CIFAR-100 and ImageNet improve over the state of the art by a large margin, while keeping the approach simple.
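A common instantiation of such a loss combines cross-entropy on the current task (intra-task learning) with knowledge distillation against the previous model's logits, which preserves the secondary class information (inter-task learning). The sketch below is this generic recipe with illustrative hyperparameters `T` and `alpha`, not necessarily the paper's exact loss.

```python
import torch
import torch.nn.functional as F

def class_il_loss(logits, labels, old_logits, T=2.0, alpha=0.5):
    """logits: (B, C_total) from the current model; labels: (B,) ground truth;
    old_logits: (B, C_old) from the frozen previous-task model. The KD term
    matches temperature-softened distributions over the old classes, keeping
    the model's "dark knowledge" about them intact."""
    ce = F.cross_entropy(logits, labels)
    n_old = old_logits.shape[1]
    kd = F.kl_div(
        F.log_softmax(logits[:, :n_old] / T, dim=1),
        F.softmax(old_logits / T, dim=1),
        reduction="batchmean",
    ) * T * T
    return (1 - alpha) * ce + alpha * kd

# Toy batch: 10 old classes grown to 15, batch size 4.
loss = class_il_loss(torch.randn(4, 15), torch.randint(0, 15, (4,)),
                     torch.randn(4, 10))
print(loss.item())
```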
Abstract: Optical flow estimation can be formulated as an end-to-end supervised learning problem, which yields estimates with a superior accuracy-runtime tradeoff compared to alternative methods. In this paper, we make such networks estimate their local uncertainty about the correctness of their prediction, which is vital information when building decisions on top of the estimations. For the first time, we compare several strategies and techniques to estimate uncertainty in a large-scale computer vision task like optical flow estimation. Moreover, we introduce a new network architecture and loss function that enforce complementary hypotheses and provide uncertainty estimates efficiently with a single forward pass, without the need for sampling or ensembles. We demonstrate the quality of the uncertainty estimates, which clearly surpasses that of previous confidence measures for optical flow while allowing for interactive frame rates.
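To make the single-forward-pass idea concrete, the sketch below shows a standard way a flow network can be trained to emit per-pixel uncertainty: predict a log-variance alongside the flow and minimize a Gaussian negative log-likelihood. This illustrates the general technique only; the paper's multi-hypothesis loss is more elaborate.

```python
import torch

def nll_flow_loss(flow_pred, log_var, flow_gt):
    """flow_pred, flow_gt: (B, 2, H, W) flow fields; log_var: (B, 1, H, W)
    predicted per-pixel log-variance. Minimizing the Gaussian NLL trains the
    network to widen its variance exactly where its flow error is large,
    yielding uncertainty from a single forward pass."""
    err2 = (flow_pred - flow_gt).pow(2).sum(dim=1, keepdim=True)  # squared EPE
    nll = 0.5 * (err2 * torch.exp(-log_var) + log_var)
    return nll.mean()

# Toy tensors in place of network outputs and ground truth:
pred, gt = torch.randn(2, 2, 64, 64), torch.randn(2, 2, 64, 64)
print(nll_flow_loss(pred, torch.zeros(2, 1, 64, 64), gt).item())
```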