Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maximilian Nitsche

IBM Consulting, Germany, Karlsruhe Institute of Technology, Germany

Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation

Mar 20, 2025

Clive Tinashe Marimo, Benedikt Blumenstiel, Maximilian Nitsche, Johannes Jakubik, Thomas Brunschwiler

Abstract:Vision-language models for Earth observation (EO) typically rely on the visual spectrum of data as the only model input, thus failing to leverage the rich spectral information available in the multispectral channels recorded by satellites. Therefore, in this paper, we introduce Llama3-MS-CLIP, the first vision-language model pre-trained with contrastive learning on a large-scale multispectral dataset and report on the performance gains due to the extended spectral range. Furthermore, we present the largest-to-date image-caption dataset for multispectral data, consisting of one million Sentinel-2 samples and corresponding textual descriptions generated with Llama3-LLaVA-Next and Overture Maps data. We develop a scalable captioning pipeline, which is validated by domain experts. We evaluate Llama3-MS-CLIP on multispectral zero-shot image classification and retrieval using three datasets of varying complexity. Our results demonstrate that Llama3-MS-CLIP significantly outperforms other RGB-based approaches, improving classification accuracy by 6.77% on average and retrieval performance by 4.63% mAP compared to the second-best model. Our results emphasize the relevance of multispectral vision-language learning. We release the image-caption dataset, code, and model weights under an open-source license.

Via

Access Paper or Ask Questions

Investigating the Role of Explainability and AI Literacy in User Compliance

Jun 18, 2024

Niklas Kühl, Christian Meske, Maximilian Nitsche, Jodie Lobana

Abstract:AI is becoming increasingly common across different domains. However, as sophisticated AI-based systems are often black-boxed, rendering the decision-making logic opaque, users find it challenging to comply with their recommendations. Although researchers are investigating Explainable AI (XAI) to increase the transparency of the underlying machine learning models, it is unclear what types of explanations are effective and what other factors increase compliance. To better understand the interplay of these factors, we conducted an experiment with 562 participants who were presented with the recommendations of an AI and two different types of XAI. We find that users' compliance increases with the introduction of XAI but is also affected by AI literacy. We also find that the relationships between AI literacy XAI and users' compliance are mediated by the users' mental model of AI. Our study has several implications for successfully designing AI-based systems utilizing XAI.

Via

Access Paper or Ask Questions

AB2CD: AI for Building Climate Damage Classification and Detection

Sep 03, 2023

Maximilian Nitsche, S. Karthik Mukkavilli, Niklas Kühl, Thomas Brunschwiler

Abstract:We explore the implementation of deep learning techniques for precise building damage assessment in the context of natural hazards, utilizing remote sensing data. The xBD dataset, comprising diverse disaster events from across the globe, serves as the primary focus, facilitating the evaluation of deep learning models. We tackle the challenges of generalization to novel disasters and regions while accounting for the influence of low-quality and noisy labels inherent in natural hazard data. Furthermore, our investigation quantitatively establishes that the minimum satellite imagery resolution essential for effective building damage detection is 3 meters and below 1 meter for classification using symmetric and asymmetric resolution perturbation analyses. To achieve robust and accurate evaluations of building damage detection and classification, we evaluated different deep learning models with residual, squeeze and excitation, and dual path network backbones, as well as ensemble techniques. Overall, the U-Net Siamese network ensemble with F-1 score of 0.812 performed the best against the xView2 challenge benchmark. Additionally, we evaluate a Universal model trained on all hazards against a flood expert model and investigate generalization gaps across events, and out of distribution from field data in the Ahr Valley. Our research findings showcase the potential and limitations of advanced AI solutions in enhancing the impact assessment of climate change-induced extreme weather events, such as floods and hurricanes. These insights have implications for disaster impact assessment in the face of escalating climate challenges.

* 9 pages, 4 figures

Via

Access Paper or Ask Questions

Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers

Jul 13, 2022

Daniel Bogdoll, Meng Zhang, Maximilian Nitsche, J. Marius Zöllner

Figure 1 for Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers

Figure 2 for Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers

Figure 3 for Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers

Figure 4 for Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers

Abstract:Great progress has been achieved in the community of autonomous driving in the past few years. As a safety-critical problem, however, anomaly detection is a huge hurdle towards a large-scale deployment of autonomous vehicles in the real world. While many approaches, such as uncertainty estimation or segmentation-based image resynthesis, are extremely promising, there is more to be explored. Especially inspired by works on anomaly detection based on image resynthesis, we propose a novel approach for anomaly detection through style transfer. We leverage generative models to map an image from its original style domain of road traffic to an arbitrary one and back to generate pixelwise anomaly scores. However, our experiments have proven our hypothesis wrong, and we were unable to produce significant results. Nevertheless, we want to share our findings, so that others can learn from our experiments.

* Daniel Bogdoll and Meng Zhang contributed equally. Accepted for publication at ICECCME 2022

Via

Access Paper or Ask Questions

A Meta-Analysis on the Utility of Explainable Artificial Intelligence in Human-AI Decision-Making

May 10, 2022

Max Schemmer, Patrick Hemmer, Maximilian Nitsche, Niklas Kühl, Michael Vössing

Figure 1 for A Meta-Analysis on the Utility of Explainable Artificial Intelligence in Human-AI Decision-Making

Figure 2 for A Meta-Analysis on the Utility of Explainable Artificial Intelligence in Human-AI Decision-Making

Figure 3 for A Meta-Analysis on the Utility of Explainable Artificial Intelligence in Human-AI Decision-Making

Figure 4 for A Meta-Analysis on the Utility of Explainable Artificial Intelligence in Human-AI Decision-Making

Abstract:Research in Artificial Intelligence (AI)-assisted decision-making is experiencing tremendous growth with a constantly rising number of studies evaluating the effect of AI with and without techniques from the field of explainable AI (XAI) on human decision-making performance. However, as tasks and experimental setups vary due to different objectives, some studies report improved user decision-making performance through XAI, while others report only negligible effects. Therefore, in this article, we present an initial synthesis of existing research on XAI studies using a statistical meta-analysis to derive implications across existing research. We observe a statistically positive impact of XAI on users' performance. Additionally, first results might indicate that human-AI decision-making yields better task performance on text data. However, we find no effect of explanations on users' performance compared to sole AI predictions. Our initial synthesis gives rise to future research to investigate the underlying causes as well as contribute to further development of algorithms that effectively benefit human decision-makers in the form of explanations.

* AAI/ACM Conference on AI, Ethics, and Society (AIES) 2022

Via

Access Paper or Ask Questions

Multimodal Detection of Unknown Objects on Roads for Autonomous Driving

May 03, 2022

Daniel Bogdoll, Enrico Eisen, Maximilian Nitsche, Christin Scheib, J. Marius Zöllner

Figure 1 for Multimodal Detection of Unknown Objects on Roads for Autonomous Driving

Figure 2 for Multimodal Detection of Unknown Objects on Roads for Autonomous Driving

Figure 3 for Multimodal Detection of Unknown Objects on Roads for Autonomous Driving

Abstract:Tremendous progress in deep learning over the last years has led towards a future with autonomous vehicles on our roads. Nevertheless, the performance of their perception systems is strongly dependent on the quality of the utilized training data. As these usually only cover a fraction of all object classes an autonomous driving system will face, such systems struggle with handling the unexpected. In order to safely operate on public roads, the identification of objects from unknown classes remains a crucial task. In this paper, we propose a novel pipeline to detect unknown objects. Instead of focusing on a single sensor modality, we make use of lidar and camera data by combining state-of-the art detection models in a sequential manner. We evaluate our approach on the Waymo Open Perception Dataset and point out current research gaps in anomaly detection.

* Daniel Bogdoll, Enrico Eisen, Maximilian Nitsche and Christin Scheib contributed equally

Via

Access Paper or Ask Questions

Anomaly Detection in Autonomous Driving: A Survey

Apr 17, 2022

Daniel Bogdoll, Maximilian Nitsche, J. Marius Zöllner

Figure 1 for Anomaly Detection in Autonomous Driving: A Survey

Figure 2 for Anomaly Detection in Autonomous Driving: A Survey

Figure 3 for Anomaly Detection in Autonomous Driving: A Survey

Figure 4 for Anomaly Detection in Autonomous Driving: A Survey

Abstract:Nowadays, there are outstanding strides towards a future with autonomous vehicles on our roads. While the perception of autonomous vehicles performs well under closed-set conditions, they still struggle to handle the unexpected. This survey provides an extensive overview of anomaly detection techniques based on camera, lidar, radar, multimodal and abstract object level data. We provide a systematization including detection approach, corner case level, ability for an online application, and further attributes. We outline the state-of-the-art and point out current research gaps.

* Daniel Bogdoll and Maximilian Nitsche contributed equally. Accepted for publication at CVPR 2022 WAD workshop

Via

Access Paper or Ask Questions