Johannes Maucher

Anonymization of Documents for Law Enforcement with Machine Learning

Jan 13, 2025

Manuel Eberhardinger, Patrick Takenaka, Daniel Grießhaber, Johannes Maucher

Abstract: The steadily increasing use of data-driven methods in areas that handle sensitive personal information, such as law enforcement, demands an ever-increasing effort from these institutions to comply with data protection guidelines. In this work, we present a system for automatically anonymizing images of scanned documents, reducing manual effort while ensuring data protection compliance. Our method preserves the viability of further forensic processing after anonymization by minimizing the automatically redacted areas: it combines automatic detection of sensitive regions with knowledge from a manually anonymized reference document. Using a self-supervised image model for instance retrieval of the reference document, our approach requires only one anonymized example to efficiently redact all documents of the same type, significantly reducing processing time. We show that our approach outperforms both a purely automatic redaction system and a naive copy-paste scheme of the reference anonymization to other documents on a hand-crafted dataset of ground truth redactions.

* Accepted at IEEE Symposium on CI in Security, Defence and Biometrics 2025 (IEEE CISDB)
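
The instance-retrieval step described in the abstract above can be illustrated with a small sketch. It assumes each document image has already been mapped to an embedding vector by some image model and picks the reference with the highest cosine similarity. The names `cosine_similarity`, `retrieve_reference`, `form_a`, and `form_b`, as well as the toy vectors, are hypothetical; the paper's actual model and retrieval procedure are not specified here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve_reference(query, references):
    """Return the name of the reference embedding most similar to the query."""
    return max(references, key=lambda name: cosine_similarity(query, references[name]))


# Hypothetical embeddings of two manually anonymized reference documents.
references = {
    "form_a": [1.0, 0.0, 0.2],
    "form_b": [0.0, 1.0, 0.1],
}

# Hypothetical embedding of a newly scanned document.
query = [0.9, 0.1, 0.3]

best = retrieve_reference(query, references)
print(best)
```

In such a setup, the redaction mask of the retrieved reference could then be combined with automatically detected sensitive regions, which is the combination the abstract describes.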

Divide and Conquer: A Systematic Approach for Industrial Scale High-Definition OpenDRIVE Generation from Sparse Point Clouds

Jul 26, 2024

Leon Eisemann, Johannes Maucher

Abstract: High-definition road maps play a crucial role in the functionality and verification of highly automated driving functions. These maps contain precise information about the road network, geometry, and condition, as well as traffic signs. Despite their importance for the development and evaluation of driving functions, the generation of high-definition maps is still an ongoing research topic. While previous work in this area has primarily focused on the accuracy of road geometry, we present a novel approach for automated large-scale map generation for use in industrial applications. Our proposed method leverages a minimal amount of external information about the road to process LiDAR data in segments. These segments are subsequently combined, enabling a flexible and scalable process that achieves high-definition accuracy. Additionally, we showcase the use of the resulting OpenDRIVE in driving function simulation.

* 2024 IEEE Intelligent Vehicles Symposium (IV)
* 8 pages, 5 figures, 1 table. arXiv admin note: text overlap with arXiv:2405.07544

Unveiling the Decision-Making Process in Reinforcement Learning with Genetic Programming

Jul 20, 2024

Manuel Eberhardinger, Florian Rupp, Johannes Maucher, Setareh Maghsudi

Abstract: Despite tremendous progress, machine learning and deep learning still suffer from incomprehensible predictions. Incomprehensibility, however, is not an option for the use of (deep) reinforcement learning in the real world, as unpredictable actions can seriously harm the involved individuals. In this work, we propose a genetic programming framework to generate explanations for the decision-making process of already trained agents by imitating them with programs. Programs are interpretable and can be executed to generate explanations of why the agent chooses a particular action. Furthermore, we conduct an ablation study that investigates how extending the domain-specific language by using library learning alters the performance of the method. We compare our results with the previous state of the art for this problem and show that we are comparable in performance but require far fewer hardware resources and much less computation time.

* Accepted at: The Fifteenth International Conference on Swarm Intelligence (ICSI'2024)

Guiding Video Prediction with Explicit Procedural Knowledge

Jun 26, 2024

Patrick Takenaka, Johannes Maucher, Marco F. Huber

Abstract: We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models, and show that this leads to better performance than using data-driven models alone. We develop an architecture that facilitates latent space disentanglement in order to use the integrated procedural knowledge, and establish a setup that allows the model to learn the procedural interface in the latent space using the downstream task of video prediction. We contrast its performance with a state-of-the-art data-driven approach and show that problems where purely data-driven approaches struggle can be handled by using knowledge about the domain, providing an alternative to simply collecting more data.

* 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Paris, France, 2023, pp. 1076-1084
* Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)

Automatic Odometry-Less OpenDRIVE Generation From Sparse Point Clouds

May 13, 2024

Leon Eisemann, Johannes Maucher

Abstract: High-resolution road representations are a key factor for the success of (highly) automated driving functions. These representations, for example high-definition (HD) maps, contain accurate information on a multitude of factors, including road geometry, lane information, and traffic signs. As the complexity and functionality of automated driving functions grow, so do the requirements for testing and evaluation. This leads to an increasing interest in virtual test drives for evaluation purposes. As roads play a crucial role in traffic flow, accurate real-world representations are needed, especially when deriving realistic driving behavior data. This paper proposes a novel approach to generate realistic road representations based solely on point cloud information, independent of the LiDAR sensor and mounting position, and without the need for odometry data, multi-sensor fusion, machine learning, or highly accurate calibration. As the primary use case is simulation, we use the OpenDRIVE format for evaluation.

* 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)
* 8 pages, 4 figures, 3 algorithms, 2 tables

Trapped in texture bias? A large scale comparison of deep instance segmentation

Jan 17, 2024

Johannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling

Abstract: Do deep learning models for instance segmentation generalize to novel objects in a systematic way? For classification, such behavior has been questioned. In this study, we aim to understand if certain design decisions such as framework, architecture or pre-training contribute to the semantic understanding of instance segmentation. To answer this question, we consider a special case of robustness and compare pre-trained models on a challenging benchmark for object-centric, out-of-distribution texture. We do not introduce another method in this work. Instead, we take a step back and evaluate a broad range of existing literature. This includes Cascade and Mask R-CNN, Swin Transformer, BMask, YOLACT(++), DETR, BCNet, SOTR and SOLOv2. We find that YOLACT++, SOTR and SOLOv2 are significantly more robust to out-of-distribution texture than other frameworks. In addition, we show that deeper and dynamic architectures improve robustness, whereas training schedules, data augmentation and pre-training have only a minor impact. In summary, we evaluate 68 models on 61 versions of MS COCO for a total of 4148 evaluations.

* ECCV 2022 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part VIII. Springer-Verlag, Berlin, Heidelberg, 609-627
* Accepted at ECCV 2022. Code: https://github.com/JohannesTheo/trapped-in-texture-bias

Learning of Generalizable and Interpretable Knowledge in Grid-Based Reinforcement Learning Environments

Sep 07, 2023

Manuel Eberhardinger, Johannes Maucher, Setareh Maghsudi

Abstract: Understanding the interactions of agents trained with deep reinforcement learning is crucial for deploying agents in games or the real world. In the former, unreasonable actions confuse players. In the latter, that effect is even more significant, as unexpected behavior can cause accidents with potentially grave and long-lasting consequences for the involved individuals. In this work, we propose using program synthesis to imitate reinforcement learning policies after seeing a trajectory of the action sequence. Programs have the advantage that they are inherently interpretable and verifiable for correctness. We adapt the state-of-the-art program synthesis system DreamCoder for learning concepts in grid-based environments, specifically, a navigation task and two miniature versions of Atari games, Space Invaders and Asterix. By inspecting the generated libraries, we can make inferences about the concepts the black-box agent has learned and better understand the agent's behavior. We achieve the same by visualizing the agent's decision-making process for the imitated sequences. We evaluate our approach with different types of program synthesizers based on a search-only method, a neural-guided search, and a language model fine-tuned on code.

* to be published in AIIDE 2023

Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

Dec 04, 2020

Daniel Grießhaber, Johannes Maucher, Ngoc Thang Vu

Abstract: Recently, leveraging pre-trained Transformer-based language models in downstream, task-specific models has advanced state-of-the-art results in natural language understanding tasks. However, little research has explored the suitability of this approach in low-resource settings with fewer than 1,000 training data points. In this work, we explore fine-tuning methods of BERT -- a pre-trained Transformer-based language model -- by utilizing pool-based active learning to speed up training while keeping the cost of labeling new data constant. Our experimental results on the GLUE data set show an advantage in model performance by maximizing the approximate knowledge gain of the model when querying from the pool of unlabeled data. Finally, we demonstrate and analyze the benefits of freezing layers of the language model during fine-tuning to reduce the number of trainable parameters, making it more suitable for low-resource settings.

* COLING'2020
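
The query step of pool-based active learning mentioned in the abstract above can be sketched as follows. This is a minimal illustration that ranks unlabeled pool items by predictive entropy, a common uncertainty measure chosen here as an assumption; the paper's own acquisition function may differ. The function names `entropy` and `select_queries` and the toy probabilities are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_queries(pool_probs, k):
    """Return indices of the k pool items with the highest predictive entropy."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical class probabilities predicted by the current model
# for four unlabeled pool examples (binary task).
pool_probs = [
    [0.98, 0.02],  # confident prediction
    [0.55, 0.45],  # uncertain prediction
    [0.80, 0.20],
    [0.50, 0.50],  # maximally uncertain prediction
]

chosen = select_queries(pool_probs, k=2)
print(chosen)
```

In a full loop, the chosen examples would be labeled, added to the training set, and the model fine-tuned again before the next query round.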

Low-Resource Text Classification using Domain-Adversarial Learning

Jul 13, 2018

Daniel Grießhaber, Ngoc Thang Vu, Johannes Maucher

Abstract: Deep learning techniques have recently been shown to be successful in many natural language processing tasks, forming state-of-the-art systems. They require, however, a large amount of annotated data, which is often missing. This paper explores the use of domain-adversarial learning as a regularizer to avoid overfitting when training domain-invariant features for deep, complex neural networks in low-resource and zero-resource settings in new target domains or languages. In the case of new languages, we show that monolingual word vectors can be directly used for training without pre-alignment. Their projection into a common space can be learnt ad hoc at training time, reaching the final performance of pretrained multilingual word vectors.

* To be published at the 6th International Conference on Statistical Language and Speech Processing (SLSP) 2018
