Dennis Gross

Semi-supervised CAPP Transformer Learning via Pseudo-labeling

Feb 01, 2026

Dennis Gross, Helge Spieker, Arnaud Gotlieb, Emmanuel Stathatos, Panorios Benardos, George-Christopher Vosniakos

Abstract: High-level Computer-Aided Process Planning (CAPP) generates manufacturing process plans from part specifications. It suffers from limited dataset availability in industry, which reduces model generalization. We propose a semi-supervised learning approach to improve transformer-based CAPP models without manual labeling. An oracle, trained on available transformer behaviour data, filters correct predictions from unseen parts, which are then used for one-shot retraining. Experiments on small-scale datasets with simulated ground truth across the full data distribution show consistent accuracy gains over baselines, demonstrating the method's effectiveness in data-scarce manufacturing environments.
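The oracle-filtered pseudo-labeling step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the names `pseudo_label`, `one_shot_retraining_set`, `predict`, and `oracle_accepts` are hypothetical, and the model and oracle are passed in as plain callables.

```python
from typing import Callable, Hashable, Iterable, List, Tuple

Part = Hashable          # hypothetical stand-in for a part specification
Plan = Tuple[str, ...]   # hypothetical stand-in for a process plan


def pseudo_label(
    unlabeled_parts: Iterable[Part],
    predict: Callable[[Part], Plan],
    oracle_accepts: Callable[[Part, Plan], bool],
) -> List[Tuple[Part, Plan]]:
    """Keep only the predictions that the oracle judges to be correct."""
    accepted = []
    for part in unlabeled_parts:
        plan = predict(part)
        if oracle_accepts(part, plan):
            accepted.append((part, plan))
    return accepted


def one_shot_retraining_set(labeled, unlabeled_parts, predict, oracle_accepts):
    """Original labeled pairs plus oracle-filtered pseudo-labels, built once."""
    return list(labeled) + pseudo_label(unlabeled_parts, predict, oracle_accepts)
```

The retraining itself is left out on purpose; the sketch only shows how the filtered pseudo-labels are combined with the existing labeled data before a single retraining pass.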

Translating the Rashomon Effect to Sequential Decision-Making Tasks

Dec 19, 2025

Dennis Gross, Jørn Eirik Betten, Helge Spieker

Abstract: The Rashomon effect describes the phenomenon where multiple models trained on the same data produce identical predictions while differing in which features they rely on internally. This effect has been studied extensively in classification tasks, but not in sequential decision-making, where an agent learns a policy to achieve an objective by taking actions in an environment. In this paper, we translate the Rashomon effect to sequential decision-making. We define it as multiple policies that exhibit identical behavior, visiting the same states and selecting the same actions, while differing in their internal structure, such as feature attributions. Verifying identical behavior in sequential decision-making differs from classification. In classification, predictions can be directly compared to ground-truth labels. In sequential decision-making with stochastic transitions, the same policy may succeed or fail on any single trajectory due to randomness. We address this using formal verification methods that construct and compare the complete probabilistic behavior of each policy in the environment. Our experiments demonstrate that the Rashomon effect exists in sequential decision-making. We further show that ensembles constructed from the Rashomon set exhibit greater robustness to distribution shifts than individual policies. Additionally, permissive policies derived from the Rashomon set reduce computational requirements for verification while maintaining optimal performance.

Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies

Jan 06, 2025

Dennis Gross, Helge Spieker

Abstract: Deep reinforcement learning (RL) policies can demonstrate unsafe behaviors and are challenging to interpret. To address these challenges, we combine RL policy model checking (a technique for determining whether RL policies exhibit unsafe behaviors) with co-activation graph analysis (a method that maps neural network inner workings by analyzing neuron activation patterns) to gain insight into the safe RL policy's sequential decision-making. This combination lets us interpret the RL policy's inner workings for safe decision-making. We demonstrate its applicability in various experiments.
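As a rough illustration of what a co-activation graph can look like, the sketch below connects two neurons whenever their recorded activations are strongly correlated. It assumes Pearson correlation as the edge weight and a fixed threshold; the function names, the `threshold` parameter, and the data layout are hypothetical and are not taken from the paper.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equally long activation sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant neuron carries no co-activation signal
    return cov / sqrt(var_x * var_y)


def co_activation_graph(activations, threshold=0.5):
    """Map neuron pairs to their correlation, keeping only strong edges.

    `activations` maps a neuron name to its activation values, one value
    per visited state (assumed layout).
    """
    edges = {}
    for a, b in combinations(sorted(activations), 2):
        weight = pearson(activations[a], activations[b])
        if abs(weight) >= threshold:
            edges[(a, b)] = weight
    return edges
```

In the setting of the abstract, the activations would be recorded on states visited by a policy that has already passed model checking, so the resulting graph describes the inner workings behind verified-safe behavior.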

Turn-based Multi-Agent Reinforcement Learning Model Checking

Jan 06, 2025

Dennis Gross

Abstract: In this paper, we propose a novel approach for verifying the compliance of turn-based multi-agent reinforcement learning (TMARL) agents with complex requirements in stochastic multiplayer games. Our method overcomes the limitations of existing verification approaches, which are inadequate for dealing with TMARL agents and not scalable to large games with multiple agents. Our approach relies on tight integration of TMARL and a verification technique referred to as model checking. We demonstrate the effectiveness and scalability of our technique through experiments in different types of environments. Our experiments show that our method is suited to verify TMARL agents and scales better than naive monolithic model checking.

Enhancing RL Safety with Counterfactual LLM Reasoning

Sep 16, 2024

Dennis Gross, Helge Spieker

Abstract: Reinforcement learning (RL) policies may exhibit unsafe behavior and are hard to explain. We use counterfactual large language model reasoning to enhance RL policy safety post-training. We show that our approach improves RL policy safety and helps to explain it.

Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies

Sep 16, 2024

Dennis Gross, Helge Spieker

Abstract: Pruning neural networks (NNs) can streamline them but risks removing vital parameters from safe reinforcement learning (RL) policies. We introduce an interpretable RL method called VERINTER, which combines NN pruning with model checking to ensure interpretable RL safety. VERINTER exactly quantifies the effects of pruning and the impact of neural connections on complex safety properties by analyzing changes in safety measurements. This method maintains safety in pruned RL policies and enhances understanding of their safety dynamics, and it has proven effective in multiple RL settings.

Efficient Milling Quality Prediction with Explainable Machine Learning

Sep 16, 2024

Dennis Gross, Helge Spieker, Arnaud Gotlieb, Ricardo Knoblauch, Mohamed Elmansori

Abstract: This paper presents an explainable machine learning (ML) approach for predicting surface roughness in milling. Utilizing a dataset from milling aluminum alloy 2017A, the study employs random forest regression models and feature importance techniques. The key contributions include developing ML models that accurately predict various roughness values and identifying redundant sensors, particularly those for measuring normal cutting force. Our experiments show that removing certain sensors can reduce costs without sacrificing predictive accuracy, highlighting the potential of explainable machine learning to improve cost-effectiveness in machining.

* arXiv admin note: substantial text overlap with arXiv:2403.18731

Probabilistic Model Checking of Stochastic Reinforcement Learning Policies

Mar 27, 2024

Dennis Gross, Helge Spieker

Abstract: We introduce a method to verify stochastic reinforcement learning (RL) policies. This approach is compatible with any RL algorithm as long as the algorithm and its corresponding environment collectively adhere to the Markov property. In this setting, the future state of the environment should depend solely on its current state and the action executed, independent of any previous states or actions. Our method integrates a verification technique, referred to as model checking, with RL, leveraging a Markov decision process, a trained RL policy, and a probabilistic computation tree logic (PCTL) formula to build a formal model that can be subsequently verified via the model checker Storm. We demonstrate our method's applicability across multiple benchmarks, comparing it to baseline methods called deterministic safety estimates and naive monolithic model checking. Our results show that our method is suited to verify stochastic RL policies.
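The idea of combining a Markov decision process with a stochastic policy into one formal model can be illustrated with a small sketch. It builds the Markov chain induced by the policy and estimates a reachability probability by plain fixed-point iteration, which stands in here for a real model checker such as Storm. The dictionary layout, the function names, and the `iterations` parameter are assumptions made for this example and are not the paper's implementation.

```python
from typing import Dict, Hashable, Set

State = Hashable
Action = Hashable
# mdp[state][action] -> {next_state: probability}   (assumed layout)
MDP = Dict[State, Dict[Action, Dict[State, float]]]
# policy[state] -> {action: probability}            (assumed layout)
Policy = Dict[State, Dict[Action, float]]
Chain = Dict[State, Dict[State, float]]


def induced_chain(mdp: MDP, policy: Policy) -> Chain:
    """Weight each action's transitions by the policy's action probability."""
    chain: Chain = {}
    for state, actions in mdp.items():
        row: Dict[State, float] = {}
        for action, action_prob in policy.get(state, {}).items():
            for nxt, p in actions[action].items():
                row[nxt] = row.get(nxt, 0.0) + action_prob * p
        chain[state] = row  # an empty row means the state is absorbing
    return chain


def reach_probability(chain: Chain, targets: Set[State], start: State,
                      iterations: int = 1000) -> float:
    """Approximate the probability of eventually reaching a target state."""
    values = {s: (1.0 if s in targets else 0.0) for s in chain}
    for _ in range(iterations):
        values = {
            s: 1.0 if s in targets else sum(
                p * values.get(n, 1.0 if n in targets else 0.0)
                for n, p in chain[s].items()
            )
            for s in chain
        }
    return values[start]
```

The quantity computed corresponds to a PCTL-style reachability query of the form "what is the probability of eventually reaching a target state". A dedicated model checker computes this exactly and supports far richer properties; the iteration above only approximates it for small, explicitly enumerated models.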

Enhancing Manufacturing Quality Prediction Models through the Integration of Explainability Methods

Mar 27, 2024

Dennis Gross, Helge Spieker, Arnaud Gotlieb, Ricardo Knoblauch

Abstract: This research presents a method that uses explainability techniques to improve the performance of machine learning (ML) models in forecasting the quality of milling processes, demonstrated through a manufacturing use case. The methodology entails the initial training of ML models, followed by a fine-tuning phase where irrelevant features identified through explainability methods are eliminated. This procedural refinement results in performance enhancements, paving the way for potential reductions in manufacturing costs and a better understanding of the trained ML models. This study highlights the usefulness of explainability techniques in both explaining and optimizing predictive models in the manufacturing realm.

Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking

Dec 10, 2022

Dennis Gross, Thiago D. Simao, Nils Jansen, Guillermo A. Perez

Abstract: Deep Reinforcement Learning (RL) agents are susceptible to adversarial noise in their observations that can mislead their policies and decrease their performance. However, an adversary may be interested not only in decreasing the reward, but also in modifying specific temporal logic properties of the policy. This paper presents a metric that measures the exact impact of adversarial attacks against such properties. We use this metric to craft optimal adversarial attacks. Furthermore, we introduce a model checking method that allows us to verify the robustness of RL policies against adversarial attacks. Our empirical analysis confirms (1) the quality of our metric to craft adversarial attacks against temporal logic properties, and (2) that we are able to concisely assess a system's robustness against attacks.

* ICAART 2023 Paper (Technical Report)
