Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniel Flögel

SINRL: Socially Integrated Navigation with Reinforcement Learning using Spiking Neural Networks

Dec 08, 2025

Florian Tretter, Daniel Flögel, Alexandru Vasilache, Max Grobbel, Jürgen Becker, Sören Hohmann

Figure 1 for SINRL: Socially Integrated Navigation with Reinforcement Learning using Spiking Neural Networks

Figure 2 for SINRL: Socially Integrated Navigation with Reinforcement Learning using Spiking Neural Networks

Figure 3 for SINRL: Socially Integrated Navigation with Reinforcement Learning using Spiking Neural Networks

Figure 4 for SINRL: Socially Integrated Navigation with Reinforcement Learning using Spiking Neural Networks

Abstract:Integrating autonomous mobile robots into human environments requires human-like decision-making and energy-efficient, event-based computation. Despite progress, neuromorphic methods are rarely applied to Deep Reinforcement Learning (DRL) navigation approaches due to unstable training. We address this gap with a hybrid socially integrated DRL actor-critic approach that combines Spiking Neural Networks (SNNs) in the actor with Artificial Neural Networks (ANNs) in the critic and a neuromorphic feature extractor to capture temporal crowd dynamics and human-robot interactions. Our approach enhances social navigation performance and reduces estimated energy consumption by approximately 1.69 orders of magnitude.

* 8 pages, 6 figures

Via

Access Paper or Ask Questions

Disentangling Coordiante Frames for Task Specific Motion Retargeting in Teleoperation using Shared Control and VR Controllers

May 19, 2025

Max Grobbel, Daniel Flögel, Philipp Rigoll, Sören Hohmann

Abstract:Task performance in terms of task completion time in teleoperation is still far behind compared to humans conducting tasks directly. One large identified impact on this is the human capability to perform transformations and alignments, which is directly influenced by the point of view and the motion retargeting strategy. In modern teleoperation systems, motion retargeting is usually implemented through a one time calibration or switching modes. Complex tasks, like concatenated screwing, might be difficult, because the operator has to align (e.g. mirror) rotational and translational input commands. Recent research has shown, that the separation of translation and rotation leads to increased task performance. This work proposes a formal motion retargeting method, which separates translational and rotational input commands. This method is then included in a optimal control based trajectory planner and shown to work on a UR5e manipulator.

* 8 pages, 4 figures, conference

Via

Access Paper or Ask Questions

Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Sep 16, 2024

Daniel Flögel, Marcos Gómez Villafañe, Joshua Ransiek, Sören Hohmann

Figure 1 for Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Figure 2 for Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Figure 3 for Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Figure 4 for Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Abstract:Autonomous mobile robots are increasingly employed in pedestrian-rich environments where safe navigation and appropriate human interaction are crucial. While Deep Reinforcement Learning (DRL) enables socially integrated robot behavior, challenges persist in novel or perturbed scenarios to indicate when and why the policy is uncertain. Unknown uncertainty in decision-making can lead to collisions or human discomfort and is one reason why safe and risk-aware navigation is still an open problem. This work introduces a novel approach that integrates aleatoric, epistemic, and predictive uncertainty estimation into a DRL-based navigation framework for uncertainty estimates in decision-making. We, therefore, incorporate Observation-Dependent Variance (ODV) and dropout into the Proximal Policy Optimization (PPO) algorithm. For different types of perturbations, we compare the ability of Deep Ensembles and Monte-Carlo Dropout (MC-Dropout) to estimate the uncertainties of the policy. In uncertain decision-making situations, we propose to change the robot's social behavior to conservative collision avoidance. The results show that the ODV-PPO algorithm converges faster with better generalization and disentangles the aleatoric and epistemic uncertainties. In addition, the MC-Dropout approach is more sensitive to perturbations and capable to correlate the uncertainty type to the perturbation type better. With the proposed safe action selection scheme, the robot can navigate in perturbed environments with fewer collisions.

* Submitted to the IEEE for possible publication, 8 pages, 6 figures

Via

Access Paper or Ask Questions

Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning

Mar 14, 2024

Daniel Flögel, Lars Fischer, Thomas Rudolf, Tobias Schürmann, Sören Hohmann

Figure 1 for Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning

Figure 2 for Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning

Figure 3 for Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning

Figure 4 for Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning

Abstract:Mobile robots are being used on a large scale in various crowded situations and become part of our society. The socially acceptable navigation behavior of a mobile robot with individual human consideration is an essential requirement for scalable applications and human acceptance. Deep Reinforcement Learning (DRL) approaches are recently used to learn a robot's navigation policy and to model the complex interactions between robots and humans. We propose to divide existing DRL-based navigation approaches based on the robot's exhibited social behavior and distinguish between social collision avoidance with a lack of social behavior and socially aware approaches with explicit predefined social behavior. In addition, we propose a novel socially integrated navigation approach where the robot's social behavior is adaptive and emerges from the interaction with humans. The formulation of our approach is derived from a sociological definition, which states that social acting is oriented toward the acting of others. The DRL policy is trained in an environment where other agents interact socially integrated and reward the robot's behavior individually. The simulation results indicate that the proposed socially integrated navigation approach outperforms a socially aware approach in terms of distance traveled, time to completion, and negative impact on all agents within the environment.

Via

Access Paper or Ask Questions

ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometries

Jan 10, 2024

Thomas Rudolf, Daniel Flögel, Tobias Schürmann, Simon Süß, Stefan Schwab, Sören Hohmann

Abstract:Robust and performant controllers are essential for industrial applications. However, deriving controller parameters for complex and nonlinear systems is challenging and time-consuming. To facilitate automatic controller parametrization, this work presents a novel approach using deep reinforcement learning (DRL) with N-dimensional B-spline geometries (BSGs). We focus on the control of parameter-variant systems, a class of systems with complex behavior which depends on the operating conditions. For this system class, gain-scheduling control structures are widely used in applications across industries due to well-known design principles. Facilitating the expensive controller parametrization task regarding these control structures, we deploy an DRL agent. Based on control system observations, the agent autonomously decides how to adapt the controller parameters. We make the adaptation process more efficient by introducing BSGs to map the controller parameters which may depend on numerous operating conditions. To preprocess time-series data and extract a fixed-length feature vector, we use a long short-term memory (LSTM) neural networks. Furthermore, this work contributes actor regularizations that are relevant to real-world environments which differ from training. Accordingly, we apply dropout layer normalization to the actor and critic networks of the truncated quantile critic (TQC) algorithm. To show our approach's working principle and effectiveness, we train and evaluate the DRL agent on the parametrization task of an industrial control structure with parameter lookup tables.

* 7 pages, 7 figures, accepted at the 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Honolulu, HI, USA

Via

Access Paper or Ask Questions