Rüdiger Ehlers

University of Bremen and DFKI

Correct-by-Construction Runtime Enforcement in AI -- A Survey

Aug 30, 2022

Bettina Könighofer, Roderick Bloem, Rüdiger Ehlers, Christian Pek

Abstract: Runtime enforcement refers to the theories, techniques, and tools for enforcing correct behavior with respect to a formal specification of systems at runtime. In this paper, we are interested in techniques for constructing runtime enforcers for the concrete application domain of enforcing safety in AI. We discuss how safety is traditionally handled in the field of AI and how more formal guarantees on the safety of a self-learning agent can be given by integrating a runtime enforcer. We survey a selection of work on such enforcers, where we distinguish between approaches for discrete and continuous action spaces. The purpose of this paper is to foster a better understanding of advantages and limitations of different enforcement techniques, focusing on the specific challenges that arise due to their application in AI. Finally, we present some open challenges and avenues for future work.

Safe Multi-Agent Reinforcement Learning via Shielding

Feb 02, 2021

Ingy Elsayed-Aly, Suda Bharadwaj, Christopher Amato, Rüdiger Ehlers, Ufuk Topcu, Lu Feng

Abstract: Multi-agent reinforcement learning (MARL) has been increasingly used in a wide range of safety-critical applications, which require guaranteed safety (e.g., no unsafe states are ever visited) during the learning process. Unfortunately, current MARL methods do not have safety guarantees. Therefore, we present two shielding approaches for safe MARL. In centralized shielding, we synthesize a single shield to monitor all agents' joint actions and correct any unsafe action if necessary. In factored shielding, we synthesize multiple shields based on a factorization of the joint state space observed by all agents; the set of shields monitors agents concurrently and each shield is only responsible for a subset of agents at each step. Experimental results show that both approaches can guarantee the safety of agents during learning without compromising the quality of learned policies; moreover, factored shielding is more scalable in the number of agents than centralized shielding.

* 8 pages, 11 figures and 2 tables, to be published in AAMAS 2021
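
The centralized shielding idea in the abstract above (one shield that monitors the joint action and corrects it when unsafe) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy line world, the collision-freedom safety property, and the names `step`, `is_safe`, and `centralized_shield`. In particular, the paper synthesizes shields from a formal specification, whereas this sketch only uses a one-step lookahead.

```python
from itertools import product

# Hypothetical toy setup: agents move on a line of GRID_SIZE cells.
ACTIONS = (-1, 0, 1)  # move left, stay, move right
GRID_SIZE = 5


def step(positions, joint_action):
    """Deterministic toy dynamics: each agent moves, clipped to the grid."""
    return tuple(
        min(max(p + a, 0), GRID_SIZE - 1)
        for p, a in zip(positions, joint_action)
    )


def is_safe(positions):
    """Assumed safety property: no two agents occupy the same cell."""
    return len(set(positions)) == len(positions)


def centralized_shield(positions, joint_action):
    """Pass a safe joint action through; otherwise substitute a safe one.

    One-step lookahead only, a simplification of a synthesized shield.
    """
    if is_safe(step(positions, joint_action)):
        return joint_action
    candidates = [
        a for a in product(ACTIONS, repeat=len(positions))
        if is_safe(step(positions, a))
    ]
    if not candidates:
        raise RuntimeError("no safe joint action available from this state")
    # Prefer the substitute that overrides the fewest agents' actions.
    return min(
        candidates,
        key=lambda a: sum(x != y for x, y in zip(a, joint_action)),
    )


if __name__ == "__main__":
    # Both agents try to enter cell 2; the shield overrides the joint action.
    print(centralized_shield((1, 3), (1, -1)))
```

In a learning loop, the agents would propose a joint action, the shield would return either that action or a safe substitute, and only the returned action would be executed in the environment. Enumerating all joint actions grows exponentially with the number of agents, which matches the scalability concern the abstract raises about centralized shielding.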

Low-Effort Specification Debugging and Analysis

Jul 21, 2014

Rüdiger Ehlers, Vasumathi Raman

Abstract: Reactive synthesis deals with the automated construction of implementations of reactive systems from their specifications. To make the approach feasible in practice, systems engineers need effective and efficient means of debugging these specifications. In this paper, we provide techniques for report-based specification debugging, wherein salient properties of a specification are analyzed, and the result presented to the user in the form of a report. This provides a low-effort way to debug specifications, complementing high-effort techniques including the simulation of synthesized implementations. We demonstrate the usefulness of our report-based specification debugging toolkit by providing examples in the context of generalized reactivity(1) synthesis.

* EPTCS 157, 2014, pp. 117-133
* In Proceedings SYNT 2014, arXiv:1407.4937
