Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ichiro Hasuo

National Institute of Informatics

Learning Weighted Finite Automata over the Max-Plus Semiring and its Termination

Jul 13, 2024

Takamasa Okudono, Masaki Waga, Taro Sekiyama, Ichiro Hasuo

Figure 1 for Learning Weighted Finite Automata over the Max-Plus Semiring and its Termination

Abstract:Active learning of finite automata has been vigorously pursued for the purposes of analysis and explanation of black-box systems. In this paper, we study an L*-style learning algorithm for weighted automata over the max-plus semiring. The max-plus setting exposes a "consistency" issue in the previously studied semiring-generic extension of L*: we show that it can fail to maintain consistency of tables, and can thus make equivalence queries on obviously wrong hypothesis automata. We present a theoretical fix by a mathematically clean notion of column-closedness. We also present a nontrivial and reasonably broad class of weighted languages over the max-plus semiring in which our algorithm terminates.

Via

Access Paper or Ask Questions

Temporal Logic Formalisation of ISO 34502 Critical Scenarios: Modular Construction with the RSS Safety Distance

Mar 27, 2024

Jesse Reimann, Nico Mansion, James Haydon, Benjamin Bray, Agnishom Chattopadhyay, Sota Sato, Masaki Waga, Étienne André, Ichiro Hasuo, Naoki Ueda(+1 more)

Abstract:As the development of autonomous vehicles progresses, efficient safety assurance methods become increasingly necessary. Safety assurance methods such as monitoring and scenario-based testing call for formalisation of driving scenarios. In this paper, we develop a temporal-logic formalisation of an important class of critical scenarios in the ISO standard 34502. We use signal temporal logic (STL) as a logical formalism. Our formalisation has two main features: 1) modular composition of logical formulas for systematic and comprehensive formalisation (following the compositional methodology of ISO 34502); 2) use of the RSS distance for defining danger. We find our formalisation comes with few parameters to tune thanks to the RSS distance. We experimentally evaluated our formalisation; using its results, we discuss the validity of our formalisation and its stability with respect to the choice of some parameter values.

* 12 pages, 4 figures, 5 tables. Accepted to SAC 2024

Via

Access Paper or Ask Questions

Formal Verification of Safety Architectures for Automated Driving

Aug 20, 2023

Clovis Eberhart, Jérémy Dubut, James Haydon, Ichiro Hasuo

Abstract:Safety architectures play a crucial role in the safety assurance of automated driving vehicles (ADVs). They can be used as safety envelopes of black-box ADV controllers, and for graceful degradation from one ODD to another. Building on our previous work on the formalization of responsibility-sensitive safety (RSS), we introduce a novel program logic that accommodates assume-guarantee reasoning and fallback-like constructs. This allows us to formally define and prove the safety of existing and novel safety architectures. We apply the logic to a pull over scenario and experimentally evaluate the resulting safety architecture.

* In 2023 IEEE Intelligent Vehicles Symposium (IV), pp. 1-8 (2023)
* In proceedings of 2023 IEEE Intelligent Vehicles Symposium (IV), 8 pages, 5 figures

Via

Access Paper or Ask Questions

Formal Verification of Intersection Safety for Automated Driving

Aug 13, 2023

James Haydon, Martin Bondu, Clovis Eberhart, Jérémy Dubut, Ichiro Hasuo

Abstract:We build on our recent work on formalization of responsibility-sensitive safety (RSS) and present the first formal framework that enables mathematical proofs of the safety of control strategies in intersection scenarios. Intersection scenarios are challenging due to the complex interaction between vehicles; to cope with it, we extend the program logic dFHL in the previous work and introduce a novel formalism of hybrid control flow graphs on which our algorithm can automatically discover an RSS condition that ensures safety. An RSS condition thus discovered is experimentally evaluated; we observe that it is safe (as our safety proof says) and is not overly conservative.

* To appear in ITSC 2023. With appendices. 9 pages, 5 figures, 1 table

Via

Access Paper or Ask Questions

Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Jul 27, 2022

Masaki Waga, Ezequiel Castellano, Sasinee Pruekprasert, Stefan Klikovits, Toru Takisaka, Ichiro Hasuo

Figure 1 for Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Figure 2 for Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Figure 3 for Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Figure 4 for Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Abstract:It is challenging to use reinforcement learning (RL) in cyber-physical systems due to the lack of safety guarantees during learning. Although there have been various proposals to reduce undesired behaviors during learning, most of these techniques require prior system knowledge, and their applicability is limited. This paper aims to reduce undesired behaviors during learning without requiring any prior system knowledge. We propose dynamic shielding: an extension of a model-based safe RL technique called shielding using automata learning. The dynamic shielding technique constructs an approximate system model in parallel with RL using a variant of the RPNI algorithm and suppresses undesired explorations due to the shield constructed from the learned model. Through this combination, potentially unsafe actions can be foreseen before the agent experiences them. Experiments show that our dynamic shield significantly decreases the number of undesired events during training.

* This is the author (and extended) version of the manuscript of the same name published in the proceedings of the 20th International Symposium on Automated Technology for Verification and Analysis (ATVA 2022)

Via

Access Paper or Ask Questions

Goal-Aware RSS for Complex Scenarios via Program Logic

Jul 06, 2022

Ichiro Hasuo, Clovis Eberhart, James Haydon, Jérémy Dubut, Rose Bohrer, Tsutomu Kobayashi, Sasinee Pruekprasert, Xiao-Yi Zhang, Erik André Pallas, Akihisa Yamada(+5 more)

Figure 1 for Goal-Aware RSS for Complex Scenarios via Program Logic

Figure 2 for Goal-Aware RSS for Complex Scenarios via Program Logic

Figure 3 for Goal-Aware RSS for Complex Scenarios via Program Logic

Figure 4 for Goal-Aware RSS for Complex Scenarios via Program Logic

Abstract:We introduce a goal-aware extension of responsibility-sensitive safety (RSS), a recent methodology for rule-based safety guarantee for automated driving systems (ADS). Making RSS rules guarantee goal achievement -- in addition to collision avoidance as in the original RSS -- requires complex planning over long sequences of manoeuvres. To deal with the complexity, we introduce a compositional reasoning framework based on program logic, in which one can systematically develop RSS rules for smaller subscenarios and combine them to obtain RSS rules for bigger scenarios. As the basis of the framework, we introduce a program logic dFHL that accommodates continuous dynamics and safety conditions. Our framework presents a dFHL-based workflow for deriving goal-aware RSS rules; we discuss its software support, too. We conducted experimental evaluation using RSS rules in a safety architecture. Its results show that goal-aware RSS is indeed effective in realising both collision avoidance and goal achievement.

* 33 pages, 18 figures, 1 table. Accepted for publication in IEEE Transactions on Intelligent Vehicles

Via

Access Paper or Ask Questions

Responsibility-Sensitive Safety: an Introduction with an Eye to Logical Foundations and Formalization

Jun 07, 2022

Ichiro Hasuo

Figure 1 for Responsibility-Sensitive Safety: an Introduction with an Eye to Logical Foundations and Formalization

Figure 2 for Responsibility-Sensitive Safety: an Introduction with an Eye to Logical Foundations and Formalization

Figure 3 for Responsibility-Sensitive Safety: an Introduction with an Eye to Logical Foundations and Formalization

Figure 4 for Responsibility-Sensitive Safety: an Introduction with an Eye to Logical Foundations and Formalization

Abstract:Responsibility-sensitive safety (RSS) is an approach to the safety of automated driving systems (ADS). It aims to introduce mathematically formulated safety rules, compliance with which guarantees collision avoidance as a mathematical theorem. However, despite the emphasis on mathematical and logical guarantees, the logical foundations and formalization of RSS are largely an unexplored topic of study. In this paper, we present an introduction to RSS, one that we expect will bridge between different research communities and pave the way to a logical theory of RSS, its mathematical formalization, and software tools of practical use.

* 10 pages

Via

Access Paper or Ask Questions

Fibrational Initial Algebra-Final Coalgebra Coincidence over Initial Algebras: Turning Verification Witnesses Upside Down

May 11, 2021

Mayuko Kori, Ichiro Hasuo, Shin-ya Katsumata

Figure 1 for Fibrational Initial Algebra-Final Coalgebra Coincidence over Initial Algebras: Turning Verification Witnesses Upside Down

Figure 2 for Fibrational Initial Algebra-Final Coalgebra Coincidence over Initial Algebras: Turning Verification Witnesses Upside Down

Abstract:The coincidence between initial algebras (IAs) and final coalgebras (FCs) is a phenomenon that underpins various important results in theoretical computer science. In this paper, we identify a general fibrational condition for the IA-FC coincidence, namely in the fiber over an initial algebra in the base category. Identifying (co)algebras in a fiber as (co)inductive predicates, our fibrational IA-FC coincidence allows one to use coinductive witnesses (such as invariants) for verifying inductive properties (such as liveness). Our general fibrational theory features the technical condition of stability of chain colimits; we extend the framework to the presence of a monadic effect, too, restricting to fibrations of complete lattice-valued predicates. Practical benefits of our categorical theory are exemplified by new "upside-down" witness notions for three verification problems: probabilistic liveness, and acceptance and model-checking with respect to bottom-up tree automata.

* 37 pages

Via

Access Paper or Ask Questions

Control-Data Separation and Logical Condition Propagation for Efficient Inference on Probabilistic Programs

Jan 28, 2021

Ichiro Hasuo, Yuichiro Oyabu, Clovis Eberhart, Kohei Suenaga, Kenta Cho, Shin-ya Katsumata

Figure 1 for Control-Data Separation and Logical Condition Propagation for Efficient Inference on Probabilistic Programs

Figure 2 for Control-Data Separation and Logical Condition Propagation for Efficient Inference on Probabilistic Programs

Figure 3 for Control-Data Separation and Logical Condition Propagation for Efficient Inference on Probabilistic Programs

Figure 4 for Control-Data Separation and Logical Condition Propagation for Efficient Inference on Probabilistic Programs

Abstract:We introduce a novel sampling algorithm for Bayesian inference on imperative probabilistic programs. It features a hierarchical architecture that separates control flows from data: the top-level samples a control flow, and the bottom level samples data values along the control flow picked by the top level. This separation allows us to plug various language-based analysis techniques in probabilistic program sampling; specifically, we use logical backward propagation of observations for sampling efficiency. We implemented our algorithm on top of Anglican. The experimental results demonstrate our algorithm's efficiency, especially for programs with while loops and rare observations.

* 11 pages with appendices

Via

Access Paper or Ask Questions

Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Nov 26, 2020

Sanghwa Lee, Jaeyoung Lee, Ichiro Hasuo

Figure 1 for Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Figure 2 for Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Figure 3 for Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Figure 4 for Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Abstract:Prioritized experience replay (PER) samples important transitions, rather than uniformly, to improve the performance of a deep reinforcement learning agent. We claim that such prioritization has to be balanced with sample diversity for making the DQN stabilized and preventing forgetting. Our proposed improvement over PER, called Predictive PER (PPER), takes three countermeasures (TDInit, TDClip, TDPred) to (i) eliminate priority outliers and explosions and (ii) improve the sample diversity and distributions, weighted by priorities, both leading to stabilizing the DQN. The most notable among the three is the introduction of the second DNN called TDPred to generalize the in-distribution priorities. Ablation study and full experiments with Atari games show that each countermeasure by its own way and PPER contribute to successfully enhancing stability and thus performance over PER.

* Presented at Deep Reinforcement Learning Workshop, NeurIPS 2020

Via

Access Paper or Ask Questions