Giulio Antoniol

How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review

Aug 03, 2021

Florian Tambon, Gabriel Laberge, Le An, Amin Nikanjam, Paulina Stevia Nouwou Mindom, Yann Pequignot, Foutse Khomh, Giulio Antoniol, Ettore Merlo, François Laviolette

Abstract: Context: Machine Learning (ML) has been at the heart of many innovations over the past years. However, including it in so-called 'safety-critical' systems, such as automotive or aeronautic systems, has proven to be very challenging, since the shift in paradigm that ML brings completely changes traditional certification approaches. Objective: This paper aims to elucidate challenges related to the certification of ML-based safety-critical systems, as well as the solutions that are proposed in the literature to tackle them, answering the question 'How to Certify Machine Learning Based Safety-critical Systems?'. Method: We conduct a Systematic Literature Review (SLR) of research papers published between 2015 and 2020, covering topics related to the certification of ML systems. In total, we identified 217 papers covering topics considered to be the main pillars of ML certification: Robustness, Uncertainty, Explainability, Verification, Safe Reinforcement Learning, and Direct Certification. We analyzed the main trends and problems of each sub-field and provided summaries of the papers extracted. Results: The SLR results highlighted the enthusiasm of the community for this subject, as well as the lack of diversity in terms of datasets and types of models. It also emphasized the need to further develop connections between academia and industry to deepen the domain study. Finally, it illustrated the necessity of building connections between the above-mentioned main pillars, which are for now mainly studied separately. Conclusion: We highlight current efforts deployed to enable the certification of ML-based software systems, and discuss some future research directions.

* 72 pages (90 pages with references), submitted to a journal (Automated Software Engineering). Changes: added the final quality-control question process of the systematic literature review, plus minor changes

HOMRS: High Order Metamorphic Relations Selector for Deep Neural Networks

Jul 10, 2021

Florian Tambon, Giulio Antoniol, Foutse Khomh

Abstract: Deep Neural Network (DNN) applications are increasingly becoming a part of our everyday life, from medical applications to autonomous cars. Traditional validation of DNNs relies on accuracy measures; however, the existence of adversarial examples has highlighted the limitations of these accuracy measures, raising concerns especially when DNNs are integrated into safety-critical systems. In this paper, we present HOMRS, an approach to boost metamorphic testing by automatically building a small optimized set of high order metamorphic relations from an initial set of elementary metamorphic relations. HOMRS' backbone is a multi-objective search; it exploits ideas drawn from traditional systems testing such as code coverage, test case, and path diversity. We applied HOMRS to the LeNet5 DNN with the MNIST dataset and we report evidence that it builds a small but effective set of high order transformations achieving a 95% kill ratio. Five raters manually labeled a pool of images before and after high order transformation; Fleiss' Kappa and statistical tests confirmed that they are metamorphic properties. HOMRS built-in relations are also effective at confronting adversarial or out-of-distribution examples; HOMRS detected 92% of randomly sampled out-of-distribution images. HOMRS transformations are also suitable for online real-time use.

* 19 pages, 2 figures
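
To make the idea of a high order metamorphic relation in the abstract above more concrete, here is a minimal sketch. It is not the HOMRS implementation: the transformations (`shift_right`, `brighten`), the `toy_model` classifier, and the `relation_holds` helper are all hypothetical stand-ins, and the multi-objective search that HOMRS uses to select relations is not shown. The sketch only illustrates composing elementary transformations into a higher order one and checking whether a model's prediction stays unchanged.

```python
from functools import reduce
from typing import Callable, List

Image = List[List[float]]            # tiny grayscale image, pixel values in [0, 1]
Transform = Callable[[Image], Image]


def shift_right(img: Image) -> Image:
    """Elementary transformation (assumed): translate one pixel right, zero-padded."""
    return [[0.0] + row[:-1] for row in img]


def brighten(img: Image, delta: float = 0.1) -> Image:
    """Elementary transformation (assumed): add delta to every pixel, clipped at 1.0."""
    return [[min(1.0, p + delta) for p in row] for row in img]


def compose(*transforms: Transform) -> Transform:
    """Build a high order transformation by applying elementary ones in order."""
    return lambda img: reduce(lambda acc, t: t(acc), transforms, img)


def toy_model(img: Image) -> int:
    """Hypothetical stand-in for a DNN: class 1 if total intensity >= 2.0, else 0."""
    return 1 if sum(sum(row) for row in img) >= 2.0 else 0


def relation_holds(model: Callable[[Image], int], transform: Transform, img: Image) -> bool:
    """Metamorphic check: the prediction should not change under the transformation."""
    return model(transform(img)) == model(img)


image = [
    [0.0, 0.9, 0.0],
    [0.0, 0.9, 0.0],
    [0.0, 0.0, 0.0],
]

high_order = compose(shift_right, brighten)

# The translation alone preserves the prediction of this toy model,
# while the composed transformation changes it (a detected violation).
print(relation_holds(toy_model, shift_right, image))
print(relation_holds(toy_model, high_order, image))
```

In this toy setting, a violated relation flags an input on which the model behaves inconsistently. A selection procedure in the spirit of the abstract would then have to choose which compositions to keep, which this sketch deliberately leaves out.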
