Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hatem Hajri

Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey

Mar 01, 2024

Lucas Schott, Josephine Delas, Hatem Hajri, Elies Gherbi, Reda Yaich, Nora Boulahia-Cuppens, Frederic Cuppens, Sylvain Lamprier

Abstract:Deep Reinforcement Learning (DRL) is an approach for training autonomous agents across various complex environments. Despite its significant performance in well known environments, it remains susceptible to minor conditions variations, raising concerns about its reliability in real-world applications. To improve usability, DRL must demonstrate trustworthiness and robustness. A way to improve robustness of DRL to unknown changes in the conditions is through Adversarial Training, by training the agent against well suited adversarial attacks on the dynamics of the environment. Addressing this critical issue, our work presents an in-depth analysis of contemporary adversarial attack methodologies, systematically categorizing them and comparing their objectives and operational mechanisms. This classification offers a detailed insight into how adversarial attacks effectively act for evaluating the resilience of DRL agents, thereby paving the way for enhancing their robustness.

* 57 pages, 16 figues, 2 tables

Via

Access Paper or Ask Questions

SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models

May 23, 2023

Martin Gonzalez, Nelson Fernandez, Thuy Tran, Elies Gherbi, Hatem Hajri, Nader Masmoudi

Abstract:A potent class of generative models known as Diffusion Probabilistic Models (DPMs) has become prominent. A forward diffusion process adds gradually noise to data, while a model learns to gradually denoise. Sampling from pre-trained DPMs is obtained by solving differential equations (DE) defined by the learnt model, a process which has shown to be prohibitively slow. Numerous efforts on speeding-up this process have consisted on crafting powerful ODE solvers. Despite being quick, such solvers do not usually reach the optimal quality achieved by available slow SDE solvers. Our goal is to propose SDE solvers that reach optimal quality without requiring several hundreds or thousands of NFEs to achieve that goal. In this work, we propose Stochastic Exponential Derivative-free Solvers (SEEDS), improving and generalizing Exponential Integrator approaches to the stochastic case on several frameworks. After carefully analyzing the formulation of exact solutions of diffusion SDEs, we craft SEEDS to analytically compute the linear part of such solutions. Inspired by the Exponential Time-Differencing method, SEEDS uses a novel treatment of the stochastic components of solutions, enabling the analytical computation of their variance, and contains high-order terms allowing to reach optimal quality sampling $\sim3$-$5\times$ faster than previous SDE methods. We validate our approach on several image generation benchmarks, showing that SEEDS outperforms or is competitive with previous SDE solvers. Contrary to the latter, SEEDS are derivative and training free, and we fully prove strong convergence guarantees for them.

* 52 pages. Comments are welcome!

Via

Access Paper or Ask Questions

Riemannian data-dependent randomized smoothing for neural networks certification

Jun 21, 2022

Pol Labarbarie, Hatem Hajri, Marc Arnaudon

Figure 1 for Riemannian data-dependent randomized smoothing for neural networks certification

Figure 2 for Riemannian data-dependent randomized smoothing for neural networks certification

Abstract:Certification of neural networks is an important and challenging problem that has been attracting the attention of the machine learning community since few years. In this paper, we focus on randomized smoothing (RS) which is considered as the state-of-the-art method to obtain certifiably robust neural networks. In particular, a new data-dependent RS technique called ANCER introduced recently can be used to certify ellipses with orthogonal axis near each input data of the neural network. In this work, we remark that ANCER is not invariant under rotation of input data and propose a new rotationally-invariant formulation of it which can certify ellipses without constraints on their axis. Our approach called Riemannian Data Dependant Randomized Smoothing (RDDRS) relies on information geometry techniques on the manifold of covariance matrices and can certify bigger regions than ANCER based on our experiments on the MNIST dataset.

* Accepted at ICLM 2022 Workshop "AdvML Frontiers"

Via

Access Paper or Ask Questions

Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

Jun 16, 2022

Martin Gonzalez, Hatem Hajri, Loic Cantat, Mihaly Petreczky

Figure 1 for Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

Figure 2 for Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

Figure 3 for Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

Figure 4 for Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

Abstract:We investigate the problems and challenges of evaluating the robustness of Differential Equation-based (DE) networks against synthetic distribution shifts. We propose a novel and simple accuracy metric which can be used to evaluate intrinsic robustness and to validate dataset corruption simulators. We also propose methodology recommendations, destined for evaluating the many faces of neural DEs' robustness and for comparing them with their discrete counterparts rigorously. We then use this criteria to evaluate a cheap data augmentation technique as a reliable way for demonstrating the natural robustness of neural ODEs against simulated image corruptions across multiple datasets.

* Accepted at ICLM 2022 Workshop "PODS"

Via

Access Paper or Ask Questions

Realization Theory Of Recurrent Neural ODEs Using Polynomial System Embeddings

May 24, 2022

Martin Gonzalez, Thibault Defourneau, Hatem Hajri, Mihaly Petreczky

Abstract:In this paper we show that neural ODE analogs of recurrent (ODE-RNN) and Long Short-Term Memory (ODE-LSTM) networks can be algorithmically embeddeded into the class of polynomial systems. This embedding preserves input-output behavior and can suitably be extended to other neural DE architectures. We then use realization theory of polynomial systems to provide necessary conditions for an input-output map to be realizable by an ODE-LSTM and sufficient conditions for minimality of such systems. These results represent the first steps towards realization theory of recurrent neural ODE architectures, which is is expected be useful for model reduction and learning algorithm analysis of recurrent neural ODEs.

* 10 pages. Comments are welcome!

Via

Access Paper or Ask Questions

Improving Robustness of Deep Reinforcement Learning Agents: Environment Attacks based on Critic Networks

Apr 07, 2021

Lucas Schott, Manon Césaire, Hatem Hajri, Sylvain Lamprier

Figure 1 for Improving Robustness of Deep Reinforcement Learning Agents: Environment Attacks based on Critic Networks

Figure 2 for Improving Robustness of Deep Reinforcement Learning Agents: Environment Attacks based on Critic Networks

Figure 3 for Improving Robustness of Deep Reinforcement Learning Agents: Environment Attacks based on Critic Networks

Figure 4 for Improving Robustness of Deep Reinforcement Learning Agents: Environment Attacks based on Critic Networks

Abstract:To improve policy robustness of deep reinforcement learning agents, a line of recent works focus on producing disturbances of the environment. Existing approaches of the literature to generate meaningful disturbances of the environment are adversarial reinforcement learning methods. These methods set the problem as a two-player game between the protagonist agent, which learns to perform a task in an environment, and the adversary agent, which learns to disturb the protagonist via modifications of the considered environment. Both protagonist and adversary are trained with deep reinforcement learning algorithms. Alternatively, we propose in this paper to build on gradient-based adversarial attacks, usually used for classification tasks for instance, that we apply on the critic network of the protagonist to identify efficient disturbances of the environment. Rather than learning an attacker policy, which usually reveals as very complex and unstable, we leverage the knowledge of the critic network of the protagonist, to dynamically complexify the task at each step of the learning process. We show that our method, while being faster and lighter, leads to significantly better improvements in policy robustness than existing methods of the literature.

Via

Access Paper or Ask Questions

Stochastic sparse adversarial attacks

Nov 24, 2020

Hatem Hajri, Manon Césaire, Théo Combey, Sylvain Lamprier, Patrick Gallinari

Figure 1 for Stochastic sparse adversarial attacks

Figure 2 for Stochastic sparse adversarial attacks

Figure 3 for Stochastic sparse adversarial attacks

Figure 4 for Stochastic sparse adversarial attacks

Abstract:Adversarial attacks of neural network classifiers (NNC) and the use of random noises in these methods have stimulated a large number of works in recent years. However, despite all the previous investigations, existing approaches that rely on random noises to fool NNC have fallen far short of the-state-of-the-art adversarial methods performances. In this paper, we fill this gap by introducing stochastic sparse adversarial attacks (SSAA), standing as simple, fast and purely noise-based targeted and untargeted attacks of NNC. SSAA offer new examples of sparse (or $L_0$) attacks for which only few methods have been proposed previously. These attacks are devised by exploiting a small-time expansion idea widely used for Markov processes. Experiments on small and large datasets (CIFAR-10 and ImageNet) illustrate several advantages of SSAA in comparison with the-state-of-the-art methods. For instance, in the untargeted case, our method called voting folded Gaussian attack (VFGA) scales efficiently to ImageNet and achieves a significantly lower $L_0$ score than SparseFool (up to $\frac{1}{14}$ lower) while being faster. In the targeted setting, VFGA achives appealing results on ImageNet and is significantly much faster than Carlini-Wagner $L_0$ attack.

* 19 pages

Via

Access Paper or Ask Questions

Probabilistic Jacobian-based Saliency Maps Attacks

Jul 12, 2020

António Loison, Théo Combey, Hatem Hajri

Figure 1 for Probabilistic Jacobian-based Saliency Maps Attacks

Figure 2 for Probabilistic Jacobian-based Saliency Maps Attacks

Figure 3 for Probabilistic Jacobian-based Saliency Maps Attacks

Figure 4 for Probabilistic Jacobian-based Saliency Maps Attacks

Abstract:Machine learning models have achieved spectacular performances in various critical fields including intelligent monitoring, autonomous driving and malware detection. Therefore, robustness against adversarial attacks represents a key issue to trust these models. In particular, the Jacobian-based Saliency Map Attack (JSMA) is widely used to fool neural network classifiers. In this paper, we introduce Weighted JSMA (WJSMA) and Taylor JSMA (TJSMA), simple, faster and more efficient versions of JSMA. These attacks rely upon new saliency maps involving the neural network Jacobian, its output probabilities and the input features. We demonstrate the advantages of WJSMA and TJSMA through two computer vision applications on 1) LeNet-5, a well-known Neural Network classifier (NNC), on the MNIST database and on 2) a more challenging NNC on the CIFAR-10 dataset. We obtain that WJSMA and TJSMA significantly outperform JSMA in success rate, speed and average number of changed features. For instance, on LeNet-5 (with $100\%$ and $99.49\%$ accuracies on the training and test sets), WJSMA and TJSMA respectively exceed $97\%$ and $98.60\%$ in success rate for a maximum authorised distortion of $14.5\%$, outperforming JSMA with more than $9.5$ and $11$ percentage points. The new attacks are then used to defend and create more robust models than those trained against JSMA. Like JSMA, our attacks are not scalable on large datasets such as IMAGENET but despite this fact, they remain attractive for relatively small datasets like MNIST, CIFAR-10 and may be potential tools for future applications.

* 21 pages

Via

Access Paper or Ask Questions

Geomstats: A Python Package for Riemannian Geometry in Machine Learning

Apr 07, 2020

Nina Miolane, Alice Le Brigant, Johan Mathe, Benjamin Hou, Nicolas Guigui, Yann Thanwerdas, Stefan Heyder, Olivier Peltre, Niklas Koep, Hadi Zaatiti(+9 more)

Figure 1 for Geomstats: A Python Package for Riemannian Geometry in Machine Learning

Figure 2 for Geomstats: A Python Package for Riemannian Geometry in Machine Learning

Figure 3 for Geomstats: A Python Package for Riemannian Geometry in Machine Learning

Figure 4 for Geomstats: A Python Package for Riemannian Geometry in Machine Learning

Abstract:We introduce Geomstats, an open-source Python toolbox for computations and statistics on nonlinear manifolds, such as hyperbolic spaces, spaces of symmetric positive definite matrices, Lie groups of transformations, and many more. We provide object-oriented and extensively unit-tested implementations. Among others, manifolds come equipped with families of Riemannian metrics, with associated exponential and logarithmic maps, geodesics and parallel transport. Statistics and learning algorithms provide methods for estimation, clustering and dimension reduction on manifolds. All associated operations are vectorized for batch computation and provide support for different execution backends, namely NumPy, PyTorch and TensorFlow, enabling GPU acceleration. This paper presents the package, compares it with related libraries and provides relevant code examples. We show that Geomstats provides reliable building blocks to foster research in differential geometry and statistics, and to democratize the use of Riemannian geometry in machine learning applications. The source code is freely available under the MIT license at \url{geomstats.ai}.

Via

Access Paper or Ask Questions

FRSign: A Large-Scale Traffic Light Dataset for Autonomous Trains

Feb 05, 2020

Jeanine Harb, Nicolas Rébéna, Raphaël Chosidow, Grégoire Roblin, Roman Potarusov, Hatem Hajri

Figure 1 for FRSign: A Large-Scale Traffic Light Dataset for Autonomous Trains

Figure 2 for FRSign: A Large-Scale Traffic Light Dataset for Autonomous Trains

Figure 3 for FRSign: A Large-Scale Traffic Light Dataset for Autonomous Trains

Figure 4 for FRSign: A Large-Scale Traffic Light Dataset for Autonomous Trains

Abstract:In the realm of autonomous transportation, there have been many initiatives for open-sourcing self-driving cars datasets, but much less for alternative methods of transportation such as trains. In this paper, we aim to bridge the gap by introducing FRSign, a large-scale and accurate dataset for vision-based railway traffic light detection and recognition. Our recordings were made on selected running trains in France and benefited from carefully hand-labeled annotations. An illustrative dataset which corresponds to ten percent of the acquired data to date is published in open source with the paper. It contains more than 100,000 images illustrating six types of French railway traffic lights and their possible color combinations, together with the relevant information regarding their acquisition such as date, time, sensor parameters, and bounding boxes. This dataset is published in open-source at the address \url{https://frsign.irt-systemx.fr}. We compare, analyze various properties of the dataset and provide metrics to express its variability. We also discuss specific challenges and particularities related to autonomous trains in comparison to autonomous cars.

Via

Access Paper or Ask Questions