Guillaume Sicard

End-to-End Deep Neural Networks and Transfer Learning for Automatic Analysis of Nation-State Malware

Nov 30, 2019

Ishai Rosenberg, Guillaume Sicard, Eli David

Abstract: Malware allegedly developed by nation-states, also known as advanced persistent threats (APTs), is becoming more common. The task of attributing an APT to a specific nation-state, or classifying it into the correct APT family, is challenging for several reasons. First, each nation-state has more than one cyber unit developing such malware, which renders traditional authorship attribution algorithms useless. Second, the dataset of available APTs is still extremely small. Finally, APTs use state-of-the-art evasion techniques, which makes feature extraction difficult. In this paper, we use a deep neural network (DNN) as a classifier for nation-state APT attribution. We record the dynamic behavior of the APT when run in a sandbox and use it as raw input for the neural network, allowing the DNN to learn high-level feature abstractions of the APTs by itself. We also use the same raw features for APT family classification. Finally, we use the feature abstractions learned by the APT family classifier to solve the attribution problem. Using a test set of 1,000 Chinese- and Russian-developed APTs, we achieved an accuracy rate of 98.6%.

* Entropy, Vol. 20, No. 5, pp. 390-401, May 2018
* arXiv admin note: substantial text overlap with arXiv:1711.09666
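
The abstract above mentions reusing the feature abstractions learned by the family classifier for the attribution task. As a rough illustration of that general idea (not the authors' architecture), the sketch below freezes a tiny hidden layer and trains only a new logistic output on top of it. The weights `W_FAMILY` and `B_FAMILY`, the toy samples, and all function names are hypothetical; in practice the frozen layer would come from a trained family classifier rather than being set by hand.

```python
import math

# Hypothetical stand-in for a hidden layer learned by a family classifier.
# These weights are set by hand purely for illustration.
W_FAMILY = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0]]
B_FAMILY = [0.0, 0.0]


def frozen_features(x, weights, biases):
    """Apply the frozen layer: ReLU(W x + b). The weights are never updated."""
    return [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, biases)
    ]


def train_head(feats, labels, epochs=200, lr=0.5):
    """Train a logistic-regression output layer on top of the frozen features."""
    w = [0.0] * len(feats[0])
    c = 0.0
    for _ in range(epochs):
        for f, y in zip(feats, labels):
            z = sum(wi * fi for wi, fi in zip(w, f)) + c
            p = 1.0 / (1.0 + math.exp(-z))
            grad = p - y  # gradient of the log loss with respect to z
            w = [wi - lr * grad * fi for wi, fi in zip(w, f)]
            c -= lr * grad
    return w, c


def predict(f, w, c):
    """Return class 1 if the logit is positive, else class 0."""
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) + c > 0 else 0


# Toy attribution data (assumed): binary input vectors and two class labels.
samples = [[1, 0, 0], [1, 0, 1], [0, 1, 0], [0, 1, 1]]
labels = [0, 0, 1, 1]

feats = [frozen_features(x, W_FAMILY, B_FAMILY) for x in samples]
w, c = train_head(feats, labels)
predictions = [predict(f, w, c) for f in feats]
```

Only the small output layer is fitted here, which is one common reason to reuse learned features when the labeled dataset for the target task is small.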

DeepAPT: Nation-State APT Attribution Using End-to-End Deep Neural Networks

Nov 27, 2017

Ishai Rosenberg, Guillaume Sicard, Eli David

Abstract: In recent years, numerous advanced malware samples, also known as advanced persistent threats (APTs), have allegedly been developed by nation-states. The task of attributing an APT to a specific nation-state is extremely challenging for several reasons. Each nation-state usually has more than one cyber unit developing such advanced malware, which renders traditional authorship attribution algorithms useless. Furthermore, APTs use state-of-the-art evasion techniques, which makes feature extraction difficult. Finally, the dataset of available APTs is extremely small. In this paper, we describe how deep neural networks (DNNs) can be successfully employed for nation-state APT attribution. We use sandbox reports (recording the behavior of the APT when run dynamically) as raw input for the neural network, allowing the DNN to learn high-level feature abstractions of the APTs by itself. Using a test set of 1,000 Chinese- and Russian-developed APTs, we achieved an accuracy rate of 94.6%.

* International Conference on Artificial Neural Networks (ICANN), Springer LNCS, Vol. 10614, pp. 91-99, Alghero, Italy, September, 2017
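
To make "sandbox reports as raw input" more concrete, here is a minimal sketch of one plausible encoding: each report is treated as a bag of words and mapped to a fixed-length binary vector over the most common tokens. The abstract does not specify the encoding, so the whitespace tokenization, the vocabulary size, the sample reports, and the function names are all assumptions made for illustration.

```python
from collections import Counter


def tokenize(report):
    """Split a raw sandbox report into lowercase whitespace-separated tokens."""
    return report.lower().split()


def build_vocabulary(reports, max_words):
    """Map the `max_words` tokens found in the most reports to vector indices."""
    counts = Counter()
    for report in reports:
        counts.update(set(tokenize(report)))  # count each token once per report
    # Sort by descending count, then alphabetically, so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {word: i for i, (word, _) in enumerate(ranked[:max_words])}


def vectorize(report, vocab):
    """Encode a report as a binary vector: 1 if the vocabulary token appears."""
    vec = [0] * len(vocab)
    for token in set(tokenize(report)):
        if token in vocab:
            vec[vocab[token]] = 1
    return vec


# Made-up report snippets; real sandbox reports are far longer.
reports = [
    "CreateFile tmpfile RegSetValue run",
    "CreateFile connect 10.0.0.1 RegSetValue",
    "connect 10.0.0.1 sleep",
]

vocab = build_vocabulary(reports, max_words=4)
vectors = [vectorize(r, vocab) for r in reports]
```

Vectors of this kind could then be fed to a neural network without hand-crafted features, leaving the network to learn higher-level abstractions from the raw tokens.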

Deep Self-Taught Learning for Handwritten Character Recognition

Sep 18, 2010

Frédéric Bastien, Yoshua Bengio, Arnaud Bergeron, Nicolas Boulanger-Lewandowski, Thomas Breuel, Youssouf Chherawala, Moustapha Cisse, Myriam Côté, Dumitru Erhan, Jeremy Eustache (+7 more)

Abstract: Recent theoretical and empirical work in statistical machine learning has demonstrated the importance of learning algorithms for deep architectures, i.e., function classes obtained by composing multiple non-linear transformations. Self-taught learning (exploiting unlabeled examples or examples from other distributions) has already been applied to deep learners, but mostly to show the advantage of unlabeled examples. Here we explore the advantage brought by out-of-distribution examples. For this purpose we developed a powerful generator of stochastic variations and noise processes for character images, including not only affine transformations but also slant, local elastic deformations, changes in thickness, background images, grey level changes, contrast, occlusion, and various types of noise. The out-of-distribution examples are obtained from these highly distorted images or by including examples of object classes different from those in the target test set. We show that deep learners benefit more from out-of-distribution examples than a corresponding shallow learner, at least in the area of handwritten character recognition. In fact, we show that they beat previously published results and reach human-level performance on both handwritten digit classification and 62-class handwritten character recognition.
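
As a small illustration of what a generator of stochastic variations might look like, the sketch below chains three of the distortion types named in the abstract (contrast change, occlusion, and noise) on a greyscale image stored as a list of rows with values in [0, 1]. It is not the authors' generator: the function names, parameter ranges, and image representation are assumptions, and the affine, slant, and elastic transformations are omitted.

```python
import random


def change_contrast(img, factor):
    """Scale pixel values around mid-grey (0.5) and clip to [0, 1]."""
    return [[min(1.0, max(0.0, 0.5 + factor * (p - 0.5))) for p in row]
            for row in img]


def occlude(img, size, rng):
    """Blank out a randomly placed size x size square."""
    height, width = len(img), len(img[0])
    top = rng.randrange(height - size + 1)
    left = rng.randrange(width - size + 1)
    out = [row[:] for row in img]  # copy so the input stays untouched
    for r in range(top, top + size):
        for c in range(left, left + size):
            out[r][c] = 0.0
    return out


def add_salt_and_pepper(img, rate, rng):
    """Replace each pixel by 0.0 or 1.0 with probability `rate`."""
    return [[rng.choice((0.0, 1.0)) if rng.random() < rate else p for p in row]
            for row in img]


def distort(img, rng):
    """Apply a random chain of distortions; the parameter ranges are arbitrary."""
    out = change_contrast(img, rng.uniform(0.5, 1.0))
    out = occlude(out, 2, rng)
    return add_salt_and_pepper(out, 0.05, rng)


# Toy 8x8 "character" image: a bright vertical stroke on a dark background.
image = [[1.0 if c in (3, 4) else 0.0 for c in range(8)] for _ in range(8)]
distorted = distort(image, random.Random(0))
```

Passing a seeded `random.Random` instance keeps each distorted sample reproducible, which is convenient when generating a large augmented training set.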
