Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hardik Goel

Time Series Deinterleaving of DNS Traffic

Jul 16, 2018

Amir Asiaee, Hardik Goel, Shalini Ghosh, Vinod Yegneswaran, Arindam Banerjee

Figure 1 for Time Series Deinterleaving of DNS Traffic

Figure 2 for Time Series Deinterleaving of DNS Traffic

Figure 3 for Time Series Deinterleaving of DNS Traffic

Figure 4 for Time Series Deinterleaving of DNS Traffic

Abstract:Stream deinterleaving is an important problem with various applications in the cybersecurity domain. In this paper, we consider the specific problem of deinterleaving DNS data streams using machine-learning techniques, with the objective of automating the extraction of malware domain sequences. We first develop a generative model for user request generation and DNS stream interleaving. Based on these we evaluate various inference strategies for deinterleaving including augmented HMMs and LSTMs on synthetic datasets. Our results demonstrate that state-of-the-art LSTMs outperform more traditional augmented HMMs in this application domain.

Via

Access Paper or Ask Questions

R2N2: Residual Recurrent Neural Networks for Multivariate Time Series Forecasting

Sep 10, 2017

Hardik Goel, Igor Melnyk, Arindam Banerjee

Figure 1 for R2N2: Residual Recurrent Neural Networks for Multivariate Time Series Forecasting

Figure 2 for R2N2: Residual Recurrent Neural Networks for Multivariate Time Series Forecasting

Figure 3 for R2N2: Residual Recurrent Neural Networks for Multivariate Time Series Forecasting

Figure 4 for R2N2: Residual Recurrent Neural Networks for Multivariate Time Series Forecasting

Abstract:Multivariate time-series modeling and forecasting is an important problem with numerous applications. Traditional approaches such as VAR (vector auto-regressive) models and more recent approaches such as RNNs (recurrent neural networks) are indispensable tools in modeling time-series data. In many multivariate time series modeling problems, there is usually a significant linear dependency component, for which VARs are suitable, and a nonlinear component, for which RNNs are suitable. Modeling such times series with only VAR or only RNNs can lead to poor predictive performance or complex models with large training times. In this work, we propose a hybrid model called R2N2 (Residual RNN), which first models the time series with a simple linear model (like VAR) and then models its residual errors using RNNs. R2N2s can be trained using existing algorithms for VARs and RNNs. Through an extensive empirical evaluation on two real world datasets (aviation and climate domains), we show that R2N2 is competitive, usually better than VAR or RNN, used alone. We also show that R2N2 is faster to train as compared to an RNN, while requiring less number of hidden units.

Via

Access Paper or Ask Questions