Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohit Rajpal

Hessian-Aware Bayesian Optimization for Decision Making Systems

Aug 17, 2023

Mohit Rajpal, Lac Gia Tran, Yehong Zhang, Bryan Kian Hsiang Low

Figure 1 for Hessian-Aware Bayesian Optimization for Decision Making Systems

Figure 2 for Hessian-Aware Bayesian Optimization for Decision Making Systems

Figure 3 for Hessian-Aware Bayesian Optimization for Decision Making Systems

Figure 4 for Hessian-Aware Bayesian Optimization for Decision Making Systems

Abstract:Many approaches for optimizing decision making systems rely on gradient based methods requiring informative feedback from the environment. However, in the case where such feedback is sparse or uninformative, such approaches may result in poor performance. Derivative-free approaches such as Bayesian Optimization mitigate the dependency on the quality of gradient feedback, but are known to scale poorly in the high-dimension setting of complex decision making systems. This problem is exacerbated if the system requires interactions between several actors cooperating to accomplish a shared goal. To address the dimensionality challenge, we propose a compact multi-layered architecture modeling the dynamics of actor interactions through the concept of role. Additionally, we introduce Hessian-aware Bayesian Optimization to efficiently optimize the multi-layered architecture parameterized by a large number of parameters. Experimental results demonstrate that our method (HA-GP-UCB) works effectively on several benchmarks under resource constraints and malformed feedback settings.

* Included important citation

Via

Access Paper or Ask Questions

A Unifying Framework of Bilinear LSTMs

Oct 23, 2019

Mohit Rajpal, Bryan Kian Hsiang Low

Figure 1 for A Unifying Framework of Bilinear LSTMs

Figure 2 for A Unifying Framework of Bilinear LSTMs

Figure 3 for A Unifying Framework of Bilinear LSTMs

Figure 4 for A Unifying Framework of Bilinear LSTMs

Abstract:This paper presents a novel unifying framework of bilinear LSTMs that can represent and utilize the nonlinear interaction of the input features present in sequence datasets for achieving superior performance over a linear LSTM and yet not incur more parameters to be learned. To realize this, our unifying framework allows the expressivity of the linear vs. bilinear terms to be balanced by correspondingly trading off between the hidden state vector size vs. approximation quality of the weight matrix in the bilinear term so as to optimize the performance of our bilinear LSTM, while not incurring more parameters to be learned. We empirically evaluate the performance of our bilinear LSTM in several language-based sequence learning tasks to demonstrate its general applicability.

Via

Access Paper or Ask Questions

Not all bytes are equal: Neural byte sieve for fuzzing

Nov 10, 2017

Mohit Rajpal, William Blum, Rishabh Singh

Figure 1 for Not all bytes are equal: Neural byte sieve for fuzzing

Figure 2 for Not all bytes are equal: Neural byte sieve for fuzzing

Figure 3 for Not all bytes are equal: Neural byte sieve for fuzzing

Figure 4 for Not all bytes are equal: Neural byte sieve for fuzzing

Abstract:Fuzzing is a popular dynamic program analysis technique used to find vulnerabilities in complex software. Fuzzing involves presenting a target program with crafted malicious input designed to cause crashes, buffer overflows, memory errors, and exceptions. Crafting malicious inputs in an efficient manner is a difficult open problem and often the best approach to generating such inputs is through applying uniform random mutations to pre-existing valid inputs (seed files). We present a learning technique that uses neural networks to learn patterns in the input files from past fuzzing explorations to guide future fuzzing explorations. In particular, the neural models learn a function to predict good (and bad) locations in input files to perform fuzzing mutations based on the past mutations and corresponding code coverage information. We implement several neural models including LSTMs and sequence-to-sequence models that can encode variable length input files. We incorporate our models in the state-of-the-art AFL (American Fuzzy Lop) fuzzer and show significant improvements in terms of code coverage, unique code paths, and crashes for various input formats including ELF, PNG, PDF, and XML.

Via

Access Paper or Ask Questions