Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sukriti Verma

SARC: Soft Actor Retrospective Critic

Jun 28, 2023

Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

Figure 1 for SARC: Soft Actor Retrospective Critic

Figure 2 for SARC: Soft Actor Retrospective Critic

Figure 3 for SARC: Soft Actor Retrospective Critic

Figure 4 for SARC: Soft Actor Retrospective Critic

Abstract:The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence. Since gradient estimates depend upon the critic, we posit that improving the critic can provide a better gradient estimate for the actor at each time. Utilizing this, we propose Soft Actor Retrospective Critic (SARC), where we augment the SAC critic loss with another loss term - retrospective loss - leading to faster critic convergence and consequently, better policy gradient estimates for the actor. An existing implementation of SAC can be easily adapted to SARC with minimal modifications. Through extensive experimentation and analysis, we show that SARC provides consistent improvement over SAC on benchmark environments. We plan to open-source the code and all experiment data at: https://github.com/sukritiverma1996/SARC.

* Accepted at RLDM 2022

Via

Access Paper or Ask Questions

Information-theoretic Evolution of Model Agnostic Global Explanations

May 14, 2021

Sukriti Verma, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

Figure 1 for Information-theoretic Evolution of Model Agnostic Global Explanations

Figure 2 for Information-theoretic Evolution of Model Agnostic Global Explanations

Figure 3 for Information-theoretic Evolution of Model Agnostic Global Explanations

Figure 4 for Information-theoretic Evolution of Model Agnostic Global Explanations

Abstract:Explaining the behavior of black box machine learning models through human interpretable rules is an important research area. Recent work has focused on explaining model behavior locally i.e. for specific predictions as well as globally across the fields of vision, natural language, reinforcement learning and data science. We present a novel model-agnostic approach that derives rules to globally explain the behavior of classification models trained on numerical and/or categorical data. Our approach builds on top of existing local model explanation methods to extract conditions important for explaining model behavior for specific instances followed by an evolutionary algorithm that optimizes an information theory based fitness function to construct rules that explain global model behavior. We show how our approach outperforms existing approaches on a variety of datasets. Further, we introduce a parameter to evaluate the quality of interpretation under the scenario of distributional shift. This parameter evaluates how well the interpretation can predict model behavior for previously unseen data distributions. We show how existing approaches for interpreting models globally lack distributional robustness. Finally, we show how the quality of the interpretation can be improved under the scenario of distributional shift by adding out of distribution samples to the dataset used to learn the interpretation and thereby, increase robustness. All of the datasets used in our paper are open and publicly available. Our approach has been deployed in a leading digital marketing suite of products.

Via

Access Paper or Ask Questions

MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Sep 03, 2020

Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji K

Figure 1 for MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Figure 2 for MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Figure 3 for MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Figure 4 for MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Abstract:Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem. Such imbalanced datasets are standard in real-world situations such as fraud detection, medical diagnosis, and computational advertising. We propose an iterative data augmentation method, MixBoost, which intelligently selects (Boost) and then combines (Mix) instances from the majority and minority classes to generate synthetic hybrid instances that have characteristics of both classes. We evaluate MixBoost on 20 benchmark datasets, show that it outperforms existing approaches, and test its efficacy through significance testing. We also present ablation studies to analyze the impact of the different components of MixBoost.

* Work done as part of internship at MDSR

Via

Access Paper or Ask Questions

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Dec 31, 2019

Piyush Gupta, Nikaash Puri, Sukriti Verma, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Figure 1 for Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Figure 2 for Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Figure 3 for Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Figure 4 for Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Abstract:As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned agents. Saliency maps explain agent behavior by highlighting the features of the input state that are most relevant for the agent in taking an action. Existing perturbation-based approaches to compute saliency often highlight regions of the input that are not relevant to the action taken by the agent. Our approach generates more focused saliency maps by balancing two aspects (specificity and relevance) that capture different desiderata of saliency. The first captures the impact of perturbation on the relative expected reward of the action to be explained. The second downweights irrelevant features that alter the relative expected rewards of actions other than the action to be explained. We compare our approach with existing approaches on agents trained to play board games (Chess and Go) and Atari games (Breakout, Pong and Space Invaders). We show through illustrative examples (Chess, Atari, Go), human studies (Chess), and automated evaluation methods (Chess) that our approach generates saliency maps that are more interpretable for humans than existing approaches.

* Accepted at the International Conference on Learning Representations (ICLR) 2020

Via

Access Paper or Ask Questions

MAGIX: Model Agnostic Globally Interpretable Explanations

Jun 15, 2018

Nikaash Puri, Piyush Gupta, Pratiksha Agarwal, Sukriti Verma, Balaji Krishnamurthy

Figure 1 for MAGIX: Model Agnostic Globally Interpretable Explanations

Figure 2 for MAGIX: Model Agnostic Globally Interpretable Explanations

Figure 3 for MAGIX: Model Agnostic Globally Interpretable Explanations

Figure 4 for MAGIX: Model Agnostic Globally Interpretable Explanations

Abstract:Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, it is also important to understand how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the patterns that it learned. We present here an approach that learns if-then rules to globally explain the behavior of black box machine learning models that have been used to solve classification problems. The approach works by first extracting conditions that were important at the instance level and then evolving rules through a genetic algorithm with an appropriate fitness function. Collectively, these rules represent the patterns followed by the model for decisioning and are useful for understanding its behavior. We demonstrate the validity and usefulness of the approach by interpreting black box models created using publicly available data sets as well as a private digital marketing data set.

Via

Access Paper or Ask Questions

Extractive Summarization using Deep Learning

Aug 15, 2017

Sukriti Verma, Vagisha Nidhi

Figure 1 for Extractive Summarization using Deep Learning

Figure 2 for Extractive Summarization using Deep Learning

Figure 3 for Extractive Summarization using Deep Learning

Figure 4 for Extractive Summarization using Deep Learning

Abstract:This paper proposes a text summarization approach for factual reports using a deep learning model. This approach consists of three phases: feature extraction, feature enhancement, and summary generation, which work together to assimilate core information and generate a coherent, understandable summary. We are exploring various features to improve the set of sentences selected for the summary, and are using a Restricted Boltzmann Machine to enhance and abstract those features to improve resultant accuracy without losing any important information. The sentences are scored based on those enhanced features and an extractive summary is constructed. Experimentation carried out on several articles demonstrates the effectiveness of the proposed approach.

* Accepted to 18th International Conference on Computational Linguistics and Intelligent Text Processing

Via

Access Paper or Ask Questions