Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Parth Shah

On Adversarial Robustness and Out-of-Distribution Robustness of Large Language Models

Dec 13, 2024

April Yang, Jordan Tab, Parth Shah, Paul Kotchavong

Abstract:The increasing reliance on large language models (LLMs) for diverse applications necessitates a thorough understanding of their robustness to adversarial perturbations and out-of-distribution (OOD) inputs. In this study, we investigate the correlation between adversarial robustness and OOD robustness in LLMs, addressing a critical gap in robustness evaluation. By applying methods originally designed to improve one robustness type across both contexts, we analyze their performance on adversarial and out-of-distribution benchmark datasets. The input of the model consists of text samples, with the output prediction evaluated in terms of accuracy, precision, recall, and F1 scores in various natural language inference tasks. Our findings highlight nuanced interactions between adversarial robustness and OOD robustness, with results indicating limited transferability between the two robustness types. Through targeted ablations, we evaluate how these correlations evolve with different model sizes and architectures, uncovering model-specific trends: smaller models like LLaMA2-7b exhibit neutral correlations, larger models like LLaMA2-13b show negative correlations, and Mixtral demonstrates positive correlations, potentially due to domain-specific alignment. These results underscore the importance of hybrid robustness frameworks that integrate adversarial and OOD strategies tailored to specific models and domains. Further research is needed to evaluate these interactions across larger models and varied architectures, offering a pathway to more reliable and generalizable LLMs.

Via

Access Paper or Ask Questions

Adaptive Fine-Grained Sketch-Based Image Retrieval

Jul 06, 2022

Ayan Kumar Bhunia, Aneeshan Sain, Parth Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

Figure 1 for Adaptive Fine-Grained Sketch-Based Image Retrieval

Figure 2 for Adaptive Fine-Grained Sketch-Based Image Retrieval

Figure 3 for Adaptive Fine-Grained Sketch-Based Image Retrieval

Figure 4 for Adaptive Fine-Grained Sketch-Based Image Retrieval

Abstract:The recent focus on Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) has shifted towards generalising a model to new categories without any training data from them. In real-world applications, however, a trained FG-SBIR model is often applied to both new categories and different human sketchers, i.e., different drawing styles. Although this complicates the generalisation problem, fortunately, a handful of examples are typically available, enabling the model to adapt to the new category/style. In this paper, we offer a novel perspective -- instead of asking for a model that generalises, we advocate for one that quickly adapts, with just very few samples during testing (in a few-shot manner). To solve this new problem, we introduce a novel model-agnostic meta-learning (MAML) based framework with several key modifications: (1) As a retrieval task with a margin-based contrastive loss, we simplify the MAML training in the inner loop to make it more stable and tractable. (2) The margin in our contrastive loss is also meta-learned with the rest of the model. (3) Three additional regularisation losses are introduced in the outer loop, to make the meta-learned FG-SBIR model more effective for category/style adaptation. Extensive experiments on public datasets suggest a large gain over generalisation and zero-shot based approaches, and a few strong few-shot baselines.

* Accepted in ECCV 2022

Via

Access Paper or Ask Questions

Knowledge Base Inference for Regular Expression Queries

May 01, 2020

Vaibhav Adlakha, Parth Shah, Srikanta Bedathur, Mausam

Figure 1 for Knowledge Base Inference for Regular Expression Queries

Figure 2 for Knowledge Base Inference for Regular Expression Queries

Figure 3 for Knowledge Base Inference for Regular Expression Queries

Figure 4 for Knowledge Base Inference for Regular Expression Queries

Abstract:Two common types of tasks on Knowledge Bases have been studied -- single link prediction (Knowledge Base Completion) and path query answering. However, our analysis of user queries on a real-world knowledge base reveals that a significant fraction of queries specify paths using regular expressions(regex). Such regex queries cannot be handled by any of the existing link prediction or path query answering models. In response, we present Regex Query Answering, the novel task of answering regex queries on incomplete KBs. We contribute two datasets for the task, including one where test queries are harvested from actual user querylogs. We train baseline neural models for our new task and propose novel ways to handle disjunction and Kleene plus regex operators.

Via

Access Paper or Ask Questions

Neural Machine Translation System of Indic Languages -- An Attention based Approach

Feb 02, 2020

Parth Shah, Vishvajit Bakrola

Figure 1 for Neural Machine Translation System of Indic Languages -- An Attention based Approach

Figure 2 for Neural Machine Translation System of Indic Languages -- An Attention based Approach

Figure 3 for Neural Machine Translation System of Indic Languages -- An Attention based Approach

Figure 4 for Neural Machine Translation System of Indic Languages -- An Attention based Approach

Abstract:Neural machine translation (NMT) is a recent and effective technique which led to remarkable improvements in comparison of conventional machine translation techniques. Proposed neural machine translation model developed for the Gujarati language contains encoder-decoder with attention mechanism. In India, almost all the languages are originated from their ancestral language - Sanskrit. They are having inevitable similarities including lexical and named entity similarity. Translating into Indic languages is always be a challenging task. In this paper, we have presented the neural machine translation system (NMT) that can efficiently translate Indic languages like Hindi and Gujarati that together covers more than 58.49 percentage of total speakers in the country. We have compared the performance of our NMT model with automatic evaluation matrices such as BLEU, perplexity and TER matrix. The comparison of our network with Google translate is also presented where it outperformed with a margin of 6 BLEU score on English-Gujarati translation.

* 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP), Gangtok, India, 2019, pp. 1-5

Via

Access Paper or Ask Questions

Nonlinear Semi-Parametric Models for Survival Analysis

May 14, 2019

Chirag Nagpal, Rohan Sangave, Amit Chahar, Parth Shah, Artur Dubrawski, Bhiksha Raj

Figure 1 for Nonlinear Semi-Parametric Models for Survival Analysis

Figure 2 for Nonlinear Semi-Parametric Models for Survival Analysis

Figure 3 for Nonlinear Semi-Parametric Models for Survival Analysis

Figure 4 for Nonlinear Semi-Parametric Models for Survival Analysis

Abstract:Semi-parametric survival analysis methods like the Cox Proportional Hazards (CPH) regression (Cox, 1972) are a popular approach for survival analysis. These methods involve fitting of the log-proportional hazard as a function of the covariates and are convenient as they do not require estimation of the baseline hazard rate. Recent approaches have involved learning non-linear representations of the input covariates and demonstrate improved performance. In this paper we argue against such deep parameterizations for survival analysis and experimentally demonstrate that more interpretable semi-parametric models inspired from mixtures of experts perform equally well or in some cases better than such overly parameterized deep models.

Via

Access Paper or Ask Questions

Optimal Approach for Image Recognition using Deep Convolutional Architecture

Apr 25, 2019

Parth Shah, Vishvajit Bakrola, Supriya Pati

Figure 1 for Optimal Approach for Image Recognition using Deep Convolutional Architecture

Figure 2 for Optimal Approach for Image Recognition using Deep Convolutional Architecture

Figure 3 for Optimal Approach for Image Recognition using Deep Convolutional Architecture

Figure 4 for Optimal Approach for Image Recognition using Deep Convolutional Architecture

Abstract:In the recent time deep learning has achieved huge popularity due to its performance in various machine learning algorithms. Deep learning as hierarchical or structured learning attempts to model high level abstractions in data by using a group of processing layers. The foundation of deep learning architectures is inspired by the understanding of information processing and neural responses in human brain. The architectures are created by stacking multiple linear or non-linear operations. The article mainly focuses on the state-of-art deep learning models and various real world applications specific training methods. Selecting optimal architecture for specific problem is a challenging task, at a closing stage of the article we proposed optimal approach to deep convolutional architecture for the application of image recognition.

Via

Access Paper or Ask Questions

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Mar 08, 2019

Michelle A. Lee, Yuke Zhu, Krishnan Srinivasan, Parth Shah, Silvio Savarese, Li Fei-Fei, Animesh Garg, Jeannette Bohg

Figure 1 for Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Figure 2 for Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Figure 3 for Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Figure 4 for Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Abstract:Contact-rich manipulation tasks in unstructured environments often require both haptic and visual feedback. However, it is non-trivial to manually design a robot controller that combines modalities with very different characteristics. While deep reinforcement learning has shown success in learning control policies for high-dimensional inputs, these algorithms are generally intractable to deploy on real robots due to sample complexity. We use self-supervision to learn a compact and multimodal representation of our sensory inputs, which can then be used to improve the sample efficiency of our policy learning. We evaluate our method on a peg insertion task, generalizing over different geometry, configurations, and clearances, while being robust to external perturbations. Results for simulated and real robot experiments are presented.

* ICRA 2019

Via

Access Paper or Ask Questions

Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Jul 24, 2018

Lin Shao, Parth Shah, Vikranth Dwaracherla, Jeannette Bohg

Figure 1 for Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Figure 2 for Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Figure 3 for Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Figure 4 for Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Abstract:Given two consecutive RGB-D images, we propose a model that estimates a dense 3D motion field, also known as scene flow. We take advantage of the fact that in robot manipulation scenarios, scenes often consist of a set of rigidly moving objects. Our model jointly estimates (i) the segmentation of the scene into an unknown but finite number of objects, (ii) the motion trajectories of these objects and (iii) the object scene flow. We employ an hourglass, deep neural network architecture. In the encoding stage, the RGB and depth images undergo spatial compression and correlation. In the decoding stage, the model outputs three images containing a per-pixel estimate of the corresponding object center as well as object translation and rotation. This forms the basis for inferring the object segmentation and final object scene flow. To evaluate our model, we generated a new and challenging, large-scale, synthetic dataset that is specifically targeted at robotic manipulation: It contains a large number of scenes with a very diverse set of simultaneously moving 3D objects and is recorded with a simulated, static RGB-D camera. In quantitative experiments, we show that we outperform state-of-the-art scene flow and motion-segmentation methods on this data set. In qualitative experiments, we show how our learned model transfers to challenging real-world scenes, visually generating better results than existing methods.

* Accepted to IEEE Robotics and Automation Letters and selected by IROS'18 Program Committee for presentation at the Conference

Via

Access Paper or Ask Questions

Image Captioning using Deep Neural Architectures

Jan 17, 2018

Parth Shah, Vishvajit Bakarola, Supriya Pati

Figure 1 for Image Captioning using Deep Neural Architectures

Figure 2 for Image Captioning using Deep Neural Architectures

Figure 3 for Image Captioning using Deep Neural Architectures

Figure 4 for Image Captioning using Deep Neural Architectures

Abstract:Automatically creating the description of an image using any natural languages sentence like English is a very challenging task. It requires expertise of both image processing as well as natural language processing. This paper discuss about different available models for image captioning task. We have also discussed about how the advancement in the task of object recognition and machine translation has greatly improved the performance of image captioning model in recent years. In addition to that we have discussed how this model can be implemented. In the end, we have also evaluated the performance of model using standard evaluation matrices.

* Pre-print version of paper accepted at 2017 International Conference on Innovations in information Embedded and Communication Systems (ICIIECS)

Via

Access Paper or Ask Questions