Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Konda Reddy Mopuri

DermaCon-IN: A Multi-concept Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research

Jun 06, 2025

Shanawaj S Madarkar, Mahajabeen Madarkar, Madhumitha V, Teli Prakash, Konda Reddy Mopuri, Vinaykumar MV, KVL Sathwika, Adarsh Kasturi, Gandla Dilip Raj, PVN Supranitha(+1 more)

Abstract:Artificial intelligence is poised to augment dermatological care by enabling scalable image-based diagnostics. Yet, the development of robust and equitable models remains hindered by datasets that fail to capture the clinical and demographic complexity of real-world practice. This complexity stems from region-specific disease distributions, wide variation in skin tones, and the underrepresentation of outpatient scenarios from non-Western populations. We introduce DermaCon-IN, a prospectively curated dermatology dataset comprising over 5,450 clinical images from approximately 3,000 patients across outpatient clinics in South India. Each image is annotated by board-certified dermatologists with over 240 distinct diagnoses, structured under a hierarchical, etiology-based taxonomy adapted from Rook's classification. The dataset captures a wide spectrum of dermatologic conditions and tonal variation commonly seen in Indian outpatient care. We benchmark a range of architectures including convolutional models (ResNet, DenseNet, EfficientNet), transformer-based models (ViT, MaxViT, Swin), and Concept Bottleneck Models to establish baseline performance and explore how anatomical and concept-level cues may be integrated. These results are intended to guide future efforts toward interpretable and clinically realistic models. DermaCon-IN provides a scalable and representative foundation for advancing dermatology AI in real-world settings.

Via

Access Paper or Ask Questions

Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Nov 05, 2024

Sayanta Adhikari, Rishav Kumar, Konda Reddy Mopuri, Rajalakshmi Pachamuthu

Figure 1 for Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Figure 2 for Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Figure 3 for Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Figure 4 for Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Abstract:Contextual information plays a critical role in object recognition models within computer vision, where changes in context can significantly affect accuracy, underscoring models' dependence on contextual cues. This study investigates how context manipulation influences both model accuracy and feature attribution, providing insights into the reliance of object recognition models on contextual information as understood through the lens of feature attribution methods. We employ a range of feature attribution techniques to decipher the reliance of deep neural networks on context in object recognition tasks. Using the ImageNet-9 and our curated ImageNet-CS datasets, we conduct experiments to evaluate the impact of contextual variations, analyzed through feature attribution methods. Our findings reveal several key insights: (a) Correctly classified images predominantly emphasize object volume attribution over context volume attribution. (b) The dependence on context remains relatively stable across different context modifications, irrespective of classification accuracy. (c) Context change exerts a more pronounced effect on model performance than Context perturbations. (d) Surprisingly, context attribution in `no-information' scenarios is non-trivial. Our research moves beyond traditional methods by assessing the implications of broad-level modifications on object recognition, either in the object or its context.

* Published in ICVGIP 2024

Via

Access Paper or Ask Questions

Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation

Feb 28, 2023

Gaurav Patel, Konda Reddy Mopuri, Qiang Qiu

Abstract:Data-free Knowledge Distillation (DFKD) has gained popularity recently, with the fundamental idea of carrying out knowledge transfer from a Teacher neural network to a Student neural network in the absence of training data. However, in the Adversarial DFKD framework, the student network's accuracy, suffers due to the non-stationary distribution of the pseudo-samples under multiple generator updates. To this end, at every generator update, we aim to maintain the student's performance on previously encountered examples while acquiring knowledge from samples of the current distribution. Thus, we propose a meta-learning inspired framework by treating the task of Knowledge-Acquisition (learning from newly generated samples) and Knowledge-Retention (retaining knowledge on previously met samples) as meta-train and meta-test, respectively. Hence, we dub our method as Learning to Retain while Acquiring. Moreover, we identify an implicit aligning factor between the Knowledge-Retention and Knowledge-Acquisition tasks indicating that the proposed student update strategy enforces a common gradient direction for both tasks, alleviating interference between the two objectives. Finally, we support our hypothesis by exhibiting extensive evaluation and comparison of our method with prior arts on multiple datasets.

* Accepted at CVPR 2023

Via

Access Paper or Ask Questions

Class Balancing GAN with a Classifier in the Loop

Jun 17, 2021

Harsh Rangwani, Konda Reddy Mopuri, R. Venkatesh Babu

Figure 1 for Class Balancing GAN with a Classifier in the Loop

Figure 2 for Class Balancing GAN with a Classifier in the Loop

Figure 3 for Class Balancing GAN with a Classifier in the Loop

Figure 4 for Class Balancing GAN with a Classifier in the Loop

Abstract:Generative Adversarial Networks (GANs) have swiftly evolved to imitate increasingly complex image distributions. However, majority of the developments focus on performance of GANs on balanced datasets. We find that the existing GANs and their training regimes which work well on balanced datasets fail to be effective in case of imbalanced (i.e. long-tailed) datasets. In this work we introduce a novel theoretically motivated Class Balancing regularizer for training GANs. Our regularizer makes use of the knowledge from a pre-trained classifier to ensure balanced learning of all the classes in the dataset. This is achieved via modelling the effective class frequency based on the exponential forgetting observed in neural networks and encouraging the GAN to focus on underrepresented classes. We demonstrate the utility of our regularizer in learning representations for long-tailed distributions via achieving better performance than existing approaches over multiple datasets. Specifically, when applied to an unconditional GAN, it improves the FID from $13.03$ to $9.01$ on the long-tailed iNaturalist-$2019$ dataset.

* UAI 2021

Via

Access Paper or Ask Questions

Data Impressions: Mining Deep Models to Extract Samples for Data-free Applications

Jan 15, 2021

Gaurav Kumar Nayak, Konda Reddy Mopuri, Saksham Jain, Anirban Chakraborty

Figure 1 for Data Impressions: Mining Deep Models to Extract Samples for Data-free Applications

Figure 2 for Data Impressions: Mining Deep Models to Extract Samples for Data-free Applications

Figure 3 for Data Impressions: Mining Deep Models to Extract Samples for Data-free Applications

Figure 4 for Data Impressions: Mining Deep Models to Extract Samples for Data-free Applications

Abstract:Pretrained deep models hold their learnt knowledge in the form of the model parameters. These parameters act as memory for the trained models and help them generalize well on unseen data. However, in absence of training data, the utility of a trained model is merely limited to either inference or better initialization towards a target task. In this paper, we go further and extract synthetic data by leveraging the learnt model parameters. We dub them "Data Impressions", which act as proxy to the training data and can be used to realize a variety of tasks. These are useful in scenarios where only the pretrained models are available and the training data is not shared (e.g., due to privacy or sensitivity concerns). We show the applicability of data impressions in solving several computer vision tasks such as unsupervised domain adaptation, continual learning as well as knowledge distillation. We also study the adversarial robustness of the lightweight models trained via knowledge distillation using these data impressions. Further, we demonstrate the efficacy of data impressions in generating UAPs with better fooling rates. Extensive experiments performed on several benchmark datasets demonstrate competitive performance achieved using data impressions in absence of the original training data.

* PAMI Submission (Under Review). arXiv admin note: substantial text overlap with arXiv:1905.08114

Via

Access Paper or Ask Questions

Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Nov 18, 2020

Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty

Figure 1 for Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Figure 2 for Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Figure 3 for Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Figure 4 for Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Abstract:Knowledge Distillation is an effective method to transfer the learning across deep neural networks. Typically, the dataset originally used for training the Teacher model is chosen as the "Transfer Set" to conduct the knowledge transfer to the Student. However, this original training data may not always be freely available due to privacy or sensitivity concerns. In such scenarios, existing approaches either iteratively compose a synthetic set representative of the original training dataset, one sample at a time or learn a generative model to compose such a transfer set. However, both these approaches involve complex optimization (GAN training or several backpropagation steps to synthesize one sample) and are often computationally expensive. In this paper, as a simple alternative, we investigate the effectiveness of "arbitrary transfer sets" such as random noise, publicly available synthetic, and natural datasets, all of which are completely unrelated to the original training dataset in terms of their visual or semantic contents. Through extensive experiments on multiple benchmark datasets such as MNIST, FMNIST, CIFAR-10 and CIFAR-100, we discover and validate surprising effectiveness of using arbitrary data to conduct knowledge distillation when this dataset is "target-class balanced". We believe that this important observation can potentially lead to designing baselines for the data-free knowledge distillation task.

* Accepted in WACV 2021

Via

Access Paper or Ask Questions

Dataset Condensation with Gradient Matching

Jun 10, 2020

Bo Zhao, Konda Reddy Mopuri, Hakan Bilen

Figure 1 for Dataset Condensation with Gradient Matching

Figure 2 for Dataset Condensation with Gradient Matching

Figure 3 for Dataset Condensation with Gradient Matching

Figure 4 for Dataset Condensation with Gradient Matching

Abstract:Efficient training of deep neural networks is an increasingly important problem in the era of sophisticated architectures and large-scale datasets. This paper proposes a training set synthesis technique, called Dataset Condensation, that learns to produce a small set of informative samples for training deep neural networks from scratch in a small fraction of the required computational cost on the original data while achieving comparable results. We rigorously evaluate its performance in several computer vision benchmarks and show that it significantly outperforms the state-of-the-art methods. Finally we show promising applications of our method in continual learning and domain adaptation.

Via

Access Paper or Ask Questions

Adversarial Fooling Beyond "Flipping the Label"

Apr 27, 2020

Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu

Figure 1 for Adversarial Fooling Beyond "Flipping the Label"

Figure 2 for Adversarial Fooling Beyond "Flipping the Label"

Figure 3 for Adversarial Fooling Beyond "Flipping the Label"

Figure 4 for Adversarial Fooling Beyond "Flipping the Label"

Abstract:Recent advancements in CNNs have shown remarkable achievements in various CV/AI applications. Though CNNs show near human or better than human performance in many critical tasks, they are quite vulnerable to adversarial attacks. These attacks are potentially dangerous in real-life deployments. Though there have been many adversarial attacks proposed in recent years, there is no proper way of quantifying the effectiveness of these attacks. As of today, mere fooling rate is used for measuring the susceptibility of the models, or the effectiveness of adversarial attacks. Fooling rate just considers label flipping and does not consider the cost of such flipping, for instance, in some deployments, flipping between two species of dogs may not be as severe as confusing a dog category with that of a vehicle. Therefore, the metric to quantify the vulnerability of the models should capture the severity of the flipping as well. In this work we first bring out the drawbacks of the existing evaluation and propose novel metrics to capture various aspects of the fooling. Further, for the first time, we present a comprehensive analysis of several important adversarial attacks over a set of distinct CNN architectures. We believe that the presented analysis brings valuable insights about the current adversarial attacks and the CNN models.

* CVPR-AMLCV-2020

Via

Access Paper or Ask Questions

iDLG: Improved Deep Leakage from Gradients

Jan 08, 2020

Bo Zhao, Konda Reddy Mopuri, Hakan Bilen

Figure 1 for iDLG: Improved Deep Leakage from Gradients

Figure 2 for iDLG: Improved Deep Leakage from Gradients

Abstract:It is widely believed that sharing gradients will not leak private training data in distributed learning systems such as Collaborative Learning and Federated Learning, etc. Recently, Zhu et al. presented an approach which shows the possibility to obtain private training data from the publicly shared gradients. In their Deep Leakage from Gradient (DLG) method, they synthesize the dummy data and corresponding labels with the supervision of shared gradients. However, DLG has difficulty in convergence and discovering the ground-truth labels consistently. In this paper, we find that sharing gradients definitely leaks the ground-truth labels. We propose a simple but reliable approach to extract accurate data from the gradients. Particularly, our approach can certainly extract the ground-truth labels as opposed to DLG, hence we name it Improved DLG (iDLG). Our approach is valid for any differentiable model trained with cross-entropy loss over one-hot labels. We mathematically illustrate how our method can extract ground-truth labels from the gradients and empirically demonstrate the advantages over DLG.

Via

Access Paper or Ask Questions

Zero-Shot Knowledge Distillation in Deep Networks

May 20, 2019

Gaurav Kumar Nayak, Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu, Anirban Chakraborty

Figure 1 for Zero-Shot Knowledge Distillation in Deep Networks

Figure 2 for Zero-Shot Knowledge Distillation in Deep Networks

Figure 3 for Zero-Shot Knowledge Distillation in Deep Networks

Figure 4 for Zero-Shot Knowledge Distillation in Deep Networks

Abstract:Knowledge distillation deals with the problem of training a smaller model (Student) from a high capacity source model (Teacher) so as to retain most of its performance. Existing approaches use either the training data or meta-data extracted from it in order to train the Student. However, accessing the dataset on which the Teacher has been trained may not always be feasible if the dataset is very large or it poses privacy or safety concerns (e.g., bio-metric or medical data). Hence, in this paper, we propose a novel data-free method to train the Student from the Teacher. Without even using any meta-data, we synthesize the Data Impressions from the complex Teacher model and utilize these as surrogates for the original training data samples to transfer its learning to Student via knowledge distillation. We, therefore, dub our method "Zero-Shot Knowledge Distillation" and demonstrate that our framework results in competitive generalization performance as achieved by distillation using the actual training data samples on multiple benchmark datasets.

* Accepted in ICML 2019, codes will be available at https://github.com/vcl-iisc/ZSKD

Via

Access Paper or Ask Questions