Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hemanth Venkateswara

Dataset Augmentation by Mixing Visual Concepts

Dec 19, 2024

Abdullah Al Rahat, Hemanth Venkateswara

Abstract:This paper proposes a dataset augmentation method by fine-tuning pre-trained diffusion models. Generating images using a pre-trained diffusion model with textual conditioning often results in domain discrepancy between real data and generated images. We propose a fine-tuning approach where we adapt the diffusion model by conditioning it with real images and novel text embeddings. We introduce a unique procedure called Mixing Visual Concepts (MVC) where we create novel text embeddings from image captions. The MVC enables us to generate multiple images which are diverse and yet similar to the real data enabling us to perform effective dataset augmentation. We perform comprehensive qualitative and quantitative evaluations with the proposed dataset augmentation approach showcasing both coarse-grained and finegrained changes in generated images. Our approach outperforms state-of-the-art augmentation techniques on benchmark classification tasks.

* Accepted at WACV 2025 main conference

Via

Access Paper or Ask Questions

Domain Adaptation Using Pseudo Labels

Feb 09, 2024

Sachin Chhabra, Hemanth Venkateswara, Baoxin Li

Abstract:In the absence of labeled target data, unsupervised domain adaptation approaches seek to align the marginal distributions of the source and target domains in order to train a classifier for the target. Unsupervised domain alignment procedures are category-agnostic and end up misaligning the categories. We address this problem by deploying a pretrained network to determine accurate labels for the target domain using a multi-stage pseudo-label refinement procedure. The filters are based on the confidence, distance (conformity), and consistency of the pseudo labels. Our results on multiple datasets demonstrate the effectiveness of our simple procedure in comparison with complex state-of-the-art techniques.

* 8 pages + 3 pages of references

Via

Access Paper or Ask Questions

Domain-Invariant Feature Alignment Using Variational Inference For Partial Domain Adaptation

Dec 03, 2022

Sandipan Choudhuri, Suli Adeniye, Arunabha Sen, Hemanth Venkateswara

Abstract:The standard closed-set domain adaptation approaches seek to mitigate distribution discrepancies between two domains under the constraint of both sharing identical label sets. However, in realistic scenarios, finding an optimal source domain with identical label space is a challenging task. Partial domain adaptation alleviates this problem of procuring a labeled dataset with identical label space assumptions and addresses a more practical scenario where the source label set subsumes the target label set. This, however, presents a few additional obstacles during adaptation. Samples with categories private to the source domain thwart relevant knowledge transfer and degrade model performance. In this work, we try to address these issues by coupling variational information and adversarial learning with a pseudo-labeling technique to enforce class distribution alignment and minimize the transfer of superfluous information from the source samples. The experimental findings in numerous cross-domain classification tasks demonstrate that the proposed technique delivers superior and comparable accuracy to existing methods.

* Accepted in the 56th Asilomar Conference on Signals, Systems, and Computers, 2022

Via

Access Paper or Ask Questions

PatchRot: A Self-Supervised Technique for Training Vision Transformers

Oct 27, 2022

Sachin Chhabra, Prabal Bijoy Dutta, Hemanth Venkateswara, Baoxin Li

Abstract:Vision transformers require a huge amount of labeled data to outperform convolutional neural networks. However, labeling a huge dataset is a very expensive process. Self-supervised learning techniques alleviate this problem by learning features similar to supervised learning in an unsupervised way. In this paper, we propose a self-supervised technique PatchRot that is crafted for vision transformers. PatchRot rotates images and image patches and trains the network to predict the rotation angles. The network learns to extract both global and local features from an image. Our extensive experiments on different datasets showcase PatchRot training learns rich features which outperform supervised learning and compared baseline.

* NeurIPS Workshop on Vision Transformers: Theory and Applications (VTTA)

Via

Access Paper or Ask Questions

Coupling Adversarial Learning with Selective Voting Strategy for Distribution Alignment in Partial Domain Adaptation

Jul 17, 2022

Sandipan Choudhuri, Hemanth Venkateswara, Arunabha Sen

Figure 1 for Coupling Adversarial Learning with Selective Voting Strategy for Distribution Alignment in Partial Domain Adaptation

Abstract:In contrast to a standard closed-set domain adaptation task, partial domain adaptation setup caters to a realistic scenario by relaxing the identical label set assumption. The fact of source label set subsuming the target label set, however, introduces few additional obstacles as training on private source category samples thwart relevant knowledge transfer and mislead the classification process. To mitigate these issues, we devise a mechanism for strategic selection of highly-confident target samples essential for the estimation of class-importance weights. Furthermore, we capture class-discriminative and domain-invariant features by coupling the process of achieving compact and distinct class distributions with an adversarial objective. Experimental findings over numerous cross-domain classification tasks demonstrate the potential of the proposed technique to deliver superior and comparable accuracy over existing methods.

Via

Access Paper or Ask Questions

Sparsity Regularization For Cold-Start Recommendation

Jan 28, 2022

Aksheshkumar Ajaykumar Shah, Hemanth Venkateswara

Figure 1 for Sparsity Regularization For Cold-Start Recommendation

Figure 2 for Sparsity Regularization For Cold-Start Recommendation

Figure 3 for Sparsity Regularization For Cold-Start Recommendation

Figure 4 for Sparsity Regularization For Cold-Start Recommendation

Abstract:Recently, Generative Adversarial Networks (GANs) have been applied to the problem of Cold-Start Recommendation, but the training performance of these models is hampered by the extreme sparsity in warm user purchase behavior. In this paper we introduce a novel representation for user-vectors by combining user demographics and user preferences, making the model a hybrid system which uses Collaborative Filtering and Content Based Recommendation. Our system models user purchase behavior using weighted user-product preferences (explicit feedback) rather than binary user-product interactions (implicit feedback). Using this we develop a novel sparse adversarial model, SRLGAN, for Cold-Start Recommendation leveraging the sparse user-purchase behavior which ensures training stability and avoids over-fitting on warm users. We evaluate the SRLGAN on two popular datasets and demonstrate state-of-the-art results.

Via

Access Paper or Ask Questions

Partial Domain Adaptation Using Selective Representation Learning For Class-Weight Computation

Jan 06, 2021

Sandipan Choudhuri, Riti Paul, Arunabha Sen, Baoxin Li, Hemanth Venkateswara

Figure 1 for Partial Domain Adaptation Using Selective Representation Learning For Class-Weight Computation

Figure 2 for Partial Domain Adaptation Using Selective Representation Learning For Class-Weight Computation

Abstract:The generalization power of deep-learning models is dependent on rich-labelled data. This supervision using large-scaled annotated information is restrictive in most real-world scenarios where data collection and their annotation involve huge cost. Various domain adaptation techniques exist in literature that bridge this distribution discrepancy. However, a majority of these models require the label sets of both the domains to be identical. To tackle a more practical and challenging scenario, we formulate the problem statement from a partial domain adaptation perspective, where the source label set is a super set of the target label set. Driven by the motivation that image styles are private to each domain, in this work, we develop a method that identifies outlier classes exclusively from image content information and train a label classifier exclusively on class-content from source images. Additionally, elimination of negative transfer of samples from classes private to the source domain is achieved by transforming the soft class-level weights into two clusters, 0 (outlier source classes) and 1 (shared classes) by maximizing the between-cluster variance between them.

Via

Access Paper or Ask Questions

Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

Jul 19, 2020

Maunil R Vyas, Hemanth Venkateswara, Sethuraman Panchanathan

Figure 1 for Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

Figure 2 for Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

Figure 3 for Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

Figure 4 for Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

Abstract:Zero-shot learning (ZSL) addresses the unseen class recognition problem by leveraging semantic information to transfer knowledge from seen classes to unseen classes. Generative models synthesize the unseen visual features and convert ZSL into a classical supervised learning problem. These generative models are trained using the seen classes and are expected to implicitly transfer the knowledge from seen to unseen classes. However, their performance is stymied by overfitting, which leads to substandard performance on Generalized Zero-Shot learning (GZSL). To address this concern, we propose the novel LsrGAN, a generative model that Leverages the Semantic Relationship between seen and unseen categories and explicitly performs knowledge transfer by incorporating a novel Semantic Regularized Loss (SR-Loss). The SR-loss guides the LsrGAN to generate visual features that mirror the semantic relationships between seen and unseen classes. Experiments on seven benchmark datasets, including the challenging Wikipedia text-based CUB and NABirds splits, and Attribute-based AWA, CUB, and SUN, demonstrates the superiority of the LsrGAN compared to previous state-of-the-art approaches under both ZSL and GZSL. Code is available at https: // github. com/ Maunil/ LsrGAN

* 19 Pages, To be appear in ECCV 2020

Via

Access Paper or Ask Questions

Representation, Exploration and Recommendation of Music Playlists

Jul 01, 2019

Piyush Papreja, Hemanth Venkateswara, Sethuraman Panchanathan

Figure 1 for Representation, Exploration and Recommendation of Music Playlists

Figure 2 for Representation, Exploration and Recommendation of Music Playlists

Figure 3 for Representation, Exploration and Recommendation of Music Playlists

Figure 4 for Representation, Exploration and Recommendation of Music Playlists

Abstract:Playlists have become a significant part of our listening experience because of the digital cloud-based services such as Spotify, Pandora, Apple Music. Owing to the meteoric rise in the usage of playlists, recommending playlists is crucial to music services today. Although there has been a lot of work done in playlist prediction, the area of playlist representation hasn't received that level of attention. Over the last few years, sequence-to-sequence models, especially in the field of natural language processing, have shown the effectiveness of learned embeddings in capturing the semantic characteristics of sequences. We can apply similar concepts to music to learn fixed length representations for playlists and use those representations for downstream tasks such as playlist discovery, browsing, and recommendation. In this work, we formulate the problem of learning a fixed-length playlist representation in an unsupervised manner, using Sequence-to-sequence (Seq2seq) models, interpreting playlists as sentences and songs as words. We compare our model with two other encoding architectures for baseline comparison. We evaluate our work using the suite of tasks commonly used for assessing sentence embeddings, along with a few additional tasks pertaining to music, and a recommendation task to study the traits captured by the playlist embeddings and their effectiveness for the purpose of music recommendation.

Via

Access Paper or Ask Questions

A Strategy for an Uncompromising Incremental Learner

Jul 17, 2017

Ragav Venkatesan, Hemanth Venkateswara, Sethuraman Panchanathan, Baoxin Li

Figure 1 for A Strategy for an Uncompromising Incremental Learner

Figure 2 for A Strategy for an Uncompromising Incremental Learner

Figure 3 for A Strategy for an Uncompromising Incremental Learner

Figure 4 for A Strategy for an Uncompromising Incremental Learner

Abstract:Multi-class supervised learning systems require the knowledge of the entire range of labels they predict. Often when learnt incrementally, they suffer from catastrophic forgetting. To avoid this, generous leeways have to be made to the philosophy of incremental learning that either forces a part of the machine to not learn, or to retrain the machine again with a selection of the historic data. While these hacks work to various degrees, they do not adhere to the spirit of incremental learning. In this article, we redefine incremental learning with stringent conditions that do not allow for any undesirable relaxations and assumptions. We design a strategy involving generative models and the distillation of dark knowledge as a means of hallucinating data along with appropriate targets from past distributions. We call this technique, phantom sampling.We show that phantom sampling helps avoid catastrophic forgetting during incremental learning. Using an implementation based on deep neural networks, we demonstrate that phantom sampling dramatically avoids catastrophic forgetting. We apply these strategies to competitive multi-class incremental learning of deep neural networks. Using various benchmark datasets and through our strategy, we demonstrate that strict incremental learning could be achieved. We further put our strategy to test on challenging cases, including cross-domain increments and incrementing on a novel label space. We also propose a trivial extension to unbounded-continual learning and identify potential for future development.

* Under review at IEEE Transactions of Neural Networks and Learning Systems

Via

Access Paper or Ask Questions