Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Atin Ghosh

Pretrained equivariant features improve unsupervised landmark discovery

Apr 07, 2021

Rahul Rahaman, Atin Ghosh, Alexandre H. Thiery

Figure 1 for Pretrained equivariant features improve unsupervised landmark discovery

Figure 2 for Pretrained equivariant features improve unsupervised landmark discovery

Figure 3 for Pretrained equivariant features improve unsupervised landmark discovery

Figure 4 for Pretrained equivariant features improve unsupervised landmark discovery

Abstract:Locating semantically meaningful landmark points is a crucial component of a large number of computer vision pipelines. Because of the small number of available datasets with ground truth landmark annotations, it is important to design robust unsupervised and semi-supervised methods for landmark detection. Many of the recent unsupervised learning methods rely on the equivariance properties of landmarks to synthetic image deformations. Our work focuses on such widely used methods and sheds light on its core problem, its inability to produce equivariant intermediate convolutional features. This finding leads us to formulate a two-step unsupervised approach that overcomes this challenge by first learning powerful pixel-based features and then use the pre-trained features to learn a landmark detector by the traditional equivariance method. Our method produces state-of-the-art results in several challenging landmark detection datasets such as the BBC Pose dataset and the Cat-Head dataset. It performs comparably on a range of other benchmarks.

Via

Access Paper or Ask Questions

On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Jan 18, 2021

Atin Ghosh, Alexandre H. Thiery

Figure 1 for On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Figure 2 for On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Figure 3 for On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Figure 4 for On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Abstract:Recently proposed consistency-based Semi-Supervised Learning (SSL) methods such as the $\Pi$-model, temporal ensembling, the mean teacher, or the virtual adversarial training, have advanced the state of the art in several SSL tasks. These methods can typically reach performances that are comparable to their fully supervised counterparts while using only a fraction of labelled examples. Despite these methodological advances, the understanding of these methods is still relatively limited. In this text, we analyse (variations of) the $\Pi$-model in settings where analytically tractable results can be obtained. We establish links with Manifold Tangent Classifiers and demonstrate that the quality of the perturbations is key to obtaining reasonable SSL performances. Importantly, we propose a simple extension of the Hidden Manifold Model that naturally incorporates data-augmentation schemes and offers a framework for understanding and experimenting with SSL methods.

* ICLR 2021

Via

Access Paper or Ask Questions

ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Mar 16, 2018

Srayanta Mukherjee, Devashish Shankar, Atin Ghosh, Nilam Tathawadekar, Pramod Kompalli, Sunita Sarawagi, Krishnendu Chaudhury

Figure 1 for ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Figure 2 for ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Figure 3 for ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Figure 4 for ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Abstract:Accurate demand forecasts can help on-line retail organizations better plan their supply-chain processes. The challenge, however, is the large number of associative factors that result in large, non-stationary shifts in demand, which traditional time series and regression approaches fail to model. In this paper, we propose a Neural Network architecture called AR-MDN, that simultaneously models associative factors, time-series trends and the variance in the demand. We first identify several causal features and use a combination of feature embeddings, MLP and LSTM to represent them. We then model the output density as a learned mixture of Gaussian distributions. The AR-MDN can be trained end-to-end without the need for additional supervision. We experiment on a dataset of an year's worth of data over tens-of-thousands of products from Flipkart. The proposed architecture yields a significant improvement in forecasting accuracy when compared with existing alternatives.

Via

Access Paper or Ask Questions