Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aman Mohanty

Harmonic Machine Learning Models are Robust

Apr 29, 2024

Nicholas S. Kersting, Yi Li, Aman Mohanty, Oyindamola Obisesan, Raphael Okochu

Abstract:We introduce Harmonic Robustness, a powerful and intuitive method to test the robustness of any machine-learning model either during training or in black-box real-time inference monitoring without ground-truth labels. It is based on functional deviation from the harmonic mean value property, indicating instability and lack of explainability. We show implementation examples in low-dimensional trees and feedforward NNs, where the method reliably identifies overfitting, as well as in more complex high-dimensional models such as ResNet-50 and Vision Transformer where it efficiently measures adversarial vulnerability across image classes.

* 18 pages, 13 figures

Via

Access Paper or Ask Questions

Efficient CNN-LSTM based Image Captioning using Neural Network Compression

Dec 17, 2020

Harshit Rampal, Aman Mohanty

Figure 1 for Efficient CNN-LSTM based Image Captioning using Neural Network Compression

Figure 2 for Efficient CNN-LSTM based Image Captioning using Neural Network Compression

Figure 3 for Efficient CNN-LSTM based Image Captioning using Neural Network Compression

Figure 4 for Efficient CNN-LSTM based Image Captioning using Neural Network Compression

Abstract:Modern Neural Networks are eminent in achieving state of the art performance on tasks under Computer Vision, Natural Language Processing and related verticals. However, they are notorious for their voracious memory and compute appetite which further obstructs their deployment on resource limited edge devices. In order to achieve edge deployment, researchers have developed pruning and quantization algorithms to compress such networks without compromising their efficacy. Such compression algorithms are broadly experimented on standalone CNN and RNN architectures while in this work, we present an unconventional end to end compression pipeline of a CNN-LSTM based Image Captioning model. The model is trained using VGG16 or ResNet50 as an encoder and an LSTM decoder on the flickr8k dataset. We then examine the effects of different compression architectures on the model and design a compression architecture that achieves a 73.1% reduction in model size, 71.3% reduction in inference time and a 7.7% increase in BLEU score as compared to its uncompressed counterpart.

Via

Access Paper or Ask Questions