Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Arun Goel

A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning

May 09, 2021

Kartik Arora, Ajul Raj, Arun Goel, Seba Susan

Figure 1 for A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning

Figure 2 for A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning

Figure 3 for A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning

Figure 4 for A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning

Abstract:A hybrid model is proposed that integrates two popular image captioning methods to generate a text-based summary describing the contents of the image. The two image captioning models are the Neural Image Caption (NIC) and the k-nearest neighbor approach. These are trained individually on the training set. We extract a set of five features, from the validation set, for evaluating the results of the two models that in turn is used to train a logistic regression classifier. The BLEU-4 scores of the two models are compared for generating the binary-value ground truth for the logistic regression classifier. For the test set, the input images are first passed separately through the two models to generate the individual captions. The five-dimensional feature set extracted from the two models is passed to the logistic regression classifier to take a decision regarding the final caption generated which is the best of two captions generated by the models. Our implementation of the k-nearest neighbor model achieves a BLEU-4 score of 15.95 and the NIC model achieves a BLEU-4 score of 16.01, on the benchmark Flickr8k dataset. The proposed hybrid model is able to achieve a BLEU-4 score of 18.20 proving the validity of our approach.

* Included in Proceedings of 3rd ICSCSP 2020

Via

Access Paper or Ask Questions