Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aprameya Bharadwaj

Graph Based Temporal Aggregation for Video Retrieval

Nov 04, 2020

Arvind Srinivasan, Aprameya Bharadwaj, Aveek Saha, Subramanyam Natarajan

Figure 1 for Graph Based Temporal Aggregation for Video Retrieval

Figure 2 for Graph Based Temporal Aggregation for Video Retrieval

Figure 3 for Graph Based Temporal Aggregation for Video Retrieval

Figure 4 for Graph Based Temporal Aggregation for Video Retrieval

Abstract:Large scale video retrieval is a field of study with a lot of ongoing research. Most of the work in the field is on video retrieval through text queries using techniques such as VSE++. However, there is little research done on video retrieval through image queries, and the work that has been done in this field either uses image queries from within the video dataset or iterates through videos frame by frame. These approaches are not generalized for queries from outside the dataset and do not scale well for large video datasets. To overcome these issues, we propose a new approach for video retrieval through image queries where an undirected graph is constructed from the combined set of frames from all videos to be searched. The node features of this graph are used in the task of video retrieval. Experimentation is done on the MSR-VTT dataset by using query images from outside the dataset. To evaluate this novel approach P@5, P@10 and P@20 metrics are calculated. Two different ResNet models namely, ResNet-152 and ResNet-50 are used in this study.

* 6 pages, 6 figures, 7 tables

Via

Access Paper or Ask Questions

Optimization of Image Embeddings for Few Shot Learning

Apr 04, 2020

Arvind Srinivasan, Aprameya Bharadwaj, Manasa Sathyan, S Natarajan

Figure 1 for Optimization of Image Embeddings for Few Shot Learning

Figure 2 for Optimization of Image Embeddings for Few Shot Learning

Figure 3 for Optimization of Image Embeddings for Few Shot Learning

Figure 4 for Optimization of Image Embeddings for Few Shot Learning

Abstract:In this paper we improve the image embeddings generated in the graph neural network solution for few shot learning. We propose alternate architectures for existing networks such as Inception-Net, U-Net, Attention U-Net, and Squeeze-Net to generate embeddings and increase the accuracy of the models. We improve the quality of embeddings created at the cost of the time taken to generate them. The proposed implementations outperform the existing state of the art methods for 1-shot and 5-shot learning on the Omniglot dataset. The experiments involved a testing set and training set which had no common classes between them. The results for 5-way and 10-way/20-way tests have been tabulated.

* 6 pages, 8 figures

Via

Access Paper or Ask Questions