Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Jun 23, 2021

Charaf Eddine Benarab

Figure 1 for Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Figure 2 for Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Figure 3 for Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Figure 4 for Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Share this with someone who'll enjoy it:

Abstract:Knowledge is acquired by humans through experience, and no boundary is set between the kinds of knowledge or skill levels we can achieve on different tasks at the same time. When it comes to Neural Networks, that is not the case, the major breakthroughs in the field are extremely task and domain specific. Vision and language are dealt with in separate manners, using separate methods and different datasets. In this work, we propose to use knowledge acquired by benchmark Vision Models which are trained on ImageNet to help a much smaller architecture learn to classify text. After transforming the textual data contained in the IMDB dataset to gray scale images. An analysis of different domains and the Transfer Learning method is carried out. Despite the challenge posed by the very different datasets, promising results are achieved. The main contribution of this work is a novel approach which links large pretrained models on both language and vision to achieve state-of-the-art results in different sub-fields from the original task. Without needing high compute capacity resources. Specifically, Sentiment Analysis is achieved after transferring knowledge between vision and language models. BERT embeddings are transformed into grayscale images, these images are then used as training examples for pretrained vision models such as VGG16 and ResNet Index Terms: Natural language, Vision, BERT, Transfer Learning, CNN, Domain Adaptation.

* Paper contains: 5 pages, 6 figures, 1 table

View paper on

Share this with someone who'll enjoy it:

Title:Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Paper and Code