Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Sep 26, 2019

Jianming Zheng, Fei Cai, Honghui Chen, Maarten de Rijke

Figure 1 for Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Figure 2 for Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Figure 3 for Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Figure 4 for Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Share this with someone who'll enjoy it:

Abstract:Text representation can aid machines in understanding text. Previous work on text representation often focuses on the so-called forward implication, i.e., preceding words are taken as the context of later words for creating representations, thus ignoring the fact that the semantics of a text segment is a product of the mutual implication of words in the text: later words contribute to the meaning of preceding words. We introduce the concept of interaction and propose a two-perspective interaction representation, that encapsulates a local and a global interaction representation. Here, a local interaction representation is one that interacts among words with parent-children relationships on the syntactic trees and a global interaction interpretation is one that interacts among all the words in a sentence. We combine the two interaction representations to develop a Hybrid Interaction Representation (HIR). Inspired by existing feature-based and fine-tuning-based pretrain-finetuning approaches to language models, we integrate the advantages of feature-based and fine-tuning-based methods to propose the Pre-train, Interact, Fine-tune (PIF) architecture. We evaluate our proposed models on five widely-used datasets for text classification tasks. Our ensemble method, outperforms state-of-the-art baselines with improvements ranging from 2.03% to 3.15% in terms of error rate. In addition, we find that, the improvements of PIF against most state-of-the-art methods is not affected by increasing of the length of the text.

* 32 pages, 5 figures

View paper on

Share this with someone who'll enjoy it:

Title:Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification

Paper and Code