Kyoungho Choi

Sequential Targeting: an incremental learning approach for data imbalance in text classification

Nov 23, 2020

Joel Jang, Yoonjeon Kim, Kyoungho Choi, Sungho Suh

Abstract: Classification tasks require a balanced distribution of data to ensure that the learner is trained to generalize over all classes. In real-world datasets, however, the number of instances varies substantially among classes. This typically leads to a learner biased towards the majority group because of its dominance. Methods for handling imbalanced datasets are therefore crucial for alleviating distributional skew and fully utilizing the under-represented data, especially in text classification. To address imbalance in text data, most methods apply sampling to the numerical representation of the data, which makes their efficacy depend on how effective that representation is. We propose a novel training method, Sequential Targeting (ST), that is independent of the effectiveness of the representation method: it enforces an incremental learning setting by splitting the data into mutually exclusive subsets and training the learner adaptively. To address problems that arise within incremental learning, we apply elastic weight consolidation. We demonstrate the effectiveness of our method through experiments on simulated benchmark datasets (IMDB) and data collected from NAVER.
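
The two ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The splitting rule in `split_balanced` (a class-balanced subset plus the remainder) is an assumption chosen only to show what "mutually exclusive subsets" means, and all function and parameter names are hypothetical. The penalty follows the standard elastic weight consolidation form, (λ/2) Σ F_i (θ_i − θ*_i)², with a diagonal Fisher estimate supplied by the caller.

```python
from collections import defaultdict
from typing import Hashable, List, Sequence, Tuple


def split_balanced(labels: Sequence[Hashable]) -> Tuple[List[int], List[int]]:
    """Split example indices into two mutually exclusive subsets.

    Hypothetical rule (an assumption, not taken from the paper): the second
    subset holds the same number of examples per class, and the first subset
    holds everything that is left over.
    """
    if not labels:
        raise ValueError("labels must be non-empty")
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    # Size of the smallest class decides how many examples each class contributes.
    k = min(len(indices) for indices in by_class.values())
    balanced = sorted(i for indices in by_class.values() for i in indices[:k])
    remainder = sorted(i for indices in by_class.values() for i in indices[k:])
    return remainder, balanced


def ewc_penalty(
    params: Sequence[float],
    anchor_params: Sequence[float],
    fisher_diag: Sequence[float],
    lam: float,
) -> float:
    """Quadratic EWC penalty: (lam / 2) * sum_i F_i * (theta_i - theta*_i) ** 2."""
    if not (len(params) == len(anchor_params) == len(fisher_diag)):
        raise ValueError("all inputs must have the same length")
    return 0.5 * lam * sum(
        f * (p - a) ** 2 for p, a, f in zip(params, anchor_params, fisher_diag)
    )


def total_loss(
    task_loss: float,
    params: Sequence[float],
    anchor_params: Sequence[float],
    fisher_diag: Sequence[float],
    lam: float,
) -> float:
    """Loss on the current subset plus the penalty that protects earlier learning."""
    return task_loss + ewc_penalty(params, anchor_params, fisher_diag, lam)
```

In an incremental setting of this kind, a learner would be trained on the first subset, its parameters stored as `anchor_params`, and then trained on the second subset with `total_loss`, so that parameters with a large Fisher value are discouraged from drifting.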

* 9 pages, 7 figures, submitted to the journal of Expert Systems with Applications
