Title: REAL: A Representative Error-Driven Approach for Active Learning

Jul 06, 2023

Cheng Chen, Yong Wang, Lizi Liao, Yueguo Chen, Xiaoyong Du

Abstract: Given a limited labeling budget, active learning (AL) aims to sample the most informative instances from an unlabeled pool to acquire labels for subsequent model training. To achieve this, AL typically measures the informativeness of unlabeled instances based on uncertainty and diversity. However, it does not consider erroneous instances with their neighborhood error density, which have great potential to improve the model performance. To address this limitation, we propose REAL, a novel approach to select data instances with Representative Errors for Active Learning. It identifies minority predictions as "pseudo errors" within a cluster and allocates an adaptive sampling budget for the cluster based on estimated error density. Extensive experiments on five text classification datasets demonstrate that REAL consistently outperforms all best-performing baselines regarding accuracy and F1-macro scores across a wide range of hyperparameter settings. Our analysis also shows that REAL selects the most representative pseudo errors that match the distribution of ground-truth errors along the decision boundary. Our code is publicly available at https://github.com/withchencheng/ECML_PKDD_23_Real.
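The two steps named in the abstract (flagging minority predictions within a cluster as pseudo errors, then giving each cluster a budget based on its estimated error density) can be illustrated with a minimal sketch. This is not the authors' implementation, which is in the repository linked above. It assumes cluster assignments are already computed by some clustering method, and it assumes a simple proportional allocation rule; the function names `find_pseudo_errors` and `allocate_budget` are hypothetical.

```python
from collections import Counter, defaultdict


def find_pseudo_errors(cluster_ids, predictions):
    """Map each cluster to the indices whose predicted label differs
    from that cluster's majority prediction (the "pseudo errors")."""
    members = defaultdict(list)
    for idx, cluster in enumerate(cluster_ids):
        members[cluster].append(idx)

    pseudo_errors = {}
    for cluster, idxs in members.items():
        # Majority label among the model's predictions in this cluster.
        majority, _ = Counter(predictions[i] for i in idxs).most_common(1)[0]
        pseudo_errors[cluster] = [i for i in idxs if predictions[i] != majority]
    return pseudo_errors


def allocate_budget(cluster_ids, predictions, budget):
    """Split a labeling budget across clusters in proportion to each
    cluster's pseudo-error density (an assumed allocation rule)."""
    pseudo_errors = find_pseudo_errors(cluster_ids, predictions)
    sizes = Counter(cluster_ids)
    # Estimated error density: fraction of a cluster that is pseudo errors.
    density = {c: len(errs) / sizes[c] for c, errs in pseudo_errors.items()}
    total = sum(density.values())
    if total == 0:
        return {c: 0 for c in density}
    # Floor the proportional share and cap it at the pseudo errors available.
    return {
        c: min(len(pseudo_errors[c]), int(budget * density[c] / total))
        for c in density
    }


if __name__ == "__main__":
    clusters = [0, 0, 0, 0, 1, 1, 1, 1]
    preds = ["a", "a", "a", "a", "b", "b", "b", "a"]
    print(find_pseudo_errors(clusters, preds))
    print(allocate_budget(clusters, preds, budget=3))
```

In this toy input, cluster 0 is unanimous and receives no budget, while cluster 1 contains one minority prediction and receives the only available slot. Flooring and capping can leave part of the budget unspent; how leftover budget is redistributed is a design choice this sketch does not address.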

* Accepted by ECML/PKDD 2023
