Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ye Nan

NUS

Optimizing F-measure: A Tale of Two Approaches

Jun 18, 2012

Ye Nan, Kian Ming Chai, Wee Sun Lee, Hai Leong Chieu

Figure 1 for Optimizing F-measure: A Tale of Two Approaches

Figure 2 for Optimizing F-measure: A Tale of Two Approaches

Figure 3 for Optimizing F-measure: A Tale of Two Approaches

Figure 4 for Optimizing F-measure: A Tale of Two Approaches

Abstract:F-measures are popular performance metrics, particularly for tasks with imbalanced data sets. Algorithms for learning to maximize F-measures follow two approaches: the empirical utility maximization (EUM) approach learns a classifier having optimal performance on training data, while the decision-theoretic approach learns a probabilistic model and then predicts labels with maximum expected F-measure. In this paper, we investigate the theoretical justifications and connections for these two approaches, and we study the conditions under which one approach is preferable to the other using synthetic and real datasets. Given accurate models, our results suggest that the two approaches are asymptotically equivalent given large training and test sets. Nevertheless, empirically, the EUM approach appears to be more robust against model misspecification, and given a good model, the decision-theoretic approach appears to be better for handling rare classes and a common domain adaptation scenario.

* ICML2012

Via

Access Paper or Ask Questions