Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Oct 02, 2024

Qian Ruan, Ilia Kuznetsov, Iryna Gurevych

Figure 1 for Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Figure 2 for Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Figure 3 for Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Figure 4 for Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Share this with someone who'll enjoy it:

Abstract:Classification is a core NLP task architecture with many potential applications. While large language models (LLMs) have brought substantial advancements in text generation, their potential for enhancing classification tasks remains underexplored. To address this gap, we propose a framework for thoroughly investigating fine-tuning LLMs for classification, including both generation- and encoding-based approaches. We instantiate this framework in edit intent classification (EIC), a challenging and underexplored classification task. Our extensive experiments and systematic comparisons with various training approaches and a representative selection of LLMs yield new insights into their application for EIC. We investigate the generalizability of these findings on five further classification tasks. To demonstrate the proposed methods and address the data shortage for empirical edit analysis, we use our best-performing EIC model to create Re3-Sci2.0, a new large-scale dataset of 1,780 scientific document revisions with over 94k labeled edits. The quality of the dataset is assessed through human evaluation. The new dataset enables an in-depth empirical study of human editing behavior in academic writing. We make our experimental framework, models and data publicly available.

* EMNLP2024 Main

View paper on

Share this with someone who'll enjoy it:

Title:Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions

Paper and Code