Shufan Ming

Multi-label Sequential Sentence Classification via Large Language Model

Nov 23, 2024

Mengfei Lan, Lecheng Zheng, Shufan Ming, Halil Kilicoglu

Abstract: Sequential sentence classification (SSC) in scientific publications is crucial for supporting downstream tasks such as fine-grained information retrieval and extractive summarization. However, current SSC methods are constrained by model size, sequence length, and a single-label setting. To address these limitations, this paper proposes LLM-SSC, a large language model (LLM)-based framework for both single- and multi-label SSC tasks. Unlike previous approaches that employ small- or medium-sized language models, the proposed framework uses LLMs to generate SSC labels through designed prompts, which enhance task understanding by incorporating demonstrations and a query that describes the prediction target. We also present a multi-label contrastive learning loss with an auto-weighting scheme, which enables the multi-label classification task. To support our multi-label SSC analysis, we introduce and release a new dataset, biorc800, which mainly contains unstructured abstracts in the biomedical domain with manual annotations. Experiments demonstrate LLM-SSC's strong performance in SSC under both in-context learning and task-specific tuning settings. We release biorc800 and our code at: https://github.com/ScienceNLP-Lab/LLM-SSC.

* Accepted by EMNLP 2024
