Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yixi Ding

CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Oct 16, 2024

Yixi Ding, Jiaying Wu, Tongyao Zhu, Yanxia Qin, Qian Liu, Min-Yen Kan

Figure 1 for CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Figure 2 for CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Figure 3 for CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Figure 4 for CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Abstract:To broaden the dissemination of scientific knowledge to diverse audiences, scientific document summarization must simultaneously control multiple attributes such as length and empirical focus. However, existing research typically focuses on controlling single attributes, leaving the compositional control of multiple attributes underexplored. To address this gap, we introduce CCSBench, a benchmark for compositional controllable summarization in the scientific domain. Our benchmark enables fine-grained control over both explicit attributes (e.g., length), which are objective and straightforward, and implicit attributes (e.g., empirical focus), which are more subjective and conceptual. We conduct extensive experiments on GPT-4, LLaMA2, and other popular LLMs under various settings. Our findings reveal significant limitations in large language models' ability to balance trade-offs between control attributes, especially implicit ones that require deeper understanding and abstract reasoning.

Via

Access Paper or Ask Questions