Dongqi Pu

RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization

May 01, 2024

Dongqi Pu, Vera Demberg

Abstract: For long document summarization, discourse structure is important to discern the key content of the text and the differences in importance level between sentences. Unfortunately, the integration of rhetorical structure theory (RST) into parameter-efficient fine-tuning strategies for long document summarization remains unexplored. Therefore, this paper introduces RST-LoRA and proposes four RST-aware variants to explicitly incorporate RST into the LoRA model. Our empirical evaluation demonstrates that incorporating the type and uncertainty of rhetorical relations can complementarily enhance the performance of LoRA in summarization tasks. Furthermore, the best-performing variant we introduced outperforms the vanilla LoRA and full-parameter fine-tuning models, as confirmed by multiple automatic and human evaluations, and even surpasses previous state-of-the-art methods.

* NAACL 2024 Main & Long Conference Paper (Oral Presentation)

SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation

Mar 26, 2024

Dongqi Pu, Yifan Wang, Jia Loy, Vera Demberg

Abstract: Scientific news reports serve as a bridge, adeptly translating complex research articles into reports that resonate with the broader public. The automated generation of such narratives enhances the accessibility of scholarly insights. In this paper, we present a new corpus to facilitate this paradigm development. Our corpus comprises a parallel compilation of academic publications and their corresponding scientific news reports across nine disciplines. To demonstrate the utility and reliability of our dataset, we conduct an extensive analysis, highlighting the divergences in readability and brevity between scientific news narratives and academic manuscripts. We benchmark our dataset employing state-of-the-art text generation models. The evaluation process involves both automatic and human evaluation, which lays the groundwork for future explorations into the automated generation of scientific news reports. The dataset and code related to this work are available at https://dongqi.me/projects/SciNews.

* LREC-COLING 2024 Main Conference Paper

ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer

Jun 13, 2023

Dongqi Pu, Vera Demberg

Abstract: Large-scale language models, like ChatGPT, have garnered significant media attention and stunned the public with their remarkable capacity for generating coherent text from short natural language prompts. In this paper, we aim to conduct a systematic inspection of ChatGPT's performance in two controllable generation tasks, with respect to ChatGPT's ability to adapt its output to different target audiences (expert vs. layman) and writing styles (formal vs. informal). Additionally, we evaluate the faithfulness of the generated text, and compare the model's performance with human-authored texts. Our findings indicate that the stylistic variations produced by humans are considerably larger than those demonstrated by ChatGPT, and the generated texts diverge from human samples in several characteristics, such as the distribution of word types. Moreover, we observe that ChatGPT sometimes incorporates factual errors or hallucinations when adapting the text to suit a specific style.

* ACL-SRW 2023

Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization

May 26, 2023

Dongqi Pu, Yifan Wang, Vera Demberg

Abstract: For text summarization, the role of discourse structure is pivotal in discerning the core content of a text. Regrettably, prior studies on incorporating Rhetorical Structure Theory (RST) into transformer-based summarization models only consider the nuclearity annotation, thereby overlooking the variety of discourse relation types. This paper introduces the 'RSTformer', a novel summarization model that comprehensively incorporates both the types and uncertainty of rhetorical relations. Our RST-attention mechanism, rooted in document-level rhetorical structure, is an extension of the recently devised Longformer framework. Through rigorous evaluation, the model proposed herein exhibits significant superiority over state-of-the-art models, as evidenced by its notable performance on several automatic metrics and human evaluation.

* Accepted to ACL 2023 (Main conference)
