Bingxuan Hou

Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method

Oct 22, 2024

Jiayi Lin, Chenyang Zhang, Haibo Tong, Dongyu Zhang, Qingqing Hong, Bingxuan Hou, Junli Wang

Abstract: Multi-Span Question Answering (MSQA) requires models to extract one or more answer spans from a given context to answer a question. Prior work mainly focuses on designing specific methods or applying heuristic strategies to encourage models to produce more correct predictions. However, these models are trained on gold answers and fail to consider incorrect predictions. Through a statistical analysis, we observe that stronger models do not produce fewer incorrect predictions than other models. In this work, we propose the Answering-Classifying-Correcting (ACC) framework, which employs a post-processing strategy to handle incorrect predictions. Specifically, the ACC framework first introduces a classifier to sort the predictions into three types and exclude "wrong predictions", then introduces a corrector to modify "partially correct predictions". Experiments on several MSQA datasets show that the ACC framework significantly improves Exact Match (EM) scores, and further analysis demonstrates that the ACC framework efficiently reduces the number of incorrect predictions, improving the quality of predictions.

* Accepted by EMNLP 2024 Findings
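
The classify-then-correct flow described in the abstract above can be pictured with a minimal sketch. Everything here is an illustrative assumption rather than the authors' implementation: the `Label` names (the abstract names only "wrong" and "partially correct" predictions, so the third type is assumed to be "correct"), the `acc_post_process` function, and the toy string-matching stand-ins for what would be trained classifier and corrector models in the paper.

```python
from enum import Enum
from typing import Callable, List


class Label(Enum):
    # Assumed names for the three prediction types; the third is a guess.
    CORRECT = "correct"
    PARTIAL = "partially correct"
    WRONG = "wrong"


def acc_post_process(
    question: str,
    context: str,
    predictions: List[str],
    classify: Callable[[str, str, str], Label],
    correct: Callable[[str, str, str], str],
) -> List[str]:
    """Post-process spans predicted by any MSQA model (hypothetical interface)."""
    final: List[str] = []
    for span in predictions:
        label = classify(question, context, span)
        if label is Label.WRONG:
            continue  # exclude wrong predictions
        if label is Label.PARTIAL:
            span = correct(question, context, span)  # repair the span
        if span and span not in final:
            final.append(span)  # keep order, drop duplicates
    return final


# Toy stand-ins so the sketch runs; real components would be learned models.
TOY_GOLD = ("Paris", "Lyon")


def toy_classify(question: str, context: str, span: str) -> Label:
    if span in TOY_GOLD:
        return Label.CORRECT
    if any(g in span or span in g for g in TOY_GOLD):
        return Label.PARTIAL
    return Label.WRONG


def toy_correct(question: str, context: str, span: str) -> str:
    for g in TOY_GOLD:
        if g in span or span in g:
            return g
    return span


result = acc_post_process(
    "Which French cities are mentioned?",
    "Paris and Lyon are cities in France; Berlin is in Germany.",
    ["Paris", "the city of Lyon", "Berlin"],
    toy_classify,
    toy_correct,
)
print(result)
```

Passing the classifier and corrector as callables keeps the post-processing step independent of whichever model produced the initial predictions, which matches the abstract's framing of ACC as a post-processing strategy.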

A Lightweight Multi Aspect Controlled Text Generation Solution For Large Language Models

Oct 18, 2024

Chenyang Zhang, Jiayi Lin, Haibo Tong, Bingxuan Hou, Dongyu Zhang, Jialin Li, Junli Wang

Abstract: Large language models (LLMs) show remarkable abilities with instruction tuning. However, they perform poorly when high-quality instruction tuning data for the target task is lacking. Multi-Aspect Controllable Text Generation (MCTG) is a representative task for this dilemma, where aspect datasets are usually biased and correlated. Existing work relies on additional model structures and strategies, limiting adaptability to LLMs. To activate the MCTG ability of LLMs, we propose a lightweight MCTG pipeline based on data augmentation. We analyze bias and correlations in traditional datasets, and address these concerns with augmented control attributes and sentences. The augmented datasets are suitable for instruction tuning. In our experiments, LLMs perform better in MCTG after data augmentation, with a 20% accuracy rise and fewer aspect correlations.
