Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Jan 07, 2022

Sungrim Moon, Huan He, Hongfang Liu, Jungwei W. Fan

Figure 1 for RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Figure 2 for RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Figure 3 for RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Figure 4 for RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Share this with someone who'll enjoy it:

Abstract:Objectives Create a dataset for the development and evaluation of clinical question-answering (QA) systems that can handle multi-answer questions. Materials and Methods We leveraged the annotated relations from the 2018 National NLP Clinical Challenges (n2c2) corpus to generate a QA dataset. The 1-to-0 and 1-to-N drug-reason relations formed the unanswerable and multi-answer entries, which represent challenging scenarios lacking in the existing clinical QA datasets. Results The result RxWhyQA dataset contains 91,440 QA entries, of which half are unanswerable, and 21% (n=19,269) of the answerable ones require multiple answers. The dataset conforms to the community-vetted Stanford Question Answering Dataset (SQuAD) format. Discussion The RxWhyQA is useful for comparing different systems that need to handle the zero- and multi-answer challenges, demanding dual mitigation of both false positive and false negative answers. Conclusion We created and shared a clinical QA dataset with a focus on multi-answer questions to represent real-world scenarios.

* 2 tables, 3 figures

View paper on

Share this with someone who'll enjoy it:

Title:RxWhyQA: a clinical question-answering dataset with the challenge of multi-answer questions

Paper and Code