Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jennifer Liang

Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Dec 08, 2020

Angelo Ziletti, Christoph Berns, Oliver Treichel, Thomas Weber, Jennifer Liang, Stephanie Kammerath, Marion Schwaerzler, Jagatheswari Virayah, David Ruau, Xin Ma(+1 more)

Figure 1 for Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Figure 2 for Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Figure 3 for Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Figure 4 for Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Abstract:Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely, recurrent, and comprehensive analyses. Here, we propose a machine learning approach based on natural language processing and unsupervised learning to automatically discover key topics in real-world medical inquiries from customers. This approach does not require ontologies nor annotations. The discovered topics are meaningful and medically relevant, as judged by medical information specialists, thus demonstrating that unsolicited medical inquiries are a source of valuable customer insights. Our work paves the way for the machine-learning-driven analysis of medical inquiries in the pharmaceutical industry, which ultimately aims at improving patient care.

Via

Access Paper or Ask Questions

emrQA: A Large Corpus for Question Answering on Electronic Medical Records

Sep 03, 2018

Anusri Pampari, Preethi Raghavan, Jennifer Liang, Jian Peng

Figure 1 for emrQA: A Large Corpus for Question Answering on Electronic Medical Records

Figure 2 for emrQA: A Large Corpus for Question Answering on Electronic Medical Records

Figure 3 for emrQA: A Large Corpus for Question Answering on Electronic Medical Records

Figure 4 for emrQA: A Large Corpus for Question Answering on Electronic Medical Records

Abstract:We propose a novel methodology to generate domain-specific large-scale question answering (QA) datasets by re-purposing existing annotations for other NLP tasks. We demonstrate an instance of this methodology in generating a large-scale QA dataset for electronic medical records by leveraging existing expert annotations on clinical notes for various NLP tasks from the community shared i2b2 datasets. The resulting corpus (emrQA) has 1 million question-logical form and 400,000+ question-answer evidence pairs. We characterize the dataset and explore its learning potential by training baseline models for question to logical form and question to answer mapping.

* Accepted at Conference on Empirical Methods in Natural Language Processing (EMNLP) 2018

Via

Access Paper or Ask Questions