Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pengshan Cai

PaniniQA: Enhancing Patient Education Through Interactive Question Answering

Aug 21, 2023

Pengshan Cai, Zonghai Yao, Fei Liu, Dakuo Wang, Meghan Reilly, Huixue Zhou, Lingxi Li, Yi Cao, Alok Kapoor, Adarsha Bajracharya(+2 more)

Abstract:Patient portal allows discharged patients to access their personalized discharge instructions in electronic health records (EHRs). However, many patients have difficulty understanding or memorizing their discharge instructions. In this paper, we present PaniniQA, a patient-centric interactive question answering system designed to help patients understand their discharge instructions. PaniniQA first identifies important clinical content from patients' discharge instructions and then formulates patient-specific educational questions. In addition, PaniniQA is also equipped with answer verification functionality to provide timely feedback to correct patients' misunderstandings. Our comprehensive automatic and human evaluation results demonstrate our PaniniQA is capable of improving patients' mastery of their medical instructions through effective interactions

* Accepted to TACL 2023. Equal contribution for the first two authors. This arXiv version is a pre-MIT Press publication version

Via

Access Paper or Ask Questions

Re2G: Retrieve, Rerank, Generate

Jul 13, 2022

Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Rajaram Naik, Pengshan Cai, Alfio Gliozzo

Figure 1 for Re2G: Retrieve, Rerank, Generate

Figure 2 for Re2G: Retrieve, Rerank, Generate

Figure 3 for Re2G: Retrieve, Rerank, Generate

Figure 4 for Re2G: Retrieve, Rerank, Generate

Abstract:As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART-based sequence-to-sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end-to-end, we introduce a novel variation of knowledge distillation to train the initial retrieval, reranker, and generation using only ground truth on the target sequence output. We find large gains in four diverse tasks: zero-shot slot filling, question answering, fact-checking, and dialog, with relative gains of 9% to 34% over the previous state-of-the-art on the KILT leaderboard. We make our code available as open source at https://github.com/IBM/kgi-slot-filling/tree/re2g.

* Accepted at NAACL 2022

Via

Access Paper or Ask Questions

Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

May 29, 2022

Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, Hong Yu, Sachindra Joshi

Figure 1 for Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

Figure 2 for Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

Figure 3 for Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

Figure 4 for Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

Abstract:We propose novel AI-empowered chat bots for learning as conversation where a user does not read a passage but gains information and knowledge through conversation with a teacher bot. Our information-acquisition-oriented dialogue system employs a novel adaptation of reinforced self-play so that the system can be transferred to various domains without in-domain dialogue data, and can carry out conversations both informative and attentive to users. Our extensive subjective and objective evaluations on three large public data corpora demonstrate the effectiveness of our system to deliver knowledge-intensive and attentive conversations and help end users substantially gain knowledge without reading passages. Our code and datasets are publicly available for follow-up research.

* 10 pages, accepted by NAACL 2022

Via

Access Paper or Ask Questions

Generating Classical Chinese Poems from Vernacular Chinese

Aug 31, 2019

Zhichao Yang, Pengshan Cai, Yansong Feng, Fei Li, Weijiang Feng, Elena Suet-Ying Chiu, Hong Yu

Figure 1 for Generating Classical Chinese Poems from Vernacular Chinese

Figure 2 for Generating Classical Chinese Poems from Vernacular Chinese

Figure 3 for Generating Classical Chinese Poems from Vernacular Chinese

Figure 4 for Generating Classical Chinese Poems from Vernacular Chinese

Abstract:Classical Chinese poetry is a jewel in the treasure house of Chinese culture. Previous poem generation models only allow users to employ keywords to interfere the meaning of generated poems, leaving the dominion of generation to the model. In this paper, we propose a novel task of generating classical Chinese poems from vernacular, which allows users to have more control over the semantic of generated poems. We adapt the approach of unsupervised machine translation (UMT) to our task. We use segmentation-based padding and reinforcement learning to address under-translation and over-translation respectively. According to experiments, our approach significantly improve the perplexity and BLEU compared with typical UMT models. Furthermore, we explored guidelines on how to write the input vernacular to generate better poems. Human evaluation showed our approach can generate high-quality poems which are comparable to amateur poems.

* Published in EMNLP 2019

Via

Access Paper or Ask Questions

Path-Based Attention Neural Model for Fine-Grained Entity Typing

Jan 09, 2018

Denghui Zhang, Pengshan Cai, Yantao Jia, Manling Li, Yuanzhuo Wang, Xueqi Cheng

Figure 1 for Path-Based Attention Neural Model for Fine-Grained Entity Typing

Figure 2 for Path-Based Attention Neural Model for Fine-Grained Entity Typing

Abstract:Fine-grained entity typing aims to assign entity mentions in the free text with types arranged in a hierarchical structure. Traditional distant supervision based methods employ a structured data source as a weak supervision and do not need hand-labeled data, but they neglect the label noise in the automatically labeled training corpus. Although recent studies use many features to prune wrong data ahead of training, they suffer from error propagation and bring much complexity. In this paper, we propose an end-to-end typing model, called the path-based attention neural model (PAN), to learn a noise- robust performance by leveraging the hierarchical structure of types. Experiments demonstrate its effectiveness.

* AAAI 2018

Via

Access Paper or Ask Questions