Courtney J. Diamond

Large Language Models for Granularized Barrett's Esophagus Diagnosis Classification

Aug 16, 2023

Jenna Kefeli, Ali Soroush, Courtney J. Diamond, Haley M. Zylberberg, Benjamin May, Julian A. Abrams, Chunhua Weng, Nicholas Tatonetti

Abstract: Diagnostic codes for Barrett's esophagus (BE), a precursor to esophageal cancer, lack the granularity and precision needed for many research and clinical use cases. Extracting key diagnostic phenotypes from BE pathology reports currently requires laborious manual chart review. We developed a generalizable transformer-based method to automate this data extraction. Using pathology reports from Columbia University Irving Medical Center with gastroenterologist-annotated targets, we performed binary dysplasia classification as well as granularized multi-class BE-related diagnosis classification. We used two clinically pre-trained large language models; the best model performed comparably to a highly tailored rule-based system developed on the same data. Binary dysplasia extraction achieved an F1-score of 0.964, and the multi-class model achieved an F1-score of 0.911. Our method is generalizable and faster to implement than a tailored rule-based approach.
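For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an F1-score can be computed for a binary task and averaged across classes for a multi-class task. It is illustrative only: the abstract does not state which averaging scheme was used for the multi-class result, so macro-averaging here is an assumption, and the labels and predictions are made-up toy values, not data from the paper.

```python
def f1_for_label(y_true, y_pred, label):
    """F1-score for one label, treating it as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-label F1-scores (an assumed averaging scheme)."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(f1_for_label(y_true, y_pred, lab) for lab in labels) / len(labels)


# Hypothetical binary example: 1 = dysplasia present, 0 = absent.
binary_true = [1, 1, 1, 0, 0, 1]
binary_pred = [1, 1, 0, 0, 1, 1]
binary_f1 = f1_for_label(binary_true, binary_pred, label=1)

# Hypothetical multi-class example with placeholder class names.
multi_true = ["a", "a", "b", "b", "c", "c"]
multi_pred = ["a", "b", "b", "b", "c", "a"]
multi_f1 = macro_f1(multi_true, multi_pred)

print(f"binary F1: {binary_f1:.3f}")
print(f"macro F1:  {multi_f1:.3f}")
```

Macro-averaging weights every class equally regardless of how often it occurs; a frequency-weighted or micro-averaged F1 would give different numbers on imbalanced data.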
