Title: EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models

Jul 05, 2023

Michael Wornow, Rahul Thapa, Ethan Steinberg, Jason Fries, Nigam Shah

Abstract: While the general machine learning (ML) community has benefited from public datasets, tasks, and models, the progress of ML in healthcare has been hampered by a lack of such shared assets. The success of foundation models creates new challenges for healthcare ML by requiring access to shared pretrained models to validate performance benefits. We help address these challenges through three contributions. First, we publish a new dataset, EHRSHOT, containing de-identified structured data from the electronic health records (EHRs) of 6,712 patients from Stanford Medicine. Unlike MIMIC-III/IV and other popular EHR datasets, EHRSHOT is longitudinal and not restricted to ICU/ED patients. Second, we publish the weights of a 141M-parameter clinical foundation model pretrained on the structured EHR data of 2.57M patients. We are one of the first to fully release such a model for coded EHR data; in contrast, most prior models released for clinical data (e.g., GatorTron, ClinicalBERT) only work with unstructured text and cannot process the rich, structured data within an EHR. We provide an end-to-end pipeline for the community to validate and build upon its performance. Third, we define 15 few-shot clinical prediction tasks, enabling evaluation of foundation models on benefits such as sample efficiency and task adaptation. The code to reproduce our results, as well as the model and dataset (via a research data use agreement), are available at our GitHub repo: https://github.com/som-shahlab/ehrshot-benchmark
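
To make "few-shot evaluation" concrete, here is a minimal Python sketch of one common setup: draw k positive and k negative training examples, fit a small classifier on them, and report AUROC on the remaining patients. This is an illustration only and is not taken from the EHRSHOT code. The function names (`sample_k_shot`, `auroc`, `centroid_scores`), the nearest-centroid classifier, and the synthetic one-dimensional features are all assumptions made for this example; the abstract does not specify the benchmark's sampling protocol or model heads.

```python
import random


def sample_k_shot(labels, k, seed=0):
    """Pick k positive and k negative example indices without replacement."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    if len(pos) < k or len(neg) < k:
        raise ValueError("not enough examples of each class for k-shot sampling")
    return rng.sample(pos, k) + rng.sample(neg, k)


def auroc(scores, labels):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def centroid_scores(train_x, train_y, test_x):
    """Hypothetical stand-in classifier: score each test point by
    (distance to negative centroid) - (distance to positive centroid)."""
    dim = len(train_x[0])

    def centroid(cls):
        rows = [x for x, y in zip(train_x, train_y) if y == cls]
        return [sum(r[d] for r in rows) / len(rows) for d in range(dim)]

    def dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b)) ** 0.5

    c_pos, c_neg = centroid(1), centroid(0)
    return [dist(x, c_neg) - dist(x, c_pos) for x in test_x]


# Synthetic data (an assumption for this demo, not EHRSHOT data):
# positives are centered at +1, negatives at -1.
data_rng = random.Random(0)
labels = [1] * 50 + [0] * 50
features = [[data_rng.gauss(1.0 if y == 1 else -1.0, 1.0)] for y in labels]

for k in (1, 4, 16):
    train_idx = set(sample_k_shot(labels, k, seed=k))
    test_idx = [i for i in range(len(labels)) if i not in train_idx]
    scores = centroid_scores(
        [features[i] for i in train_idx],
        [labels[i] for i in train_idx],
        [features[i] for i in test_idx],
    )
    print(f"k={k:2d}  AUROC={auroc(scores, [labels[i] for i in test_idx]):.3f}")
```

Sweeping k in this way is what lets a benchmark compare sample efficiency: a model whose representations separate the classes well should reach a high AUROC at small k.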
