Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Karger

Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes

Aug 09, 2023

Sharon Jiang, Shannon Shen, Monica Agrawal, Barbara Lam, Nicholas Kurtzman, Steven Horng, David Karger, David Sontag

Abstract:The large amount of time clinicians spend sifting through patient notes and documenting in electronic health records (EHRs) is a leading cause of clinician burnout. By proactively and dynamically retrieving relevant notes during the documentation process, we can reduce the effort required to find relevant patient history. In this work, we conceptualize the use of EHR audit logs for machine learning as a source of supervision of note relevance in a specific clinical context, at a particular point in time. Our evaluation focuses on the dynamic retrieval in the emergency department, a high acuity setting with unique patterns of information retrieval and note writing. We show that our methods can achieve an AUC of 0.963 for predicting which notes will be read in an individual note writing session. We additionally conduct a user study with several clinicians and find that our framework can help clinicians retrieve relevant information more efficiently. Demonstrating that our framework and methods can perform well in this demanding setting is a promising proof of concept that they will translate to other clinical settings and data modalities (e.g., labs, medications, imaging).

* To be published in Proceedings of Machine Learning Research Volume 219; accepted to the Machine Learning for Healthcare 2023 conference

Via

Access Paper or Ask Questions

Fast, Structured Clinical Documentation via Contextual Autocomplete

Jul 29, 2020

Divya Gopinath, Monica Agrawal, Luke Murray, Steven Horng, David Karger, David Sontag

Figure 1 for Fast, Structured Clinical Documentation via Contextual Autocomplete

Figure 2 for Fast, Structured Clinical Documentation via Contextual Autocomplete

Figure 3 for Fast, Structured Clinical Documentation via Contextual Autocomplete

Figure 4 for Fast, Structured Clinical Documentation via Contextual Autocomplete

Abstract:We present a system that uses a learned autocompletion mechanism to facilitate rapid creation of semi-structured clinical documentation. We dynamically suggest relevant clinical concepts as a doctor drafts a note by leveraging features from both unstructured and structured medical data. By constraining our architecture to shallow neural networks, we are able to make these suggestions in real time. Furthermore, as our algorithm is used to write a note, we can automatically annotate the documentation with clean labels of clinical concepts drawn from medical vocabularies, making notes more structured and readable for physicians, patients, and future algorithms. To our knowledge, this system is the only machine learning-based documentation utility for clinical notes deployed in a live hospital setting, and it reduces keystroke burden of clinical concepts by 67% in real environments.

* Published in Machine Learning for Healthcare 2020 conference

Via

Access Paper or Ask Questions

ARDA: Automatic Relational Data Augmentation for Machine Learning

Mar 21, 2020

Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David Karger

Figure 1 for ARDA: Automatic Relational Data Augmentation for Machine Learning

Figure 2 for ARDA: Automatic Relational Data Augmentation for Machine Learning

Figure 3 for ARDA: Automatic Relational Data Augmentation for Machine Learning

Figure 4 for ARDA: Automatic Relational Data Augmentation for Machine Learning

Abstract:Automatic machine learning (\AML) is a family of techniques to automate the process of training predictive models, aiming to both improve performance and make machine learning more accessible. While many recent works have focused on aspects of the machine learning pipeline like model selection, hyperparameter tuning, and feature selection, relatively few works have focused on automatic data augmentation. Automatic data augmentation involves finding new features relevant to the user's predictive task with minimal ``human-in-the-loop'' involvement. We present \system, an end-to-end system that takes as input a dataset and a data repository, and outputs an augmented data set such that training a predictive model on this augmented dataset results in improved performance. Our system has two distinct components: (1) a framework to search and join data with the input data, based on various attributes of the input, and (2) an efficient feature selection algorithm that prunes out noisy or irrelevant features from the resulting join. We perform an extensive empirical evaluation of different system components and benchmark our feature selection algorithm on real-world datasets.

Via

Access Paper or Ask Questions

Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Jul 15, 2017

Nicholas Harvey, Vahab Mirrokni, David Karger, Virginia Savova, Leonid Peshkin

Figure 1 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 2 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 3 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 4 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Abstract:This paper formulates a novel problem on graphs: find the minimal subset of edges in a fully connected graph, such that the resulting graph contains all spanning trees for a set of specifed sub-graphs. This formulation is motivated by an un-supervised grammar induction problem from computational linguistics. We present a reduction to some known problems and algorithms from graph theory, provide computational complexity results, and describe an approximation algorithm.

* 11 pages 4 figures

Via

Access Paper or Ask Questions