Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jimbo Wilson

The What-If Tool: Interactive Probing of Machine Learning Models

Jul 09, 2019

James Wexler, Mahima Pushkarna, Tolga Bolukbasi, Martin Wattenberg, Fernanda Viegas, Jimbo Wilson

Figure 1 for The What-If Tool: Interactive Probing of Machine Learning Models

Figure 2 for The What-If Tool: Interactive Probing of Machine Learning Models

Figure 3 for The What-If Tool: Interactive Probing of Machine Learning Models

Figure 4 for The What-If Tool: Interactive Probing of Machine Learning Models

Abstract:A key challenge in developing and deploying Machine Learning (ML) systems is understanding their performance across a wide range of inputs. To address this challenge, we created the What-If Tool, an open-source application that allows practitioners to probe, visualize, and analyze ML systems, with minimal coding. The What-If Tool lets practitioners test performance in hypothetical situations, analyze the importance of different data features, and visualize model behavior across multiple models and subsets of input data. It also lets practitioners measure systems according to multiple ML fairness metrics. We describe the design of the tool, and report on real-life usage at different organizations.

* 10 pages, 6 figures, 2 tables. To be presented at IEEE VAST 2019

Via

Access Paper or Ask Questions

Scalable and accurate deep learning for electronic health records

May 11, 2018

Alvin Rajkomar, Eyal Oren, Kai Chen, Andrew M. Dai, Nissan Hajaj, Peter J. Liu, Xiaobing Liu, Mimi Sun, Patrik Sundberg, Hector Yee(+24 more)

Figure 1 for Scalable and accurate deep learning for electronic health records

Abstract:Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient's record. We propose a representation of patients' entire, raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two U.S. academic medical centers with 216,221 adult patients hospitalized for at least 24 hours. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting in-hospital mortality (AUROC across sites 0.93-0.94), 30-day unplanned readmission (AUROC 0.75-0.76), prolonged length of stay (AUROC 0.85-0.86), and all of a patient's final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed state-of-the-art traditional predictive models in all cases. We also present a case-study of a neural-network attribution system, which illustrates how clinicians can gain some transparency into the predictions. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios, complete with explanations that directly highlight evidence in the patient's chart.

* npj Digital Medicine 1:18 (2018)
* Published version from https://www.nature.com/articles/s41746-018-0029-1

Via

Access Paper or Ask Questions

No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Nov 22, 2017

Shreya Shankar, Yoni Halpern, Eric Breck, James Atwood, Jimbo Wilson, D. Sculley

Figure 1 for No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Figure 2 for No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Figure 3 for No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Figure 4 for No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Abstract:Modern machine learning systems such as image classifiers rely heavily on large scale data sets for training. Such data sets are costly to create, thus in practice a small number of freely available, open source data sets are widely used. We suggest that examining the geo-diversity of open data sets is critical before adopting a data set for use cases in the developing world. We analyze two large, publicly available image data sets to assess geo-diversity and find that these data sets appear to exhibit an observable amerocentric and eurocentric representation bias. Further, we analyze classifiers trained on these data sets to assess the impact of these training distributions and find strong differences in the relative performance on images from different locales. These results emphasize the need to ensure geo-representation when constructing data sets for use in the developing world.

* Presented at NIPS 2017 Workshop on Machine Learning for the Developing World

Via

Access Paper or Ask Questions