Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Nov 15, 2022

Grace Yang, Ming Cao, Lavender Y. Jiang, Xujin C. Liu, Alexander T. M. Cheung, Hannah Weiss, David Kurland, Kyunghyun Cho, Eric K. Oermann

Figure 1 for Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Figure 2 for Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Figure 3 for Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Figure 4 for Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Share this with someone who'll enjoy it:

Abstract:Traditional evaluation metrics for classification in natural language processing such as accuracy and area under the curve fail to differentiate between models with different predictive behaviors despite their similar performance metrics. We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level to provide insights into disparities in their decision-making logic. We assess the sensitivity score on a set of representative words in the test set using two classifiers trained for hospital readmission classification with similar performance statistics. Our experiments compare the decision-making logic of clinicians and classifiers based on rank correlations of sensitivity scores. The results indicate that the language model's sensitivity score aligns better with the professionals than the xgboost classifier on tf-idf embeddings, which suggests that xgboost uses some spurious features. Overall, this metric offers a novel perspective on assessing models' robustness by quantifying their discrepancy with professional opinions. Our code is available on GitHub (https://github.com/nyuolab/Model_Sensitivity).

* Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 13 pages

View paper on

Share this with someone who'll enjoy it:

Title:Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Paper and Code