Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhengze Zhou

Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

Sep 16, 2022

Xiaojian Zhang, Xiang Yan, Zhengze Zhou, Yiming Xu, Xilei Zhao

Figure 1 for Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

Figure 2 for Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

Figure 3 for Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

Figure 4 for Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

Abstract:The growing significance of ridesourcing services in recent years suggests a need to examine the key determinants of ridesourcing demand. However, little is known regarding the nonlinear effects and spatial heterogeneity of ridesourcing demand determinants. This study applies an explainable-machine-learning-based analytical framework to identify the key factors that shape ridesourcing demand and to explore their nonlinear associations across various spatial contexts (airport, downtown, and neighborhood). We use the ridesourcing-trip data in Chicago for empirical analysis. The results reveal that the importance of built environment varies across spatial contexts, and it collectively contributes the largest importance in predicting ridesourcing demand for airport trips. Additionally, the nonlinear effects of built environment on ridesourcing demand show strong spatial variations. Ridesourcing demand is usually most responsive to the built environment changes for downtown trips, followed by neighborhood trips and airport trips. These findings offer transportation professionals nuanced insights for managing ridesourcing services.

Via

Access Paper or Ask Questions

S-LIME: Stabilized-LIME for Model Explanation

Jun 15, 2021

Zhengze Zhou, Giles Hooker, Fei Wang

Figure 1 for S-LIME: Stabilized-LIME for Model Explanation

Figure 2 for S-LIME: Stabilized-LIME for Model Explanation

Figure 3 for S-LIME: Stabilized-LIME for Model Explanation

Figure 4 for S-LIME: Stabilized-LIME for Model Explanation

Abstract:An increasing number of machine learning models have been deployed in domains with high stakes such as finance and healthcare. Despite their superior performances, many models are black boxes in nature which are hard to explain. There are growing efforts for researchers to develop methods to interpret these black-box models. Post hoc explanations based on perturbations, such as LIME, are widely used approaches to interpret a machine learning model after it has been built. This class of methods has been shown to exhibit large instability, posing serious challenges to the effectiveness of the method itself and harming user trust. In this paper, we propose S-LIME, which utilizes a hypothesis testing framework based on central limit theorem for determining the number of perturbation points needed to guarantee stability of the resulting explanation. Experiments on both simulated and real world data sets are provided to demonstrate the effectiveness of our method.

* In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '21), August 14--18, 2021, Virtual Event, Singapore

Via

Access Paper or Ask Questions

Asymptotic Normality and Variance Estimation For Supervised Ensembles

Dec 02, 2019

Zhengze Zhou, Lucas Mentch, Giles Hooker

Figure 1 for Asymptotic Normality and Variance Estimation For Supervised Ensembles

Figure 2 for Asymptotic Normality and Variance Estimation For Supervised Ensembles

Figure 3 for Asymptotic Normality and Variance Estimation For Supervised Ensembles

Abstract:Ensemble methods based on bootstrapping have improved the predictive accuracy of base learners, but fail to provide a framework in which formal statistical inference can be conducted. Recent theoretical developments suggest taking subsamples without replacement and analyze the resulting estimator in the context of a U-statistic, thus demonstrating asymptotic normality properties. However, we observe that current methods for variance estimation exhibit severe bias when the number of base learners is not large enough, compromising the validity of the resulting confidence intervals or hypothesis tests. This paper shows that similar asymptotics can be achieved by means of V-statistics, corresponding to taking subsamples with replacement. Further, we develop a bias correction algorithm for estimating variance in the limiting distribution, which yields satisfactory results with moderate size of base learners.

Via

Access Paper or Ask Questions

Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation

Oct 30, 2019

Xilei Zhao, Zhengze Zhou, Xiang Yan, Pascal Van Hentenryck

Figure 1 for Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation

Figure 2 for Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation

Figure 3 for Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation

Figure 4 for Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation

Abstract:Machine learning has proved to be very successful for making predictions in travel behavior modeling. However, most machine-learning models have complex model structures and offer little or no explanation as to how they arrive at these predictions. Interpretations about travel behavior models are essential for decision makers to understand travelers' preferences and plan policy interventions accordingly. Therefore, this paper proposes to apply and extend the model distillation approach, a model-agnostic machine-learning interpretation method, to explain how a black-box travel mode choice model makes predictions for the entire population and subpopulations of interest. Model distillation aims at compressing knowledge from a complex model (teacher) into an understandable and interpretable model (student). In particular, the paper integrates model distillation with market segmentation to generate more insights by accounting for heterogeneity. Furthermore, the paper provides a comprehensive comparison of student models with the benchmark model (decision tree) and the teacher model (gradient boosting trees) to quantify the fidelity and accuracy of the students' interpretations.

* 17 pages, 3 figures

Via

Access Paper or Ask Questions

Unbiased Measurement of Feature Importance in Tree-Based Methods

Mar 12, 2019

Zhengze Zhou, Giles Hooker

Figure 1 for Unbiased Measurement of Feature Importance in Tree-Based Methods

Figure 2 for Unbiased Measurement of Feature Importance in Tree-Based Methods

Figure 3 for Unbiased Measurement of Feature Importance in Tree-Based Methods

Figure 4 for Unbiased Measurement of Feature Importance in Tree-Based Methods

Abstract:We propose a modification that corrects for split-improvement variable importance measures in Random Forests and other tree-based methods. These methods have been shown to be biased towards increasing the importance of features with more potential splits. We show that by appropriately incorporating split-improvement as measured on out of sample data, this bias can be corrected yielding better summaries and screening tools.

Via

Access Paper or Ask Questions

Approximation Trees: Statistical Stability in Model Distillation

Aug 22, 2018

Yichen Zhou, Zhengze Zhou, Giles Hooker

Figure 1 for Approximation Trees: Statistical Stability in Model Distillation

Figure 2 for Approximation Trees: Statistical Stability in Model Distillation

Figure 3 for Approximation Trees: Statistical Stability in Model Distillation

Figure 4 for Approximation Trees: Statistical Stability in Model Distillation

Abstract:This paper examines the stability of learned explanations for black-box predictions via model distillation with decision trees. One approach to intelligibility in machine learning is to use an understandable `student' model to mimic the output of an accurate `teacher'. Here, we consider the use of regression trees as a student model, in which nodes of the tree can be used as `explanations' for particular predictions, and the whole structure of the tree can be used as a global representation of the resulting function. However, individual trees are sensitive to the particular data sets used to train them, and an interpretation of a student model may be suspect if small changes in the training data have a large effect on it. In this context, access to outcomes from a teacher helps to stabilize the greedy splitting strategy by generating a much larger corpus of training examples than was originally available. We develop tests to ensure that enough examples are generated at each split so that the same splitting rule would be chosen with high probability were the tree to be re trained. Further, we develop a stopping rule to indicate how deep the tree should be built based on recent results on the variability of Random Forests when these are used as the teacher. We provide concrete examples of these procedures on the CAD-MDD and COMPAS data sets.

* This paper supercedes arXiv:1610.09036

Via

Access Paper or Ask Questions