Abstract: We conduct a scoping review of existing approaches for synthetic EHR data generation and benchmark major methods with our proposed open-source software to offer recommendations for practitioners. We search three academic databases for our scoping review. Methods are benchmarked on the open-source EHR datasets MIMIC-III/IV. Seven existing methods covering major categories and two baseline methods are implemented and compared. Evaluation metrics concern data fidelity, downstream utility, privacy protection, and computational cost. Forty-two studies are identified and classified into five categories. Seven open-source methods covering all categories are selected, trained on MIMIC-III, and evaluated on MIMIC-III or MIMIC-IV to assess transportability. Among them, GAN-based methods demonstrate competitive performance in fidelity and utility on MIMIC-III, while rule-based methods excel in privacy protection. Similar findings are observed on MIMIC-IV, except that GAN-based methods further outperform the baseline methods in preserving fidelity. A Python package, ``SynthEHRella'', is provided to integrate various approaches and evaluation metrics, enabling more streamlined exploration and evaluation of multiple methods. We find that the choice of method is governed by the relative importance of the evaluation metrics in the downstream use case, and we provide a decision tree to guide the choice among the benchmarked methods. Based on the decision tree, GAN-based methods excel when distributional shifts exist between the training and testing populations; otherwise, CorGAN and MedGAN are most suitable for association modeling and predictive modeling, respectively. Future research should prioritize enhancing the fidelity of synthetic data while controlling privacy exposure, and comprehensively benchmarking longitudinal and conditional generation methods.
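For concreteness, the snippet below sketches one representative fidelity check for binary code matrices, the dimension-wise prevalence correlation between real and synthetic data. It is an illustrative example of the kind of metric used in such benchmarks, not the SynthEHRella interface.

```python
# Illustrative fidelity check for binary EHR matrices: dimension-wise
# prevalence of each medical code in real vs. synthetic data.
# This is a sketch, not the SynthEHRella API.
import numpy as np

def dimension_wise_prevalence(real: np.ndarray, synthetic: np.ndarray) -> float:
    """Pearson correlation between per-code prevalences.

    Both inputs are (n_patients, n_codes) binary matrices.
    """
    real_prev = real.mean(axis=0)        # prevalence of each code in real data
    synth_prev = synthetic.mean(axis=0)  # prevalence of each code in synthetic data
    return float(np.corrcoef(real_prev, synth_prev)[0, 1])

# Toy usage with random binary matrices standing in for MIMIC-derived data.
rng = np.random.default_rng(0)
real = (rng.random((1000, 50)) < 0.1).astype(int)
synthetic = (rng.random((1000, 50)) < 0.1).astype(int)
print(dimension_wise_prevalence(real, synthetic))
```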
Abstract: Mediation analysis learns the causal effect transmitted via mediator variables between treatments and outcomes and has received increasing attention in various scientific domains for elucidating causal relations. Most existing works focus on point-exposure studies where each subject receives only one treatment at a single time point. However, there are a number of applications (e.g., mobile health) where the treatments are sequentially assigned over time and the dynamic mediation effects are of primary interest. We propose a reinforcement learning (RL) framework and are the first to evaluate dynamic mediation effects in settings with infinite horizons. We decompose the average treatment effect into an immediate direct effect, an immediate mediation effect, a delayed direct effect, and a delayed mediation effect. Upon identifying each effect component, we further develop robust and semi-parametrically efficient estimators under the RL framework to infer these causal effects. The superior performance of the proposed method is demonstrated through extensive numerical studies, theoretical results, and an analysis of a mobile health dataset.
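In schematic notation (the symbols below are illustrative shorthand; each component is defined formally in the paper), the decomposition of the average treatment effect at time t reads

\[
\mathrm{ATE}(t) \;=\; \underbrace{\mathrm{IDE}(t)}_{\text{immediate direct}} \;+\; \underbrace{\mathrm{IME}(t)}_{\text{immediate mediation}} \;+\; \underbrace{\mathrm{DDE}(t)}_{\text{delayed direct}} \;+\; \underbrace{\mathrm{DME}(t)}_{\text{delayed mediation}}.
\]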
Abstract: This paper studies reinforcement learning (RL) in doubly inhomogeneous environments under temporal non-stationarity and subject heterogeneity. In a number of applications, it is commonplace to encounter datasets generated by system dynamics that may change over time and across populations, challenging high-quality sequential decision making. Nonetheless, most existing RL solutions require either temporal stationarity or subject homogeneity, and would yield sub-optimal policies if both assumptions were violated. To address both challenges simultaneously, we propose an original algorithm that alternates between most recent change point detection and cluster identification to determine the ``best data chunks'', which display similar dynamics over time and across individuals, for policy learning. Our method is general and works with a wide range of clustering and change point detection algorithms. It is multiply robust in the sense that it takes multiple initial estimators as input and requires only one of them to be consistent. Moreover, by borrowing information over time and across the population, it can detect weaker signals and has better convergence properties than applying the clustering algorithm at each time point or the change point detection algorithm to each subject separately. Empirically, we demonstrate the usefulness of our method through extensive simulations and a real data application.
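A minimal sketch of the alternating scheme is given below; `detect_change_point` and `cluster_subjects` are hypothetical placeholders for whichever change point detection and clustering routines the user supplies, and the data layout is assumed for illustration only.

```python
# Sketch of the alternating procedure: given an array of trajectories with
# shape (n_subjects, n_timepoints, ...), iterate between (1) detecting the
# most recent change point within each current cluster and (2) re-clustering
# subjects on their post-change data. `detect_change_point` and
# `cluster_subjects` are placeholders for user-chosen estimators;
# `cluster_subjects` is assumed to return a list of cluster labels.
def best_data_chunks(trajectories, detect_change_point, cluster_subjects,
                     max_iter=10):
    n_subjects = trajectories.shape[0]
    clusters = [0] * n_subjects        # initial guess: a single cluster
    change_points = [0] * n_subjects   # initial guess: no change point
    for _ in range(max_iter):
        # Step 1: most recent change point within each cluster.
        new_change_points = list(change_points)
        for c in set(clusters):
            members = [i for i, k in enumerate(clusters) if k == c]
            cp = detect_change_point(trajectories[members])
            for i in members:
                new_change_points[i] = cp
        # Step 2: re-cluster subjects using only post-change data.
        new_clusters = cluster_subjects(
            [trajectories[i, cp:] for i, cp in enumerate(new_change_points)]
        )
        if new_clusters == clusters and new_change_points == change_points:
            break                      # converged
        clusters, change_points = new_clusters, new_change_points
    return clusters, change_points
```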
Abstract: In medicine, researchers often seek to infer the effects of a given treatment on patients' outcomes. However, standard methods for causal survival analysis make simplistic assumptions about the data-generating process and cannot capture complex interactions among patient covariates. We introduce the Dynamic Survival Transformer (DynST), a deep survival model trained on electronic health records (EHRs). Unlike previous transformers used in survival analysis, DynST can make use of time-varying information to predict evolving survival probabilities. We derive a semi-synthetic EHR dataset from MIMIC-III to show that DynST can accurately estimate the causal effect of a treatment intervention on restricted mean survival time (RMST). We demonstrate that DynST achieves better predictive and causal estimation performance than two alternative models.
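For reference, restricted mean survival time is the area under the survival curve up to a horizon tau; the short sketch below computes it from a predicted survival curve and is independent of the DynST architecture.

```python
import numpy as np

def rmst(times: np.ndarray, survival: np.ndarray, tau: float) -> float:
    """Restricted mean survival time: area under S(t) from 0 to tau,
    approximated by the trapezoidal rule over a predicted survival curve."""
    mask = times <= tau
    t = np.concatenate(([0.0], times[mask], [tau]))
    s = np.concatenate(([1.0], survival[mask], [survival[mask][-1]]))
    return float(np.sum(0.5 * (s[1:] + s[:-1]) * np.diff(t)))

# Toy usage: an exponential survival curve evaluated on a time grid.
times = np.linspace(0.5, 30.0, 60)
survival = np.exp(-0.05 * times)
print(rmst(times, survival, tau=20.0))
```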
Abstract: We consider reinforcement learning (RL) methods in offline nonstationary environments. Many existing RL algorithms in the literature rely on the stationarity assumption, which requires the system transition and reward functions to be constant over time. However, this assumption is restrictive in practice and is likely to be violated in a number of applications, including traffic signal control, robotics, and mobile health. In this paper, we develop a consistent procedure to test for nonstationarity of the optimal policy based on pre-collected historical data, without additional online data collection. Building on the proposed test, we further develop a sequential change point detection method that can be naturally coupled with existing state-of-the-art RL methods for policy optimisation in nonstationary environments. The usefulness of our method is illustrated by theoretical results, simulation studies, and a real data example from the 2018 Intern Health Study. A Python implementation of the proposed procedure is available at https://github.com/limengbinggz/CUSUM-RL
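To illustrate how such a detection step can be coupled with offline policy learning, the sketch below scans candidate change points from most recent to oldest and fits a policy on the longest recent segment for which stationarity is not rejected. `test_stationarity` and `fit_policy` are hypothetical placeholders and do not reflect the released CUSUM-RL interface.

```python
# Sketch: keep the longest recent data segment on which the stationarity test
# does not reject, then learn a policy on that segment only.
# `test_stationarity` and `fit_policy` are illustrative placeholders.
def learn_on_stationary_segment(data, candidate_change_points,
                                test_stationarity, fit_policy, alpha=0.05):
    candidates = sorted(candidate_change_points, reverse=True)
    change_point = candidates[0]
    for u in candidates:
        # p-value for the null that the data after time u are stationary
        if test_stationarity(data, start=u) >= alpha:
            change_point = u           # extend the segment further back
        else:
            break                      # nonstationarity detected; stop
    policy = fit_policy(data, start=change_point)
    return policy, change_point
```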
Abstract: Attention mechanisms take an expectation of a data representation with respect to probability weights, creating summary statistics that focus on important features. Recently, Martins et al. (2020, 2021) proposed continuous attention mechanisms, focusing on unimodal attention densities from the exponential and deformed exponential families; the latter has sparse support. Farinhas et al. (2021) extended this to Gaussian mixture attention densities, a flexible class with dense support. In this paper, we extend continuous attention to two general, flexible classes: kernel exponential families and our new sparse counterpart, kernel deformed exponential families. Theoretically, we show new existence results for both kernel exponential and kernel deformed exponential families, and show that the deformed case has approximation capabilities similar to those of kernel exponential families. Experiments show that kernel deformed exponential families can attend to multiple compact regions of the data domain.
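In the continuous setting, a density p over the data domain plays the role of the usual softmax weights, and the context is the corresponding expectation (notation illustrative):

\[
c \;=\; \mathbb{E}_{t \sim p}\!\left[ V(t) \right] \;=\; \int p(t)\, V(t)\, \mathrm{d}t,
\]

where p is taken from an exponential family, a deformed exponential family, a Gaussian mixture, or, in this work, their kernel counterparts.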
Abstract: Panel count data are recurrent event data in which counts of events are observed at discrete time points. Panel counts naturally describe self-reported behavioral data, where missing or unreliable reports are common. Unfortunately, no prior work has tackled the problem of missingness in this setting. We address this gap in the literature by developing a novel functional EM algorithm that can be used as a wrapper around several popular panel count mean function inference methods when some counts are missing. We provide a novel theoretical analysis of our method showing strong consistency. Extending the methods of Balakrishnan et al. (2017) and Wu et al. (2016), we show that the functional EM algorithm recovers the true mean function of the counting process. We accomplish this by developing alternative regularity conditions for our objective function in order to show convergence of the population EM algorithm. We prove strong consistency of the M-step, thus giving strong consistency guarantees for the finite-sample EM algorithm. We present experimental results for synthetic data, synthetic missingness on real data, and a smoking cessation study, where we find that participants may underestimate cigarettes smoked by approximately 18.6% over a 12-day period.
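A minimal sketch of the wrapper idea follows; `fit_mean_function` stands in for any supported panel count mean function estimator (assumed to return a callable estimate), and the simple plug-in E-step shown is an illustrative placeholder rather than the paper's exact update.

```python
# Sketch of a functional EM wrapper for panel counts with missing reports.
# `fit_mean_function` is a placeholder for any panel count mean function
# estimator that returns a callable t -> estimated mean count at time t.
def functional_em(observation_times, counts, missing_mask, fit_mean_function,
                  n_iter=20):
    # counts[i][j] is the count reported by subject i at observation time j;
    # entries flagged in missing_mask are unobserved and will be imputed.
    imputed = [list(c) for c in counts]
    mean_fn = fit_mean_function(observation_times, imputed)   # initial fit
    for _ in range(n_iter):
        # E-step: replace missing counts by their expectation under the
        # current mean function estimate.
        for i, times in enumerate(observation_times):
            for j, t in enumerate(times):
                if missing_mask[i][j]:
                    imputed[i][j] = mean_fn(t)
        # M-step: refit the mean function on the completed data.
        mean_fn = fit_mean_function(observation_times, imputed)
    return mean_fn
```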
Abstract: Multi-output regression seeks to infer multiple latent functions using data from multiple groups/sources while accounting for potential between-group similarities. In this paper, we consider multi-output regression under a weakly supervised setting in which a subset of data points from multiple groups is unlabeled. We use dependent Gaussian processes for the multiple outputs, constructed by convolutions with shared latent processes. We introduce hyperpriors for the multinomial probabilities of the unobserved labels and optimize the hyperparameters, which we show improves estimation. We derive two variational bounds: (i) a modified variational bound for fast and stable convergence in model inference, and (ii) a scalable variational bound that is amenable to stochastic optimization. Experiments on synthetic and real-world data show that the proposed model outperforms state-of-the-art alternatives, with more accurate estimation of the multiple latent functions and unobserved labels.
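Schematically (notation illustrative), the convolution construction of the dependent outputs can be written as

\[
f_d(\mathbf{x}) \;=\; \sum_{q=1}^{Q} \int G_{d,q}(\mathbf{x}-\mathbf{z})\, u_q(\mathbf{z})\, \mathrm{d}\mathbf{z},
\]

where the u_q are shared latent Gaussian processes and the G_{d,q} are output-specific smoothing kernels, so that outputs driven by the same latent processes become correlated.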