Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guohao Shen

Deep Distributional Learning with Non-crossing Quantile Network

Apr 11, 2025

Guohao Shen, Runpeng Dai, Guojun Wu, Shikai Luo, Chengchun Shi, Hongtu Zhu

Abstract:In this paper, we introduce a non-crossing quantile (NQ) network for conditional distribution learning. By leveraging non-negative activation functions, the NQ network ensures that the learned distributions remain monotonic, effectively addressing the issue of quantile crossing. Furthermore, the NQ network-based deep distributional learning framework is highly adaptable, applicable to a wide range of applications, from classical non-parametric quantile regression to more advanced tasks such as causal effect estimation and distributional reinforcement learning (RL). We also develop a comprehensive theoretical foundation for the deep NQ estimator and its application to distributional RL, providing an in-depth analysis that demonstrates its effectiveness across these domains. Our experimental results further highlight the robustness and versatility of the NQ network.

Via

Access Paper or Ask Questions

Conditional Stochastic Interpolation for Generative Learning

Dec 09, 2023

Ding Huang, Jian Huang, Ting Li, Guohao Shen

Figure 1 for Conditional Stochastic Interpolation for Generative Learning

Figure 2 for Conditional Stochastic Interpolation for Generative Learning

Figure 3 for Conditional Stochastic Interpolation for Generative Learning

Figure 4 for Conditional Stochastic Interpolation for Generative Learning

Abstract:We propose a conditional stochastic interpolation (CSI) approach to learning conditional distributions. CSI learns probability flow equations or stochastic differential equations that transport a reference distribution to the target conditional distribution. This is achieved by first learning the drift function and the conditional score function based on conditional stochastic interpolation, which are then used to construct a deterministic process governed by an ordinary differential equation or a diffusion process for conditional sampling. In our proposed CSI model, we incorporate an adaptive diffusion term to address the instability issues arising during the training process. We provide explicit forms of the conditional score function and the drift function in terms of conditional expectations under mild conditions, which naturally lead to an nonparametric regression approach to estimating these functions. Furthermore, we establish non-asymptotic error bounds for learning the target conditional distribution via conditional stochastic interpolation in terms of KL divergence, taking into account the neural network approximation error. We illustrate the application of CSI on image generation using a benchmark image dataset.

* 44 pages, 4 figures

Via

Access Paper or Ask Questions

Wasserstein Generative Regression

Jun 27, 2023

Shanshan Song, Tong Wang, Guohao Shen, Yuanyuan Lin, Jian Huang

Abstract:In this paper, we propose a new and unified approach for nonparametric regression and conditional distribution learning. Our approach simultaneously estimates a regression function and a conditional generator using a generative learning framework, where a conditional generator is a function that can generate samples from a conditional distribution. The main idea is to estimate a conditional generator that satisfies the constraint that it produces a good regression function estimator. We use deep neural networks to model the conditional generator. Our approach can handle problems with multivariate outcomes and covariates, and can be used to construct prediction intervals. We provide theoretical guarantees by deriving non-asymptotic error bounds and the distributional consistency of our approach under suitable assumptions. We also perform numerical experiments with simulated and real data to demonstrate the effectiveness and superiority of our approach over some existing approaches in various scenarios.

* 50 pages, including appendix. 5 figures and 6 tables in the main text. 1 figure and 7 tables in the appendix

Via

Access Paper or Ask Questions

Complexity of Feed-Forward Neural Networks from the Perspective of Functional Equivalence

May 19, 2023

Guohao Shen

Figure 1 for Complexity of Feed-Forward Neural Networks from the Perspective of Functional Equivalence

Figure 2 for Complexity of Feed-Forward Neural Networks from the Perspective of Functional Equivalence

Abstract:In this paper, we investigate the complexity of feed-forward neural networks by examining the concept of functional equivalence, which suggests that different network parameterizations can lead to the same function. We utilize the permutation invariance property to derive a novel covering number bound for the class of feedforward neural networks, which reveals that the complexity of a neural network can be reduced by exploiting this property. Furthermore, based on the symmetric structure of parameter space, we demonstrate that an appropriate strategy of random parameter initialization can increase the probability of convergence for optimization. We found that overparameterized networks tend to be easier to train in the sense that increasing the width of neural networks leads to a vanishing volume of the effective parameter space. Our findings offer new insights into overparameterization and have significant implications for understanding generalization and optimization in deep learning.

Via

Access Paper or Ask Questions

Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

May 14, 2023

Guohao Shen, Yuling Jiao, Yuanyuan Lin, Jian Huang

Figure 1 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Figure 2 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Figure 3 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Abstract:We study the properties of differentiable neural networks activated by rectified power unit (RePU) functions. We show that the partial derivatives of RePU neural networks can be represented by RePUs mixed-activated networks and derive upper bounds for the complexity of the function class of derivatives of RePUs networks. We establish error bounds for simultaneously approximating $C^s$ smooth functions and their derivatives using RePU-activated deep neural networks. Furthermore, we derive improved approximation error bounds when data has an approximate low-dimensional support, demonstrating the ability of RePU networks to mitigate the curse of dimensionality. To illustrate the usefulness of our results, we consider a deep score matching estimator (DSME) and propose a penalized deep isotonic regression (PDIR) using RePU networks. We establish non-asymptotic excess risk bounds for DSME and PDIR under the assumption that the target functions belong to a class of $C^s$ smooth functions. We also show that PDIR has a robustness property in the sense it is consistent with vanishing penalty parameters even when the monotonicity assumption is not satisfied. Furthermore, if the data distribution is supported on an approximate low-dimensional manifold, we show that DSME and PDIR can mitigate the curse of dimensionality.

* 66 pages, 20 figures, and 6 tables. arXiv admin note: text overlap with arXiv:2207.10442

Via

Access Paper or Ask Questions

Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Oct 18, 2022

Wenlu Tang, Guohao Shen, Yuanyuan Lin, Jian Huang

Figure 1 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 2 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 3 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 4 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Abstract:We propose a nonparametric quantile regression method using deep neural networks with a rectified linear unit penalty function to avoid quantile crossing. This penalty function is computationally feasible for enforcing non-crossing constraints in multi-dimensional nonparametric quantile regression. We establish non-asymptotic upper bounds for the excess risk of the proposed nonparametric quantile regression function estimators. Our error bounds achieve optimal minimax rate of convergence for the Holder class, and the prefactors of the error bounds depend polynomially on the dimension of the predictor, instead of exponentially. Based on the proposed non-crossing penalized deep quantile regression, we construct conformal prediction intervals that are fully adaptive to heterogeneity. The proposed prediction interval is shown to have good properties in terms of validity and accuracy under reasonable conditions. We also derive non-asymptotic upper bounds for the difference of the lengths between the proposed non-crossing conformal prediction interval and the theoretically oracle prediction interval. Numerical experiments including simulation studies and a real data example are conducted to demonstrate the effectiveness of the proposed method.

* 8 figures, 3 tables

Via

Access Paper or Ask Questions

Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Jul 21, 2022

Guohao Shen, Yuling Jiao, Yuanyuan Lin, Joel L. Horowitz, Jian Huang

Figure 1 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 2 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 3 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 4 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Abstract:We propose a penalized nonparametric approach to estimating the quantile regression process (QRP) in a nonseparable model using rectifier quadratic unit (ReQU) activated deep neural networks and introduce a novel penalty function to enforce non-crossing of quantile regression curves. We establish the non-asymptotic excess risk bounds for the estimated QRP and derive the mean integrated squared error for the estimated QRP under mild smoothness and regularity conditions. To establish these non-asymptotic risk and estimation error bounds, we also develop a new error bound for approximating $C^s$ smooth functions with $s >0$ and their derivatives using ReQU activated neural networks. This is a new approximation result for ReQU networks and is of independent interest and may be useful in other problems. Our numerical experiments demonstrate that the proposed method is competitive with or outperforms two existing methods, including methods using reproducing kernels and random forests, for nonparametric quantile regression.

* 44 pages, 10 figures, 6 tables

Via

Access Paper or Ask Questions

Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

May 01, 2021

Guohao Shen, Yuling Jiao, Yuanyuan Lin, Jian Huang

Figure 1 for Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Figure 2 for Non-asymptotic Excess Risk Bounds for Classification with Deep Convolutional Neural Networks

Abstract:In this paper, we consider the problem of binary classification with a class of general deep convolutional neural networks, which includes fully-connected neural networks and fully convolutional neural networks as special cases. We establish non-asymptotic excess risk bounds for a class of convex surrogate losses and target functions with different modulus of continuity. An important feature of our results is that we clearly define the prefactors of the risk bounds in terms of the input data dimension and other model parameters and show that they depend polynomially on the dimensionality in some important models. We also show that the classification methods with CNNs can circumvent the curse of dimensionality if the input data is supported on an approximate low-dimensional manifold. To establish these results, we derive an upper bound for the covering number for the class of general convolutional neural networks with a bias term in each convolutional layer, and derive new results on the approximation power of CNNs for any uniformly-continuous target functions. These results provide further insights into the complexity and the approximation power of general convolutional neural networks, which are of independent interest and may have other applications. Finally, we apply our general results to analyze the non-asymptotic excess risk bounds for four widely used methods with different loss functions using CNNs, including the least squares, the logistic, the exponential and the SVM hinge losses.

* Guohao Shen and Yuling Jiao contributed equally to this work. Co-corresponding authors: Yuanyuan Lin (Email: ylin@sta.cuhk.edu.hk) and Jian Huang (Email: jian-huang@uiowa.edu)

Via

Access Paper or Ask Questions