Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuanyuan Lin

Thinking with Drafting: Optical Decompression via Logical Reconstruction

Feb 12, 2026

Jingxuan Wei, Honghao He, Caijun Jia, Siyuan Li, Zheng Sun, Yuhang Xu, Yuanyuan Lin, Linzhuang Sun, Yuchen Wu, Bihui Yu(+2 more)

Abstract:Existing multimodal large language models have achieved high-fidelity visual perception and exploratory visual generation. However, a precision paradox persists in complex reasoning tasks: optical perception systems transcribe symbols without capturing logical topology, while pixel-based generative models produce visual artifacts lacking mathematical exactness. To bridge this gap, we propose that reasoning over visual inputs be reconceptualized as optical decompression-the process of reconstructing latent logical structures from compressed visual tokens. Guided by the axiom that Parsing is Reasoning, we introduce Thinking with Drafting (TwD), which utilizes a minimalist Domain-Specific Language (DSL) as a grounding intermediate representation. Unlike standard approaches that hallucinate answers directly, TwD forces the model to draft its mental model into executable code, rendering deterministic visual proofs for self-verification. To validate this, we present VisAlg, a visual algebra benchmark. Experiments demonstrate that TwD serve as a superior cognitive scaffold. Our work establishes a closed-loop system where visual generation acts not as a creative output but as a logical verifier, offering a generalizable path for visual reasoning.

Via

Access Paper or Ask Questions

Distributionally Robust Online Markov Game with Linear Function Approximation

Nov 11, 2025

Zewu Zheng, Yuanyuan Lin

Figure 1 for Distributionally Robust Online Markov Game with Linear Function Approximation

Figure 2 for Distributionally Robust Online Markov Game with Linear Function Approximation

Figure 3 for Distributionally Robust Online Markov Game with Linear Function Approximation

Figure 4 for Distributionally Robust Online Markov Game with Linear Function Approximation

Abstract:The sim-to-real gap, where agents trained in a simulator face significant performance degradation during testing, is a fundamental challenge in reinforcement learning. Extansive works adopt the framework of distributionally robust RL, to learn a policy that acts robustly under worst case environment shift. Within this framework, our objective is to devise algorithms that are sample efficient with interactive data collection and large state spaces. By assuming d-rectangularity of environment dynamic shift, we identify a fundamental hardness result for learning in online Markov game, and address it by adopting minimum value assumption. Then, a novel least square value iteration type algorithm, DR-CCE-LSI, with exploration bonus devised specifically for multiple agents, is proposed to find an \episilon-approximate robust Coarse Correlated Equilibrium(CCE). To obtain sample efficient learning, we find that: when the feature mapping function satisfies certain properties, our algorithm, DR-CCE-LSI, is able to achieve ε-approximate CCE with a regret bound of O{dHmin{H,1/min{σ_i}}\sqrt{K}}, where K is the number of interacting episodes, H is the horizon length, d is the feature dimension, and \simga_i represents the uncertainty level of player i. Our work introduces the first sample-efficient algorithm for this setting, matches the best result so far in single agent setting, and achieves minimax optimalsample complexity in terms of the feature dimension d. Meanwhile, we also conduct simulation study to validate the efficacy of our algorithm in learning a robust equilibrium.

* To be published in the Proceedings of AAAI 2026

Via

Access Paper or Ask Questions

OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

Jun 07, 2024

Wei Zhou, Hong Huang, Guowen Zhang, Ruize Shi, Kehan Yin, Yuanyuan Lin, Bang Liu

Figure 1 for OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

Figure 2 for OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

Figure 3 for OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

Figure 4 for OCDB: Revisiting Causal Discovery with a Comprehensive Benchmark and Evaluation Framework

Abstract:Large language models (LLMs) have excelled in various natural language processing tasks, but challenges in interpretability and trustworthiness persist, limiting their use in high-stakes fields. Causal discovery offers a promising approach to improve transparency and reliability. However, current evaluations are often one-sided and lack assessments focused on interpretability performance. Additionally, these evaluations rely on synthetic data and lack comprehensive assessments of real-world datasets. These lead to promising methods potentially being overlooked. To address these issues, we propose a flexible evaluation framework with metrics for evaluating differences in causal structures and causal effects, which are crucial attributes that help improve the interpretability of LLMs. We introduce the Open Causal Discovery Benchmark (OCDB), based on real data, to promote fair comparisons and drive optimization of algorithms. Additionally, our new metrics account for undirected edges, enabling fair comparisons between Directed Acyclic Graphs (DAGs) and Completed Partially Directed Acyclic Graphs (CPDAGs). Experimental results show significant shortcomings in existing algorithms' generalization capabilities on real data, highlighting the potential for performance improvement and the importance of our framework in advancing causal discovery techniques.

Via

Access Paper or Ask Questions

Adaptive debiased SGD in high-dimensional GLMs with steaming data

May 28, 2024

Ruijian Han, Lan Luo, Yuanhang Luo, Yuanyuan Lin, Jian Huang

Figure 1 for Adaptive debiased SGD in high-dimensional GLMs with steaming data

Figure 2 for Adaptive debiased SGD in high-dimensional GLMs with steaming data

Figure 3 for Adaptive debiased SGD in high-dimensional GLMs with steaming data

Figure 4 for Adaptive debiased SGD in high-dimensional GLMs with steaming data

Abstract:Online statistical inference facilitates real-time analysis of sequentially collected data, making it different from traditional methods that rely on static datasets. This paper introduces a novel approach to online inference in high-dimensional generalized linear models, where we update regression coefficient estimates and their standard errors upon each new data arrival. In contrast to existing methods that either require full dataset access or large-dimensional summary statistics storage, our method operates in a single-pass mode, significantly reducing both time and space complexity. The core of our methodological innovation lies in an adaptive stochastic gradient descent algorithm tailored for dynamic objective functions, coupled with a novel online debiasing procedure. This allows us to maintain low-dimensional summary statistics while effectively controlling optimization errors introduced by the dynamically changing loss functions. We demonstrate that our method, termed the Approximated Debiased Lasso (ADL), not only mitigates the need for the bounded individual probability condition but also significantly improves numerical performance. Numerical experiments demonstrate that the proposed ADL method consistently exhibits robust performance across various covariance matrix structures.

* 37 pages, 4 figures

Via

Access Paper or Ask Questions

Efficient semi-supervised inference for logistic regression under case-control studies

Feb 23, 2024

Zhuojun Quan, Yuanyuan Lin, Kani Chen, Wen Yu

Figure 1 for Efficient semi-supervised inference for logistic regression under case-control studies

Figure 2 for Efficient semi-supervised inference for logistic regression under case-control studies

Figure 3 for Efficient semi-supervised inference for logistic regression under case-control studies

Figure 4 for Efficient semi-supervised inference for logistic regression under case-control studies

Abstract:Semi-supervised learning has received increasingly attention in statistics and machine learning. In semi-supervised learning settings, a labeled data set with both outcomes and covariates and an unlabeled data set with covariates only are collected. We consider an inference problem in semi-supervised settings where the outcome in the labeled data is binary and the labeled data is collected by case-control sampling. Case-control sampling is an effective sampling scheme for alleviating imbalance structure in binary data. Under the logistic model assumption, case-control data can still provide consistent estimator for the slope parameter of the regression model. However, the intercept parameter is not identifiable. Consequently, the marginal case proportion cannot be estimated from case-control data. We find out that with the availability of the unlabeled data, the intercept parameter can be identified in semi-supervised learning setting. We construct the likelihood function of the observed labeled and unlabeled data and obtain the maximum likelihood estimator via an iterative algorithm. The proposed estimator is shown to be consistent, asymptotically normal, and semiparametrically efficient. Extensive simulation studies are conducted to show the finite sample performance of the proposed method. The results imply that the unlabeled data not only helps to identify the intercept but also improves the estimation efficiency of the slope parameter. Meanwhile, the marginal case proportion can be estimated accurately by the proposed method.

Via

Access Paper or Ask Questions

Wasserstein Generative Regression

Jun 27, 2023

Shanshan Song, Tong Wang, Guohao Shen, Yuanyuan Lin, Jian Huang

Abstract:In this paper, we propose a new and unified approach for nonparametric regression and conditional distribution learning. Our approach simultaneously estimates a regression function and a conditional generator using a generative learning framework, where a conditional generator is a function that can generate samples from a conditional distribution. The main idea is to estimate a conditional generator that satisfies the constraint that it produces a good regression function estimator. We use deep neural networks to model the conditional generator. Our approach can handle problems with multivariate outcomes and covariates, and can be used to construct prediction intervals. We provide theoretical guarantees by deriving non-asymptotic error bounds and the distributional consistency of our approach under suitable assumptions. We also perform numerical experiments with simulated and real data to demonstrate the effectiveness and superiority of our approach over some existing approaches in various scenarios.

* 50 pages, including appendix. 5 figures and 6 tables in the main text. 1 figure and 7 tables in the appendix

Via

Access Paper or Ask Questions

Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

May 14, 2023

Guohao Shen, Yuling Jiao, Yuanyuan Lin, Jian Huang

Figure 1 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Figure 2 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Figure 3 for Differentiable Neural Networks with RePU Activation: with Applications to Score Estimation and Isotonic Regression

Abstract:We study the properties of differentiable neural networks activated by rectified power unit (RePU) functions. We show that the partial derivatives of RePU neural networks can be represented by RePUs mixed-activated networks and derive upper bounds for the complexity of the function class of derivatives of RePUs networks. We establish error bounds for simultaneously approximating $C^s$ smooth functions and their derivatives using RePU-activated deep neural networks. Furthermore, we derive improved approximation error bounds when data has an approximate low-dimensional support, demonstrating the ability of RePU networks to mitigate the curse of dimensionality. To illustrate the usefulness of our results, we consider a deep score matching estimator (DSME) and propose a penalized deep isotonic regression (PDIR) using RePU networks. We establish non-asymptotic excess risk bounds for DSME and PDIR under the assumption that the target functions belong to a class of $C^s$ smooth functions. We also show that PDIR has a robustness property in the sense it is consistent with vanishing penalty parameters even when the monotonicity assumption is not satisfied. Furthermore, if the data distribution is supported on an approximate low-dimensional manifold, we show that DSME and PDIR can mitigate the curse of dimensionality.

* 66 pages, 20 figures, and 6 tables. arXiv admin note: text overlap with arXiv:2207.10442

Via

Access Paper or Ask Questions

Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Oct 18, 2022

Wenlu Tang, Guohao Shen, Yuanyuan Lin, Jian Huang

Figure 1 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 2 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 3 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Figure 4 for Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction

Abstract:We propose a nonparametric quantile regression method using deep neural networks with a rectified linear unit penalty function to avoid quantile crossing. This penalty function is computationally feasible for enforcing non-crossing constraints in multi-dimensional nonparametric quantile regression. We establish non-asymptotic upper bounds for the excess risk of the proposed nonparametric quantile regression function estimators. Our error bounds achieve optimal minimax rate of convergence for the Holder class, and the prefactors of the error bounds depend polynomially on the dimension of the predictor, instead of exponentially. Based on the proposed non-crossing penalized deep quantile regression, we construct conformal prediction intervals that are fully adaptive to heterogeneity. The proposed prediction interval is shown to have good properties in terms of validity and accuracy under reasonable conditions. We also derive non-asymptotic upper bounds for the difference of the lengths between the proposed non-crossing conformal prediction interval and the theoretically oracle prediction interval. Numerical experiments including simulation studies and a real data example are conducted to demonstrate the effectiveness of the proposed method.

* 8 figures, 3 tables

Via

Access Paper or Ask Questions

Deep Sufficient Representation Learning via Mutual Information

Jul 21, 2022

Siming Zheng, Yuanyuan Lin, Jian Huang

Figure 1 for Deep Sufficient Representation Learning via Mutual Information

Figure 2 for Deep Sufficient Representation Learning via Mutual Information

Figure 3 for Deep Sufficient Representation Learning via Mutual Information

Figure 4 for Deep Sufficient Representation Learning via Mutual Information

Abstract:We propose a mutual information-based sufficient representation learning (MSRL) approach, which uses the variational formulation of the mutual information and leverages the approximation power of deep neural networks. MSRL learns a sufficient representation with the maximum mutual information with the response and a user-selected distribution. It can easily handle multi-dimensional continuous or categorical response variables. MSRL is shown to be consistent in the sense that the conditional probability density function of the response variable given the learned representation converges to the conditional probability density function of the response variable given the predictor. Non-asymptotic error bounds for MSRL are also established under suitable conditions. To establish the error bounds, we derive a generalized Dudley's inequality for an order-two U-process indexed by deep neural networks, which may be of independent interest. We discuss how to determine the intrinsic dimension of the underlying data distribution. Moreover, we evaluate the performance of MSRL via extensive numerical experiments and real data analysis and demonstrate that MSRL outperforms some existing nonlinear sufficient dimension reduction methods.

* 43 pages, 6 figures and 5 tables

Via

Access Paper or Ask Questions

Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Jul 21, 2022

Guohao Shen, Yuling Jiao, Yuanyuan Lin, Joel L. Horowitz, Jian Huang

Figure 1 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 2 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 3 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Figure 4 for Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks

Abstract:We propose a penalized nonparametric approach to estimating the quantile regression process (QRP) in a nonseparable model using rectifier quadratic unit (ReQU) activated deep neural networks and introduce a novel penalty function to enforce non-crossing of quantile regression curves. We establish the non-asymptotic excess risk bounds for the estimated QRP and derive the mean integrated squared error for the estimated QRP under mild smoothness and regularity conditions. To establish these non-asymptotic risk and estimation error bounds, we also develop a new error bound for approximating $C^s$ smooth functions with $s >0$ and their derivatives using ReQU activated neural networks. This is a new approximation result for ReQU networks and is of independent interest and may be useful in other problems. Our numerical experiments demonstrate that the proposed method is competitive with or outperforms two existing methods, including methods using reproducing kernels and random forests, for nonparametric quantile regression.

* 44 pages, 10 figures, 6 tables

Via

Access Paper or Ask Questions