Chin-Wei Huang

Accurate and scalable exchange-correlation with deep learning

Jun 18, 2025

Giulia Luise, Chin-Wei Huang, Thijs Vogels, Derk P. Kooi, Sebastian Ehlert, Stephanie Lanius, Klaas J. H. Giesbertz, Amir Karton, Deniz Gunceler, Megan Stanley(+14 more)

Abstract: Density Functional Theory (DFT) is the most widely used electronic structure method for predicting the properties of molecules and materials. Although DFT is, in principle, an exact reformulation of the Schrödinger equation, practical applications rely on approximations to the unknown exchange-correlation (XC) functional. Most existing XC functionals are constructed using a limited set of increasingly complex, hand-crafted features that improve accuracy at the expense of computational efficiency. Yet, no current approximation achieves the accuracy and generality for predictive modeling of laboratory experiments at chemical accuracy, typically defined as errors below 1 kcal/mol. In this work, we present Skala, a modern deep learning-based XC functional that bypasses expensive hand-designed features by learning representations directly from data. Skala achieves chemical accuracy for atomization energies of small molecules while retaining the computational efficiency typical of semi-local DFT. This performance is enabled by training on an unprecedented volume of high-accuracy reference data generated using computationally intensive wavefunction-based methods. Notably, Skala systematically improves with additional training data covering diverse chemistry. By incorporating a modest amount of additional high-accuracy data tailored to chemistry beyond atomization energies, Skala achieves accuracy competitive with the best-performing hybrid functionals across general main group chemistry, at the cost of semi-local DFT. As the training dataset continues to expand, Skala is poised to further enhance the predictive power of first-principles simulations.

* Main: 13 pages plus references, 11 figures and tables. Supplementary information: 19 pages, 12 figures and tables. v2 update: fix rendering of figure 1 and part of figure 5 in Safari PDF viewer

LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges

Nov 16, 2024

Chin-Wei Huang, Mu-Yi Shen, Kuan-Chang Shih, Shih-Chih Lin, Chi-Yu Chen, Po-Chih Kuo

Abstract: Chest X-rays (CXRs) often display various diseases with disparate class frequencies, leading to a long-tailed, multi-label data distribution. In response to this challenge, we explore the Pruned MIMIC-CXR-LT dataset, a curated collection derived from the MIMIC-CXR dataset, specifically designed to represent a long-tailed and multi-label data scenario. We introduce LTCXNet, a novel framework that integrates the ConvNeXt model, ML-Decoder, and strategic data augmentation, further enhanced by an ensemble approach. We demonstrate that LTCXNet improves the performance of CXR interpretation across all classes, especially enhancing detection in rarer classes like 'Pneumoperitoneum' and 'Pneumomediastinum' by 79% and 48%, respectively. Beyond performance metrics, our research extends into evaluating fairness, highlighting that some methods, while improving model accuracy, could inadvertently affect fairness across different demographic groups negatively. This work contributes to advancing the understanding and management of long-tailed, multi-label data distributions in medical imaging, paving the way for more equitable and effective diagnostic tools.

* 8 pages, 5 figures

Two for One: Diffusion Models and Force Fields for Coarse-Grained Molecular Dynamics

Feb 01, 2023

Marloes Arts, Victor Garcia Satorras, Chin-Wei Huang, Daniel Zuegner, Marco Federici, Cecilia Clementi, Frank Noé, Robert Pinsler, Rianne van den Berg

Abstract: Coarse-grained (CG) molecular dynamics enables the study of biological processes at temporal and spatial scales that would be intractable at an atomistic resolution. However, accurately learning a CG force field remains a challenge. In this work, we leverage connections between score-based generative models, force fields and molecular dynamics to learn a CG force field without requiring any force inputs during training. Specifically, we train a diffusion generative model on protein structures from molecular dynamics simulations, and we show that its score function approximates a force field that can directly be used to simulate CG molecular dynamics. While having a vastly simplified training setup compared to previous work, we demonstrate that our approach leads to improved performance across several small- to medium-sized protein simulations, reproducing the CG equilibrium distribution, and preserving dynamics of all-atom simulations such as protein folding events.
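
The link between a score function and a force field can be illustrated on a toy problem. The sketch below is not the paper's model: it assumes a one-dimensional Gaussian equilibrium distribution whose score is known in closed form (standing in for a learned diffusion model), and it uses the Boltzmann relation p(x) ∝ exp(-U(x)/kT), under which the force is kT times the score. The names `KT`, `SIGMA`, `score`, `force`, and `simulate` are hypothetical.

```python
import math
import random

KT = 1.0      # thermal energy in arbitrary units (assumption)
SIGMA = 0.5   # std. dev. of the toy equilibrium distribution (assumption)

def score(x):
    """Score of N(0, SIGMA^2): d/dx log p(x) = -x / SIGMA^2."""
    return -x / SIGMA**2

def force(x):
    """Boltzmann relation p ~ exp(-U/kT) gives F = -dU/dx = kT * score(x)."""
    return KT * score(x)

def simulate(n_steps, dt=1e-3, seed=0):
    """Overdamped Langevin dynamics (Euler-Maruyama, unit friction)."""
    rng = random.Random(seed)
    x, samples = 0.0, []
    for _ in range(n_steps):
        noise = math.sqrt(2.0 * KT * dt) * rng.gauss(0.0, 1.0)
        x += dt * force(x) + noise
        samples.append(x)
    return samples

samples = simulate(200_000)
# Second moment of the trajectory; the mean of the target is zero.
variance = sum(s * s for s in samples) / len(samples)
print(f"sample variance {variance:.3f}, target {SIGMA**2:.3f}")
```

If the score-derived force is correct, the trajectory should sample the equilibrium distribution, so the printed variance should be close to `SIGMA**2` up to sampling and discretization error.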

Waveform Design for Optimal PSL Under Spectral and Unimodular Constraints via Alternating Minimization

Oct 16, 2022

Chin-Wei Huang, Li-Fu Chen, Borching Su

Abstract: In an active sensing system, waveforms with good auto-correlations are preferred for accurate parameter estimation. Furthermore, spectral compatibility is required to avoid mutual interference between devices as the electromagnetic environment becomes increasingly crowded. Waveforms should also be unimodular due to hardware limits. In this paper, a new approach to generating a unimodular sequence with an approximately optimal peak side-lobe level (PSL) in auto-correlation and adjustable stopband attenuation is proposed. The proposed method is based on alternating minimization (AM) and numerical results suggest that it outperforms existing methods in terms of PSL. We also develop a theoretical lower bound for the PSL minimization problem under spectral constraints and unimodular constraints, which can be used for the evaluation of the results in various works about this waveform design problem. It is observed in the numerical results that the PSL of the proposed algorithm is close to the derived lower bound.

* 11 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
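
As a rough illustration of the quantity being optimized, the sketch below computes the peak side-lobe level of a sequence's aperiodic auto-correlation and checks the unimodular constraint. It does not implement the alternating minimization method or the spectral constraint from the paper; the helper names are hypothetical, and the Barker-13 code is used only as a familiar test sequence.

```python
def autocorrelation(seq):
    """Aperiodic auto-correlation r[k] = sum_n seq[n + k] * conj(seq[n])."""
    n = len(seq)
    return [
        sum(seq[i + k] * complex(seq[i]).conjugate() for i in range(n - k))
        for k in range(n)
    ]

def peak_sidelobe_level(seq):
    """Largest auto-correlation magnitude over all nonzero lags."""
    r = autocorrelation(seq)
    return max(abs(v) for v in r[1:])

def is_unimodular(seq, tol=1e-12):
    """True if every entry has unit modulus."""
    return all(abs(abs(s) - 1.0) <= tol for s in seq)

# Barker-13 code: a classic unimodular (binary) sequence with low side-lobes.
barker13 = [1, 1, 1, 1, 1, -1, -1, 1, 1, -1, 1, -1, 1]
mainlobe = abs(autocorrelation(barker13)[0])
psl = peak_sidelobe_level(barker13)
print(f"mainlobe = {mainlobe}, PSL = {psl}, unimodular = {is_unimodular(barker13)}")
```

For the Barker-13 code this gives a PSL of 1 against a mainlobe of 13. A design method such as the one described above would search over unimodular sequences to push the PSL down while also meeting a stopband requirement.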

Riemannian Diffusion Models

Aug 16, 2022

Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, Aaron Courville

Abstract: Diffusion models are recent state-of-the-art methods for image generation and likelihood estimation. In this work, we generalize continuous-time diffusion models to arbitrary Riemannian manifolds and derive a variational framework for likelihood estimation. Computationally, we propose new methods for computing the Riemannian divergence, which is needed in the likelihood estimation. Moreover, in generalizing the Euclidean case, we prove that maximizing this variational lower bound is equivalent to Riemannian score matching. Empirically, we demonstrate the expressive power of Riemannian diffusion models on a wide spectrum of smooth manifolds, such as spheres, tori, hyperboloids, and orthogonal groups. Our proposed method achieves new state-of-the-art likelihoods on all benchmarks.

A Variational Perspective on Diffusion-Based Generative Models and Score Matching

Jun 05, 2021

Chin-Wei Huang, Jae Hyun Lim, Aaron Courville

Abstract: Discrete-time diffusion-based generative models and score matching methods have shown promising results in modeling high-dimensional image data. Recently, Song et al. (2021) show that diffusion processes that transform data into noise can be reversed via learning the score function, i.e., the gradient of the log-density of the perturbed data. They propose to plug the learned score function into an inverse formula to define a generative diffusion process. Despite the empirical success, a theoretical underpinning of this procedure is still lacking. In this work, we approach the (continuous-time) generative diffusion directly and derive a variational framework for likelihood estimation, which includes continuous-time normalizing flows as a special case, and can be seen as an infinitely deep variational autoencoder. Under this framework, we show that minimizing the score-matching loss is equivalent to maximizing a lower bound of the likelihood of the plug-in reverse SDE proposed by Song et al. (2021), bridging the theoretical gap.

Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization

Dec 10, 2020

Chin-Wei Huang, Ricky T. Q. Chen, Christos Tsirigotis, Aaron Courville

Abstract: Flow-based models are powerful tools for designing probabilistic models with tractable density. This paper introduces Convex Potential Flows (CP-Flow), a natural and efficient parameterization of invertible models inspired by optimal transport (OT) theory. CP-Flows are the gradient map of a strongly convex neural potential function. The convexity implies invertibility and allows us to resort to convex optimization to solve the convex conjugate for efficient inversion. To enable maximum likelihood training, we derive a new gradient estimator of the log-determinant of the Jacobian, which involves solving an inverse-Hessian vector product using the conjugate gradient method. The gradient estimator has constant-memory cost, and can be made effectively unbiased by reducing the error tolerance level of the convex optimization routine. Theoretically, we prove that CP-Flows are universal density approximators and are optimal in the OT sense. Our empirical results show that CP-Flow performs competitively on standard benchmarks of density estimation and variational inference.
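
The idea that the gradient of a strongly convex potential is invertible, and that the inverse can be found by convex optimization, can be sketched in one dimension. The example below is an illustration under stated assumptions, not the paper's architecture: the potential F(x) = softplus(x) + (ALPHA/2) x² is a hand-picked stand-in for a neural potential, Newton's method replaces the paper's optimization routine, and in one dimension the log-determinant reduces to log F''(x), so no gradient estimator is needed. All names are hypothetical.

```python
import math

ALPHA = 1.0  # strong-convexity constant of the toy potential (assumption)

def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def flow(x):
    """Gradient map F'(x) of F(x) = softplus(x) + 0.5 * ALPHA * x^2."""
    return sigmoid(x) + ALPHA * x

def hessian(x):
    """F''(x); bounded below by ALPHA, so the flow is strictly increasing."""
    s = sigmoid(x)
    return s * (1.0 - s) + ALPHA

def log_det_jacobian(x):
    """In 1D the Jacobian of the flow is F''(x)."""
    return math.log(hessian(x))

def inverse(y, n_iter=50):
    """Invert the flow by minimizing the convex objective F(x) - x * y.

    Its gradient is flow(x) - y and its Hessian is F''(x), so each
    Newton step is x <- x - (flow(x) - y) / F''(x).
    """
    x = 0.0
    for _ in range(n_iter):
        x -= (flow(x) - y) / hessian(x)
    return x

x0 = 2.0
y0 = flow(x0)
print(f"x = {x0}, flow(x) = {y0:.6f}, recovered x = {inverse(y0):.6f}")
```

The minimizer of F(x) - x·y is exactly the point where F'(x) = y, which is why solving this convex problem inverts the flow.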

RealCause: Realistic Causal Inference Benchmarking

Nov 30, 2020

Brady Neal, Chin-Wei Huang, Sunand Raghupathi

Abstract: There are many different causal effect estimators in causal inference. However, it is unclear how to choose between these estimators because there is no ground truth for causal effects. A commonly used option is to simulate synthetic data, where the ground truth is known. However, the best causal estimators on synthetic data are unlikely to be the best causal estimators on realistic data. An ideal benchmark for causal estimators would both (a) yield ground-truth values of the causal effects and (b) be representative of real data. Using flexible generative models, we provide a benchmark that both yields ground truth and is realistic. Using this benchmark, we evaluate 66 different causal estimators.

* Working paper

AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation

Jun 09, 2020

Jae Hyun Lim, Aaron Courville, Christopher Pal, Chin-Wei Huang

Abstract: Entropy is ubiquitous in machine learning, but it is in general intractable to compute the entropy of the distribution of an arbitrary continuous random variable. In this paper, we propose the amortized residual denoising autoencoder (AR-DAE) to approximate the gradient of the log density function, which can be used to estimate the gradient of entropy. Amortization allows us to significantly reduce the error of the gradient approximator by approaching asymptotic optimality of a regular DAE, in which case the estimation is in theory unbiased. We conduct theoretical and experimental analyses on the approximation error of the proposed method, as well as extensive studies on heuristics to ensure its robustness. Finally, using the proposed gradient approximator to estimate the gradient of entropy, we demonstrate state-of-the-art performance on density estimation with variational autoencoders and continuous control with soft actor-critic.

* Accepted at ICML 2020
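
The connection between a denoising autoencoder and the gradient of the log density can be checked in a case where everything is available in closed form. The sketch below is not AR-DAE itself: it assumes one-dimensional Gaussian data, for which the optimal denoiser is the posterior mean, and it uses the classical relation that the denoising residual divided by the noise variance approaches the score as the noise level shrinks. The names `DATA_STD`, `optimal_denoiser`, `residual_score`, and `true_score` are hypothetical.

```python
DATA_STD = 2.0  # std. dev. of the toy data distribution N(0, DATA_STD^2)

def optimal_denoiser(x_noisy, noise_std):
    """Posterior mean E[x | x_noisy] for Gaussian data and Gaussian noise."""
    shrink = DATA_STD**2 / (DATA_STD**2 + noise_std**2)
    return shrink * x_noisy

def residual_score(x, noise_std):
    """Score estimate from the denoising residual: (r(x) - x) / noise_std^2."""
    return (optimal_denoiser(x, noise_std) - x) / noise_std**2

def true_score(x):
    """d/dx log N(x; 0, DATA_STD^2) = -x / DATA_STD^2."""
    return -x / DATA_STD**2

x = 1.5
noise_levels = (1.0, 0.1, 0.01)
errors = [abs(residual_score(x, s) - true_score(x)) for s in noise_levels]
for s, err in zip(noise_levels, errors):
    print(f"noise_std = {s}: |estimate - true score| = {err:.2e}")
```

The error shrinks as the noise level decreases, which is the sense in which an optimal DAE yields an asymptotically unbiased estimate of the score in this toy setting.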

Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models

Feb 17, 2020

Chin-Wei Huang, Laurent Dinh, Aaron Courville

Abstract: In this work, we propose a new family of generative flows on an augmented data space, with an aim to improve expressivity without drastically increasing the computational cost of sampling and evaluation of a lower bound on the likelihood. Theoretically, we prove the proposed flow can approximate a Hamiltonian ODE as a universal transport map. Empirically, we demonstrate state-of-the-art performance on standard benchmarks of flow-based generative modeling.

* 27 pages, 12 figures
