Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vahid Babaei

Uncertainty separation via ensemble quantile regression

Dec 18, 2024

Navid Ansari, Hans-Peter Seidel, Vahid Babaei

Figure 1 for Uncertainty separation via ensemble quantile regression

Figure 2 for Uncertainty separation via ensemble quantile regression

Figure 3 for Uncertainty separation via ensemble quantile regression

Figure 4 for Uncertainty separation via ensemble quantile regression

Abstract:This paper introduces a novel and scalable framework for uncertainty estimation and separation with applications in data driven modeling in science and engineering tasks where reliable uncertainty quantification is critical. Leveraging an ensemble of quantile regression (E-QR) models, our approach enhances aleatoric uncertainty estimation while preserving the quality of epistemic uncertainty, surpassing competing methods, such as Deep Ensembles (DE) and Monte Carlo (MC) dropout. To address challenges in separating uncertainty types, we propose an algorithm that iteratively improves separation through progressive sampling in regions of high uncertainty. Our framework is scalable to large datasets and demonstrates superior performance on synthetic benchmarks, offering a robust tool for uncertainty quantification in data-driven applications.

Via

Access Paper or Ask Questions

Two-Stage Pretraining for Molecular Property Prediction in the Wild

Nov 05, 2024

Kevin Tirta Wijaya, Minghao Guo, Michael Sun, Hans-Peter Seidel, Wojciech Matusik, Vahid Babaei

Figure 1 for Two-Stage Pretraining for Molecular Property Prediction in the Wild

Figure 2 for Two-Stage Pretraining for Molecular Property Prediction in the Wild

Figure 3 for Two-Stage Pretraining for Molecular Property Prediction in the Wild

Figure 4 for Two-Stage Pretraining for Molecular Property Prediction in the Wild

Abstract:Accurate property prediction is crucial for accelerating the discovery of new molecules. Although deep learning models have achieved remarkable success, their performance often relies on large amounts of labeled data that are expensive and time-consuming to obtain. Thus, there is a growing need for models that can perform well with limited experimentally-validated data. In this work, we introduce MoleVers, a versatile pretrained model designed for various types of molecular property prediction in the wild, i.e., where experimentally-validated molecular property labels are scarce. MoleVers adopts a two-stage pretraining strategy. In the first stage, the model learns molecular representations from large unlabeled datasets via masked atom prediction and dynamic denoising, a novel task enabled by a new branching encoder architecture. In the second stage, MoleVers is further pretrained using auxiliary labels obtained with inexpensive computational methods, enabling supervised learning without the need for costly experimental data. This two-stage framework allows MoleVers to learn representations that generalize effectively across various downstream datasets. We evaluate MoleVers on a new benchmark comprising 22 molecular datasets with diverse types of properties, the majority of which contain 50 or fewer training labels reflecting real-world conditions. MoleVers achieves state-of-the-art results on 20 out of the 22 datasets, and ranks second among the remaining two, highlighting its ability to bridge the gap between data-hungry models and real-world conditions where practically-useful labels are scarce.

Via

Access Paper or Ask Questions

TrustMol: Trustworthy Inverse Molecular Design via Alignment with Molecular Dynamics

Feb 26, 2024

Kevin Tirta Wijaya, Navid Ansari, Hans-Peter Seidel, Vahid Babaei

Figure 1 for TrustMol: Trustworthy Inverse Molecular Design via Alignment with Molecular Dynamics

Figure 2 for TrustMol: Trustworthy Inverse Molecular Design via Alignment with Molecular Dynamics

Figure 3 for TrustMol: Trustworthy Inverse Molecular Design via Alignment with Molecular Dynamics

Figure 4 for TrustMol: Trustworthy Inverse Molecular Design via Alignment with Molecular Dynamics

Abstract:Data-driven generation of molecules with desired properties, also known as inverse molecular design (IMD), has attracted significant attention in recent years. Despite the significant progress in the accuracy and diversity of solutions, existing IMD methods lag behind in terms of trustworthiness. The root issue is that the design process of these methods is increasingly more implicit and indirect, and this process is also isolated from the native forward process (NFP), the ground-truth function that models the molecular dynamics. Following this insight, we propose TrustMol, an IMD method built to be trustworthy. For this purpose, TrustMol relies on a set of technical novelties including a new variational autoencoder network. Moreover, we propose a latent-property pairs acquisition method to effectively navigate the complexities of molecular latent optimization, a process that seems intuitive yet challenging due to the high-frequency and discontinuous nature of molecule space. TrustMol also integrates uncertainty-awareness into molecular latent optimization. These lead to improvements in both explainability and reliability of the IMD process. We validate the trustworthiness of TrustMol through a wide range of experiments.

Via

Access Paper or Ask Questions

Large-Batch, Neural Multi-Objective Bayesian Optimization

Jun 12, 2023

Navid Ansari, Hans-Peter Seidel, Vahid Babaei

Figure 1 for Large-Batch, Neural Multi-Objective Bayesian Optimization

Figure 2 for Large-Batch, Neural Multi-Objective Bayesian Optimization

Figure 3 for Large-Batch, Neural Multi-Objective Bayesian Optimization

Figure 4 for Large-Batch, Neural Multi-Objective Bayesian Optimization

Abstract:Bayesian optimization provides a powerful framework for global optimization of black-box, expensive-to-evaluate functions. However, it has a limited capacity in handling data-intensive problems, especially in multi-objective settings, due to the poor scalability of default Gaussian Process surrogates. We present a novel Bayesian optimization framework specifically tailored to address these limitations. Our method leverages a Bayesian neural networks approach for surrogate modeling. This enables efficient handling of large batches of data, modeling complex problems, and generating the uncertainty of the predictions. In addition, our method incorporates a scalable, uncertainty-aware acquisition strategy based on the well-known, easy-to-deploy NSGA-II. This fully parallelizable strategy promotes efficient exploration of uncharted regions. Our framework allows for effective optimization in data-intensive environments with a minimum number of iterations. We demonstrate the superiority of our method by comparing it with state-of-the-art multi-objective optimizations. We perform our evaluation on two real-world problems - airfoil design and color printing - showcasing the applicability and efficiency of our approach. Code is available at: https://github.com/an-on-ym-ous/lbn_mobo

Via

Access Paper or Ask Questions

Autoinverse: Uncertainty Aware Inversion of Neural Networks

Aug 29, 2022

Navid Ansari, Hans-Peter Seidel, Nima Vahidi Ferdowsi, Vahid Babaei

Figure 1 for Autoinverse: Uncertainty Aware Inversion of Neural Networks

Figure 2 for Autoinverse: Uncertainty Aware Inversion of Neural Networks

Figure 3 for Autoinverse: Uncertainty Aware Inversion of Neural Networks

Figure 4 for Autoinverse: Uncertainty Aware Inversion of Neural Networks

Abstract:Neural networks are powerful surrogates for numerous forward processes. The inversion of such surrogates is extremely valuable in science and engineering. The most important property of a successful neural inverse method is the performance of its solutions when deployed in the real world, i.e., on the native forward process (and not only the learned surrogate). We propose Autoinverse, a highly automated approach for inverting neural network surrogates. Our main insight is to seek inverse solutions in the vicinity of reliable data which have been sampled form the forward process and used for training the surrogate model. Autoinverse finds such solutions by taking into account the predictive uncertainty of the surrogate and minimizing it during the inversion. Apart from high accuracy, Autoinverse enforces the feasibility of solutions, comes with embedded regularization, and is initialization free. We verify our proposed method through addressing a set of real-world problems in control, fabrication, and design.

Via

Access Paper or Ask Questions

Mixed Integer Neural Inverse Design

Sep 27, 2021

Navid Ansari, Hans-Peter Seidel, Vahid Babaei

Figure 1 for Mixed Integer Neural Inverse Design

Figure 2 for Mixed Integer Neural Inverse Design

Figure 3 for Mixed Integer Neural Inverse Design

Figure 4 for Mixed Integer Neural Inverse Design

Abstract:In computational design and fabrication, neural networks are becoming important surrogates for bulky forward simulations. A long-standing, intertwined question is that of inverse design: how to compute a design that satisfies a desired target performance? Here, we show that the piecewise linear property, very common in everyday neural networks, allows for an inverse design formulation based on mixed-integer linear programming. Our mixed-integer inverse design uncovers globally optimal or near optimal solutions in a principled manner. Furthermore, our method significantly facilitates emerging, but challenging, combinatorial inverse design tasks, such as material selection. For problems where finding the optimal solution is not desirable or tractable, we develop an efficient yet near-optimal hybrid optimization. Eventually, our method is able to find solutions provably robust to possible fabrication perturbations among multiple designs with similar performances.

Via

Access Paper or Ask Questions