Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nathan Urban

Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Mar 01, 2022

A N M Nafiz Abeer, Nathan Urban, M Ryan Weil, Francis J. Alexander, Byung-Jun Yoon

Figure 1 for Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Figure 2 for Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Figure 3 for Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Figure 4 for Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Abstract:Molecular design based on generative models, such as variational autoencoders (VAEs), has become increasingly popular in recent years due to its efficiency for exploring high-dimensional molecular space to identify molecules with desired properties. While the efficacy of the initial model strongly depends on the training data, the sampling efficiency of the model for suggesting novel molecules with enhanced properties can be further enhanced via latent space optimization. In this paper, we propose a multi-objective latent space optimization (LSO) method that can significantly enhance the performance of generative molecular design (GMD). The proposed method adopts an iterative weighted retraining approach, where the respective weights of the molecules in the training data are determined by their Pareto efficiency. We demonstrate that our multi-objective GMD LSO method can significantly improve the performance of GMD for jointly optimizing multiple molecular properties.

* 13 pages, 6 figures

Via

Access Paper or Ask Questions

Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

Aug 31, 2020

Subhashis Hazarika, Ayan Biswas, Phillip J. Wolfram, Earl Lawrence, Nathan Urban

Figure 1 for Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

Figure 2 for Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

Figure 3 for Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

Figure 4 for Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

Abstract:With the increasing computational power of current supercomputers, the size of data produced by scientific simulations is rapidly growing. To reduce the storage footprint and facilitate scalable post-hoc analyses of such scientific data sets, various data reduction/summarization methods have been proposed over the years. Different flavors of sampling algorithms exist to sample the high-resolution scientific data, while preserving important data properties required for subsequent analyses. However, most of these sampling algorithms are designed for univariate data and cater to post-hoc analyses of single variables. In this work, we propose a multivariate sampling strategy which preserves the original variable relationships and enables different multivariate analyses directly on the sampled data. Our proposed strategy utilizes principal component analysis to capture the variance of multivariate data and can be built on top of any existing state-of-the-art sampling algorithms for single variables. In addition, we also propose variants of different data partitioning schemes (regular and irregular) to efficiently model the local multivariate relationships. Using two real-world multivariate data sets, we demonstrate the efficacy of our proposed multivariate sampling strategy with respect to its data reduction capabilities as well as the ease of performing efficient post-hoc multivariate analyses.

* To appear as IEEE Vis 2020 Shortpaper

Via

Access Paper or Ask Questions