Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Johannes Schilling

SPT-NRTL: A physics-guided machine learning model to predict thermodynamically consistent activity coefficients

Sep 09, 2022

Benedikt Winter, Clemens Winter, Timm Esper, Johannes Schilling, André Bardow

Figure 1 for SPT-NRTL: A physics-guided machine learning model to predict thermodynamically consistent activity coefficients

Figure 2 for SPT-NRTL: A physics-guided machine learning model to predict thermodynamically consistent activity coefficients

Figure 3 for SPT-NRTL: A physics-guided machine learning model to predict thermodynamically consistent activity coefficients

Figure 4 for SPT-NRTL: A physics-guided machine learning model to predict thermodynamically consistent activity coefficients

Abstract:The availability of property data is one of the major bottlenecks in the development of chemical processes, often requiring time-consuming and expensive experiments or limiting the design space to a small number of known molecules. This bottleneck has been the motivation behind the continuing development of predictive property models. For the property prediction of novel molecules, group contribution methods have been groundbreaking. In recent times, machine learning has joined the more established property prediction models. However, even with recent successes, the integration of physical constraints into machine learning models remains challenging. Physical constraints are vital to many thermodynamic properties, such as the Gibbs-Dunham relation, introducing an additional layer of complexity into the prediction. Here, we introduce SPT-NRTL, a machine learning model to predict thermodynamically consistent activity coefficients and provide NRTL parameters for easy use in process simulations. The results show that SPT-NRTL achieves higher accuracy than UNIFAC in the prediction of activity coefficients across all functional groups and is able to predict many vapor-liquid-equilibria with near experimental accuracy, as illustrated for the exemplary mixtures water/ethanol and chloroform/n-hexane. To ease the application of SPT-NRTL, NRTL-parameters of 100 000 000 mixtures are calculated with SPT-NRTL and provided online.

* NRTL parameters for 100 000 000 are currently hosted here: https://polybox.ethz.ch/index.php/s/unM7rbgj2FQPFdy

Via

Access Paper or Ask Questions

A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing

Jun 15, 2022

Benedikt Winter, Clemens Winter, Johannes Schilling, André Bardow

Figure 1 for A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing

Figure 2 for A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing

Figure 3 for A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing

Figure 4 for A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing

Abstract:Knowledge of mixtures' phase equilibria is crucial in nature and technical chemistry. Phase equilibria calculations of mixtures require activity coefficients. However, experimental data on activity coefficients is often limited due to high cost of experiments. For an accurate and efficient prediction of activity coefficients, machine learning approaches have been recently developed. However, current machine learning approaches still extrapolate poorly for activity coefficients of unknown molecules. In this work, we introduce the SMILES-to-Properties-Transformer (SPT), a natural language processing network to predict binary limiting activity coefficients from SMILES codes. To overcome the limitations of available experimental data, we initially train our network on a large dataset of synthetic data sampled from COSMO-RS (10 Million data points) and then fine-tune the model on experimental data (20 870 data points). This training strategy enables SPT to accurately predict limiting activity coefficients even for unknown molecules, cutting the mean prediction error in half compared to state-of-the-art models for activity coefficient predictions such as COSMO-RS, UNIFAC, and improving on recent machine learning approaches.

* Code available at: https://github.com/Bene94/SMILES2PropertiesTransformer; Data available at: https://polybox.ethz.ch/index.php/s/kyVOt3pwHW26PP4

Via

Access Paper or Ask Questions