Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yoolhee Kim

Machine-learned metrics for predicting the likelihood of success in materials discovery

Nov 27, 2019

Yoolhee Kim, Edward Kim, Erin Antono, Bryce Meredig, Julia Ling

Figure 1 for Machine-learned metrics for predicting the likelihood of success in materials discovery

Figure 2 for Machine-learned metrics for predicting the likelihood of success in materials discovery

Figure 3 for Machine-learned metrics for predicting the likelihood of success in materials discovery

Figure 4 for Machine-learned metrics for predicting the likelihood of success in materials discovery

Abstract:Materials discovery is often compared to the challenge of finding a needle in a haystack. While much work has focused on accurately predicting the properties of candidate materials with machine learning (ML), which amounts to evaluating whether a given candidate is a piece of straw or a needle, less attention has been paid to a critical question: Are we searching in the right haystack? We refer to the haystack as the design space for a particular materials discovery problem (i.e. the set of possible candidate materials to synthesize), and thus frame this question as one of design space selection. In this paper, we introduce two metrics, the Predicted Fraction of Improved Candidates (PFIC), and the Cumulative Maximum Likelihood of Improvement (CMLI), which we demonstrate can identify discovery-rich and discovery-poor design spaces, respectively. Using CMLI and PFIC together to identify optimal design spaces can significantly accelerate ML-driven materials discovery.

* 13 pages, 10 figures, 2 tables

Via

Access Paper or Ask Questions

Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Nov 06, 2019

Zachary del Rosario, Yoolhee Kim, Matthias Rupp, Erin Antono, Julia Ling

Figure 1 for Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Figure 2 for Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Figure 3 for Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Figure 4 for Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Abstract:Discovering novel materials can be greatly accelerated by iterative machine learning-informed proposal of candidates---active learning. However, standard \emph{global-scope error} metrics for model quality are not predictive of discovery performance, and can be misleading. We introduce the notion of \emph{Pareto shell-scope error} to help judge the suitability of a model for proposing material candidates. Further, through synthetic cases and a thermoelectric dataset, we probe the relation between acquisition function fidelity and active learning performance. Results suggest novel diagnostic tools, as well as new insights for acquisition function design.

* 19 pages, 24 figures

Via

Access Paper or Ask Questions