Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Giovanni Trezza

Energy-GNoME: A Living Database of Selected Materials for Energy Applications

Nov 15, 2024

Paolo De Angelis, Giovanni Trezza, Giulio Barletta, Pietro Asinari, Eliodoro Chiavazzo

Figure 1 for Energy-GNoME: A Living Database of Selected Materials for Energy Applications

Figure 2 for Energy-GNoME: A Living Database of Selected Materials for Energy Applications

Figure 3 for Energy-GNoME: A Living Database of Selected Materials for Energy Applications

Figure 4 for Energy-GNoME: A Living Database of Selected Materials for Energy Applications

Abstract:Artificial Intelligence (AI) in materials science is driving significant advancements in the discovery of advanced materials for energy applications. The recent GNoME protocol identifies over 380,000 novel stable crystals. From this, we identify over 33,000 materials with potential as energy materials forming the Energy-GNoME database. Leveraging Machine Learning (ML) and Deep Learning (DL) tools, our protocol mitigates cross-domain data bias using feature spaces to identify potential candidates for thermoelectric materials, novel battery cathodes, and novel perovskites. Classifiers with both structural and compositional features identify domains of applicability, where we expect enhanced accuracy of the regressors. Such regressors are trained to predict key materials properties like, thermoelectric figure of merit (zT), band gap (Eg), and cathode voltage ($\Delta V_c$). This method significantly narrows the pool of potential candidates, serving as an efficient guide for experimental and computational chemistry investigations and accelerating the discovery of materials suited for electricity generation, energy storage and conversion.

* 60 pages, 16 figures

Via

Access Paper or Ask Questions

Learning effective good variables from physical data

Jan 10, 2024

Giulio Barletta, Giovanni Trezza, Eliodoro Chiavazzo

Abstract:We assume that a sufficiently large database is available, where a physical property of interest and a number of associated ruling primitive variables or observables are stored. We introduce and test two machine learning approaches to discover possible groups or combinations of primitive variables: The first approach is based on regression models whereas the second on classification models. The variable group (here referred to as the new effective good variable) can be considered as successfully found, when the physical property of interest is characterized by the following effective invariant behaviour: In the first method, invariance of the group implies invariance of the property up to a given accuracy; in the other method, upon partition of the physical property values into two or more classes, invariance of the group implies invariance of the class. For the sake of illustration, the two methods are successfully applied to two popular empirical correlations describing the convective heat transfer phenomenon and to the Newton's law of universal gravitation.

* 24 pages (main), 8 pages (suppi), 12 figures (main), 3 figures (suppi)

Via

Access Paper or Ask Questions

On some elusive aspects of databases hindering AI based discovery: A case study on superconducting materials

Nov 16, 2023

Giovanni Trezza, Eliodoro Chiavazzo

Figure 1 for On some elusive aspects of databases hindering AI based discovery: A case study on superconducting materials

Figure 2 for On some elusive aspects of databases hindering AI based discovery: A case study on superconducting materials

Figure 3 for On some elusive aspects of databases hindering AI based discovery: A case study on superconducting materials

Figure 4 for On some elusive aspects of databases hindering AI based discovery: A case study on superconducting materials

Abstract:It stands to reason that the amount and the quality of big data is of key importance for setting up accurate AI-driven models. Nonetheless, we believe there are still critical roadblocks in the inherent generation of databases, that are often underestimated and poorly discussed in the literature. In our view, such issues can seriously hinder the AI-based discovery process, even when high quality, sufficiently large and highly reputable data sources are available. Here, considering superconducting and thermoelectric materials as two representative case studies, we specifically discuss three aspects, namely intrinsically biased sample selection, possible hidden variables, disparate data age. Importantly, to our knowledge, we suggest and test a first strategy capable of detecting and quantifying the presence of the intrinsic data bias.

* 20 pages, 3 figures (main), 3 figures (supp info)

Via

Access Paper or Ask Questions