Title: LLM4Mat-Bench: Benchmarking Large Language Models for Materials Property Prediction

Oct 31, 2024

Andre Niyongabo Rubungo, Kangming Li, Jason Hattrick-Simpers, Adji Bousso Dieng

Abstract: Large language models (LLMs) are increasingly being used in materials science. However, little attention has been given to benchmarking and standardized evaluation for LLM-based materials property prediction, which hinders progress. We present LLM4Mat-Bench, the largest benchmark to date for evaluating the performance of LLMs in predicting the properties of crystalline materials. LLM4Mat-Bench contains about 1.9M crystal structures in total, collected from 10 publicly available materials data sources, and 45 distinct properties. It features three input modalities: crystal composition, CIF, and crystal text description, with 4.7M, 615.5M, and 3.1B tokens in total, respectively. We use LLM4Mat-Bench to fine-tune models of different sizes, including LLM-Prop and MatBERT, and provide zero-shot and few-shot prompts to evaluate the property prediction capabilities of chat-style LLMs, including Llama, Gemma, and Mistral. The results highlight the challenges that general-purpose LLMs face in materials science and the need for task-specific predictive models and task-specific instruction-tuned LLMs in materials property prediction.

* Accepted at the NeurIPS 2024 AI4Mat Workshop. The benchmark and code are available at: https://github.com/vertaix/LLM4Mat-Bench
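
To make the zero-shot versus few-shot distinction in the abstract concrete, here is a minimal sketch of how such prompts might be assembled for a composition input. The template wording, the `build_prompt` helper, and the example values are assumptions made for illustration; they are not the prompts or data shipped with LLM4Mat-Bench.

```python
from typing import Sequence, Tuple


def build_prompt(
    property_name: str,
    query: str,
    examples: Sequence[Tuple[str, float]] = (),
) -> str:
    """Assemble a property-prediction prompt (hypothetical template).

    With no examples this is a zero-shot prompt; with one or more
    (material, value) pairs it becomes a few-shot prompt.
    """
    lines = [
        f"Predict the {property_name} of the crystalline material below.",
        "Answer with a single number.",
    ]
    # Few-shot demonstrations: each one shows an input and its known value.
    for material, value in examples:
        lines.append(f"Material: {material}")
        lines.append(f"{property_name}: {value}")
    # The query material, with the answer left for the model to complete.
    lines.append(f"Material: {query}")
    lines.append(f"{property_name}:")
    return "\n".join(lines)


# Zero-shot: only the instruction and the query.
zero_shot = build_prompt("band gap (eV)", "NaCl")

# Few-shot: rough placeholder values, not taken from the benchmark.
few_shot = build_prompt(
    "band gap (eV)",
    "NaCl",
    examples=[("Si", 1.1), ("GaAs", 1.4)],
)

print(few_shot)
```

The same helper could in principle take a CIF string or a text description as `query`, which is where the much larger token counts of those modalities would matter for context length.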
