Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:$\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization

Feb 19, 2025

Vishal Dey, Xiao Hu, Xia Ning

$Figure 1 for $\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization$

$Figure 2 for $\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization$

$Figure 3 for $\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization$

$Figure 4 for $\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization$

Share this with someone who'll enjoy it:

Abstract:Despite recent advancements, most computational methods for molecule optimization are constrained to single- or double-property optimization tasks and suffer from poor scalability and generalizability to novel optimization tasks. Meanwhile, Large Language Models (LLMs) demonstrate remarkable out-of-domain generalizability to novel tasks. To demonstrate LLMs' potential for molecule optimization, we introduce $\mathtt{MoMUInstruct}$, the first high-quality instruction-tuning dataset specifically focused on complex multi-property molecule optimization tasks. Leveraging $\mathtt{MoMUInstruct}$, we develop $\mathtt{GeLLM^3O}$s, a series of instruction-tuned LLMs for molecule optimization. Extensive evaluations across 5 in-domain and 5 out-of-domain tasks demonstrate that $\mathtt{GeLLM^3O}$s consistently outperform state-of-the-art baselines. $\mathtt{GeLLM^3O}$s also exhibit outstanding zero-shot generalization to unseen tasks, significantly outperforming powerful closed-source LLMs. Such strong generalizability demonstrates the tremendous potential of $\mathtt{GeLLM^3O}$s as foundational models for molecule optimization, thereby tackling novel optimization tasks without resource-intensive retraining. $\mathtt{MoMUInstruct}$, models, and code are accessible through https://github.com/ninglab/GeLLMO.

* Vishal Dey and Xiao Hu contributed equally to this paper

View paper on

Share this with someone who'll enjoy it:

Title:$\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization

Paper and Code