Abstract: Recent advances in language models have enabled framing molecule generation as sequence modeling. However, existing approaches often rely on single-objective reinforcement learning, limiting their applicability to real-world drug design, where multiple competing properties must be optimized. Traditional multi-objective reinforcement learning (MORL) methods require costly retraining for each new objective combination, making rapid exploration of trade-offs impractical. To overcome these limitations, we introduce Mol-MoE, a mixture-of-experts (MoE) architecture that enables efficient test-time steering of molecule generation without retraining. Central to our approach is a preference-based router training objective that incentivizes the router to combine experts in a way that aligns with user-specified trade-offs. This provides greater flexibility in exploring the chemical property space at test time, facilitating rapid trade-off exploration. Benchmarking against state-of-the-art methods, we show that Mol-MoE achieves superior sample quality and steerability.
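To make the steering mechanism concrete, the sketch below shows one way a preference-conditioned router could mix the next-token logits of several property experts at test time. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `PreferenceRouter`, `mix_expert_logits`, and the shapes involved are hypothetical.

```python
# Minimal sketch (assumed design, not the paper's code): a router maps a
# user-specified preference vector to mixing weights over property experts,
# and the experts' next-token logits are combined with those weights.
import torch
import torch.nn as nn

class PreferenceRouter(nn.Module):
    """Maps a preference vector (one weight per property) to expert mixing weights."""
    def __init__(self, num_experts: int, hidden: int = 32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(num_experts, hidden), nn.ReLU(),
            nn.Linear(hidden, num_experts),
        )

    def forward(self, prefs: torch.Tensor) -> torch.Tensor:
        # prefs: (batch, num_experts) user trade-offs -> (batch, num_experts) weights
        return torch.softmax(self.net(prefs), dim=-1)

def mix_expert_logits(expert_logits: torch.Tensor, weights: torch.Tensor) -> torch.Tensor:
    """Weighted combination of per-expert next-token logits.

    expert_logits: (num_experts, batch, vocab)
    weights:       (batch, num_experts)
    """
    return torch.einsum("ebv,be->bv", expert_logits, weights)

# Toy usage: 3 property experts, batch of 2, a 50-token SMILES vocabulary.
num_experts, batch, vocab = 3, 2, 50
router = PreferenceRouter(num_experts)
prefs = torch.tensor([[0.7, 0.2, 0.1], [0.1, 0.1, 0.8]])   # user-specified trade-offs
expert_logits = torch.randn(num_experts, batch, vocab)      # stand-in for expert outputs
mixed = mix_expert_logits(expert_logits, router(prefs))
print(mixed.shape)  # torch.Size([2, 50])
```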
Abstract: Large language models (LLMs) are a promising avenue for natural language understanding and generation tasks. However, current LLMs are far from reliable: they are prone to generating non-factual information and, more crucially, to contradicting themselves when prompted to reason about their beliefs about the world. These problems are currently addressed with large-scale fine-tuning or by delegating consistent reasoning to external tools. In this work, we strive for a middle ground and introduce a training objective based on principled probabilistic reasoning that teaches an LLM to be consistent with external knowledge in the form of a set of facts and rules. Fine-tuning with our loss on a limited set of facts enables our LLMs to be more logically consistent than previous baselines and allows them to extrapolate to unseen but semantically similar factual knowledge more systematically.
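As an illustration of what a probabilistic consistency objective can look like, the sketch below penalizes an LLM's belief probabilities when they violate negation and simple rule-based (modus ponens) constraints. This is an assumed, simplified formulation for intuition only, not the loss proposed in the paper; `p_fact`, `p_neg`, and `p_conseq` are hypothetical stand-ins for the model's probabilities that a statement A, its negation, and a rule-implied consequence B are true.

```python
# Minimal sketch (assumption, not the paper's objective): a consistency penalty on
# the probabilities an LLM assigns to a statement A, its negation, and a consequence
# B implied by a rule A -> B.
import torch

def consistency_loss(p_fact: torch.Tensor, p_neg: torch.Tensor, p_conseq: torch.Tensor) -> torch.Tensor:
    # Negation consistency: P(A) and P(not A) should sum to 1.
    negation_term = (p_fact + p_neg - 1.0).pow(2)
    # Modus ponens: under the rule A -> B, P(B) should be at least P(A).
    implication_term = torch.relu(p_fact - p_conseq).pow(2)
    return (negation_term + implication_term).mean()

# Toy usage with made-up probabilities for a small batch of statements.
p_fact   = torch.tensor([0.9, 0.6])
p_neg    = torch.tensor([0.3, 0.5])
p_conseq = torch.tensor([0.7, 0.8])
print(consistency_loss(p_fact, p_neg, p_conseq))
```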