Picture for Mathieu Reymond

Mathieu Reymond

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Add code
Mar 03, 2026
Viaarxiv icon

CoPeP: Benchmarking Continual Pretraining for Protein Language Models

Add code
Mar 03, 2026
Viaarxiv icon

Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning

Add code
Feb 10, 2026
Viaarxiv icon

Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation

Add code
Oct 05, 2025
Figure 1 for Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Figure 2 for Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Figure 3 for Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Figure 4 for Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Viaarxiv icon

A Generalist Hanabi Agent

Add code
Mar 17, 2025
Viaarxiv icon

Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning

Add code
Feb 11, 2024
Viaarxiv icon

Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning

Add code
Dec 06, 2022
Viaarxiv icon

Pareto Conditioned Networks

Add code
Apr 11, 2022
Figure 1 for Pareto Conditioned Networks
Figure 2 for Pareto Conditioned Networks
Figure 3 for Pareto Conditioned Networks
Figure 4 for Pareto Conditioned Networks
Viaarxiv icon

Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning

Add code
Apr 11, 2022
Figure 1 for Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning
Figure 2 for Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning
Figure 3 for Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning
Figure 4 for Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning
Viaarxiv icon

Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 23, 2021
Figure 1 for Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon