Picture for Olivier Sigaud

Olivier Sigaud

ISIR

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Add code
Oct 29, 2024
Viaarxiv icon

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Add code
Oct 16, 2024
Figure 1 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 2 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 3 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 4 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Viaarxiv icon

Bridging Environments and Language with Rendering Functions and Vision-Language Models

Add code
Sep 24, 2024
Figure 1 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 2 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 3 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 4 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Viaarxiv icon

Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning

Add code
Jul 02, 2024
Figure 1 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 2 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 3 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 4 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Viaarxiv icon

Single-Reset Divide & Conquer Imitation Learning

Add code
Feb 14, 2024
Viaarxiv icon

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

Add code
Nov 02, 2023
Viaarxiv icon

A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

Add code
Oct 09, 2023
Viaarxiv icon

Utility-based Adaptive Teaching Strategies using Bayesian Theory of Mind

Add code
Sep 29, 2023
Viaarxiv icon

Enhancing Agent Communication and Learning through Action and Language

Add code
Aug 28, 2023
Viaarxiv icon

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Add code
Feb 06, 2023
Viaarxiv icon