Picture for Olivier Sigaud

Olivier Sigaud

ISIR

HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents

Add code
Aug 20, 2025
Viaarxiv icon

Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

A tale of two goals: leveraging sequentiality in multi-goal scenarios

Add code
Mar 27, 2025
Viaarxiv icon

VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making

Add code
Mar 19, 2025
Viaarxiv icon

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Add code
Feb 12, 2025
Figure 1 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 2 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 3 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 4 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Viaarxiv icon

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Add code
Oct 29, 2024
Figure 1 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 2 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 3 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 4 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Viaarxiv icon

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Add code
Oct 16, 2024
Figure 1 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 2 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 3 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 4 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Viaarxiv icon

Bridging Environments and Language with Rendering Functions and Vision-Language Models

Add code
Sep 24, 2024
Figure 1 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 2 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 3 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Figure 4 for Bridging Environments and Language with Rendering Functions and Vision-Language Models
Viaarxiv icon

Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning

Add code
Jul 02, 2024
Figure 1 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 2 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 3 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Figure 4 for Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Viaarxiv icon

Single-Reset Divide & Conquer Imitation Learning

Add code
Feb 14, 2024
Viaarxiv icon