Picture for Sylvain Lamprier

Sylvain Lamprier

MLIA

Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment

Add code
Jan 30, 2026
Viaarxiv icon

Reward-Preserving Attacks For Robust Reinforcement Learning

Add code
Jan 12, 2026
Viaarxiv icon

Black-Box Combinatorial Optimization with Order-Invariant Reinforcement Learning

Add code
Oct 02, 2025
Viaarxiv icon

HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents

Add code
Aug 20, 2025
Figure 1 for HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents
Figure 2 for HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents
Figure 3 for HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents
Figure 4 for HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents
Viaarxiv icon

Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

Offline Learning of Controllable Diverse Behaviors

Add code
Apr 25, 2025
Viaarxiv icon

Structural Deep Encoding for Table Question Answering

Add code
Mar 03, 2025
Viaarxiv icon

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Add code
Feb 12, 2025
Figure 1 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 2 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 3 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Figure 4 for MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Viaarxiv icon

Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning

Add code
Nov 12, 2024
Figure 1 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 2 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 3 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 4 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Viaarxiv icon

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Add code
Oct 29, 2024
Figure 1 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 2 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 3 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 4 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Viaarxiv icon