Picture for Sylvain Lamprier

Sylvain Lamprier

MLIA

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Add code
Feb 12, 2025
Viaarxiv icon

Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning

Add code
Nov 12, 2024
Figure 1 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 2 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 3 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Figure 4 for Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
Viaarxiv icon

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Add code
Oct 29, 2024
Figure 1 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 2 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 3 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Figure 4 for Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Viaarxiv icon

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Add code
Oct 16, 2024
Figure 1 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 2 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 3 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 4 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Viaarxiv icon

Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey

Add code
Mar 01, 2024
Viaarxiv icon

Training Table Question Answering via SQL Query Decomposition

Add code
Feb 19, 2024
Viaarxiv icon

Deinterleaving of Discrete Renewal Process Mixtures with Application to Electronic Support Measures

Add code
Feb 14, 2024
Viaarxiv icon

On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

Add code
Oct 27, 2023
Viaarxiv icon

Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search

Add code
Feb 22, 2023
Viaarxiv icon

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Add code
Feb 06, 2023
Figure 1 for Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Figure 2 for Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Figure 3 for Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Figure 4 for Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Viaarxiv icon