Picture for Sylvain Lamprier

Sylvain Lamprier

MLIA

Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning

Add code
Nov 12, 2024
Viaarxiv icon

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Add code
Oct 29, 2024
Viaarxiv icon

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Add code
Oct 16, 2024
Figure 1 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 2 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 3 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Figure 4 for SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Viaarxiv icon

Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey

Add code
Mar 01, 2024
Viaarxiv icon

Training Table Question Answering via SQL Query Decomposition

Add code
Feb 19, 2024
Viaarxiv icon

Deinterleaving of Discrete Renewal Process Mixtures with Application to Electronic Support Measures

Add code
Feb 14, 2024
Viaarxiv icon

On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

Add code
Oct 27, 2023
Viaarxiv icon

Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search

Add code
Feb 22, 2023
Viaarxiv icon

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Add code
Feb 06, 2023
Viaarxiv icon

EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

Add code
Jun 20, 2022
Figure 1 for EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Figure 2 for EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Figure 3 for EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Figure 4 for EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Viaarxiv icon