Picture for Edouard Leurent

Edouard Leurent

SEQUEL, NON-A-POST

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Add code
Jul 22, 2024
Figure 1 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 2 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 3 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 4 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Diversifying AI: Towards Creative Chess with AlphaZero

Add code
Aug 29, 2023
Viaarxiv icon

Optimizing Memory Mapping Using Deep Reinforcement Learning

Add code
May 11, 2023
Viaarxiv icon

Fast active learning for pure exploration in reinforcement learning

Add code
Jul 27, 2020
Figure 1 for Fast active learning for pure exploration in reinforcement learning
Viaarxiv icon

Adaptive Reward-Free Exploration

Add code
Jun 11, 2020
Figure 1 for Adaptive Reward-Free Exploration
Figure 2 for Adaptive Reward-Free Exploration
Figure 3 for Adaptive Reward-Free Exploration
Viaarxiv icon

Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

Add code
Jun 10, 2020
Figure 1 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 2 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 3 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 4 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Viaarxiv icon

Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs

Add code
Feb 25, 2020
Figure 1 for Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
Figure 2 for Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
Figure 3 for Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
Figure 4 for Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs
Viaarxiv icon

Social Attention for Autonomous Decision-Making in Dense Traffic

Add code
Nov 27, 2019
Figure 1 for Social Attention for Autonomous Decision-Making in Dense Traffic
Figure 2 for Social Attention for Autonomous Decision-Making in Dense Traffic
Figure 3 for Social Attention for Autonomous Decision-Making in Dense Traffic
Figure 4 for Social Attention for Autonomous Decision-Making in Dense Traffic
Viaarxiv icon

Practical Open-Loop Optimistic Planning

Add code
Apr 09, 2019
Figure 1 for Practical Open-Loop Optimistic Planning
Figure 2 for Practical Open-Loop Optimistic Planning
Figure 3 for Practical Open-Loop Optimistic Planning
Figure 4 for Practical Open-Loop Optimistic Planning
Viaarxiv icon