Picture for Adrien Bolland

Adrien Bolland

Off-Policy Maximum Entropy RL with Future State and Action Visitation Measures

Add code
Dec 09, 2024
Viaarxiv icon

Costs Estimation in Unit Commitment Problems using Simulation-Based Inference

Add code
Sep 05, 2024
Viaarxiv icon

Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems

Add code
Jun 28, 2024
Viaarxiv icon

Behind the Myth of Exploration in Policy Gradients

Add code
Jan 31, 2024
Viaarxiv icon

Informed POMDP: Leveraging Additional Information in Model-Based RL

Add code
Jun 24, 2023
Viaarxiv icon

Policy Gradient Algorithms Implicitly Optimize by Continuation

Add code
May 11, 2023
Viaarxiv icon

Recurrent networks, hidden states and beliefs in partially observable environments

Add code
Aug 06, 2022
Figure 1 for Recurrent networks, hidden states and beliefs in partially observable environments
Figure 2 for Recurrent networks, hidden states and beliefs in partially observable environments
Figure 3 for Recurrent networks, hidden states and beliefs in partially observable environments
Figure 4 for Recurrent networks, hidden states and beliefs in partially observable environments
Viaarxiv icon

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

Add code
Jun 06, 2021
Figure 1 for Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Figure 2 for Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Figure 3 for Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Figure 4 for Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Viaarxiv icon

Learning optimal environments using projected stochastic gradient ascent

Add code
Jun 02, 2020
Figure 1 for Learning optimal environments using projected stochastic gradient ascent
Figure 2 for Learning optimal environments using projected stochastic gradient ascent
Figure 3 for Learning optimal environments using projected stochastic gradient ascent
Figure 4 for Learning optimal environments using projected stochastic gradient ascent
Viaarxiv icon

A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding

Add code
Apr 13, 2020
Figure 1 for A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
Figure 2 for A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
Figure 3 for A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
Figure 4 for A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
Viaarxiv icon