Marc G. Bellemare

Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning

Oct 14, 2024
Via arXiv

Controlling Large Language Model Agents with Entropic Activation Steering

Jun 01, 2024
Via arXiv

A Distributional Analogue to the Successor Representation

Feb 13, 2024
Via arXiv

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Nov 21, 2023
Via arXiv

Small batch deep reinforcement learning

Oct 05, 2023
Via arXiv

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control

Sep 26, 2023
Via arXiv

Bootstrapped Representations in Reinforcement Learning

Jun 16, 2023
Via arXiv

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation

May 28, 2023
Via arXiv

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

Apr 25, 2023
Via arXiv

An Analysis of Quantile Temporal-Difference Learning

Jan 11, 2023
Via arXiv