Pablo Samuel Castro

Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning

Mar 08, 2025

Multi-Task Reinforcement Learning Enables Parameter Scaling

Mar 07, 2025

CALE: Continuous Arcade Learning Environment

Oct 31, 2024

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Oct 02, 2024

NAVIX: Scaling MiniGrid Environments with JAX

Jul 28, 2024

Mixture of Experts in a Mixture of RL settings

Jun 26, 2024

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Jun 25, 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Mar 06, 2024

In deep reinforcement learning, a pruned network is a good network

Feb 19, 2024

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Feb 13, 2024