Pablo Samuel Castro

CALE: Continuous Arcade Learning Environment

Oct 31, 2024
Via arXiv

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Oct 02, 2024
Via arXiv

NAVIX: Scaling MiniGrid Environments with JAX

Jul 28, 2024
Via arXiv

Mixture of Experts in a Mixture of RL settings

Jun 26, 2024
Via arXiv

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Jun 25, 2024
Via arXiv

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Mar 06, 2024
Via arXiv

In deep reinforcement learning, a pruned network is a good network

Feb 19, 2024
Via arXiv

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Feb 13, 2024
Via arXiv

A density estimation perspective on learning from pairwise human preferences

Nov 30, 2023
Via arXiv

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Nov 21, 2023
Via arXiv