Picture for Johan Obando-Ceron

Johan Obando-Ceron

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Add code
Apr 09, 2025
Viaarxiv icon

Adaptive Computation Pruning for the Forgetting Transformer

Add code
Apr 09, 2025
Viaarxiv icon

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Add code
Mar 24, 2025
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

Neuroplastic Expansion in Deep Reinforcement Learning

Add code
Oct 10, 2024
Figure 1 for Neuroplastic Expansion in Deep Reinforcement Learning
Figure 2 for Neuroplastic Expansion in Deep Reinforcement Learning
Figure 3 for Neuroplastic Expansion in Deep Reinforcement Learning
Figure 4 for Neuroplastic Expansion in Deep Reinforcement Learning
Viaarxiv icon

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Add code
Oct 02, 2024
Figure 1 for Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Figure 2 for Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Figure 3 for Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Figure 4 for Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Viaarxiv icon

Mixture of Experts in a Mixture of RL settings

Add code
Jun 26, 2024
Viaarxiv icon

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Add code
Jun 25, 2024
Figure 1 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 2 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 3 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 4 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Viaarxiv icon

In deep reinforcement learning, a pruned network is a good network

Add code
Feb 19, 2024
Figure 1 for In deep reinforcement learning, a pruned network is a good network
Figure 2 for In deep reinforcement learning, a pruned network is a good network
Figure 3 for In deep reinforcement learning, a pruned network is a good network
Figure 4 for In deep reinforcement learning, a pruned network is a good network
Viaarxiv icon

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Add code
Feb 13, 2024
Figure 1 for Mixtures of Experts Unlock Parameter Scaling for Deep RL
Figure 2 for Mixtures of Experts Unlock Parameter Scaling for Deep RL
Figure 3 for Mixtures of Experts Unlock Parameter Scaling for Deep RL
Figure 4 for Mixtures of Experts Unlock Parameter Scaling for Deep RL
Viaarxiv icon