Picture for João G. M. Araújo

João G. M. Araújo

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Add code
Jun 25, 2024
Figure 1 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 2 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 3 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 4 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Viaarxiv icon

Transformers need glasses! Information over-squashing in language tasks

Add code
Jun 06, 2024
Figure 1 for Transformers need glasses! Information over-squashing in language tasks
Figure 2 for Transformers need glasses! Information over-squashing in language tasks
Figure 3 for Transformers need glasses! Information over-squashing in language tasks
Figure 4 for Transformers need glasses! Information over-squashing in language tasks
Viaarxiv icon

Categorical Deep Learning: An Algebraic Theory of Architectures

Add code
Feb 23, 2024
Viaarxiv icon

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Add code
Feb 05, 2024
Figure 1 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 2 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 3 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 4 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Viaarxiv icon

Scalable Training of Language Models using JAX pjit and TPUv4

Add code
Apr 13, 2022
Figure 1 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 2 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 3 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 4 for Scalable Training of Language Models using JAX pjit and TPUv4
Viaarxiv icon

No News is Good News: A Critique of the One Billion Word Benchmark

Add code
Oct 25, 2021
Figure 1 for No News is Good News: A Critique of the One Billion Word Benchmark
Figure 2 for No News is Good News: A Critique of the One Billion Word Benchmark
Viaarxiv icon

Mitigating harm in language models with conditional-likelihood filtration

Add code
Sep 04, 2021
Figure 1 for Mitigating harm in language models with conditional-likelihood filtration
Figure 2 for Mitigating harm in language models with conditional-likelihood filtration
Figure 3 for Mitigating harm in language models with conditional-likelihood filtration
Figure 4 for Mitigating harm in language models with conditional-likelihood filtration
Viaarxiv icon