Joshua Susskind

How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks

Jul 03, 2024

Vanishing Gradients in Reinforcement Finetuning of Language Models

Oct 31, 2023

When can transformers reason with abstract symbols?

Oct 15, 2023

Transformers learn through gradual rank increase

Jun 12, 2023

Position Prediction as an Effective Pretraining Strategy

Jul 15, 2022

The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon

Jun 13, 2022

Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation

Jan 28, 2022

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

May 17, 2021

Collegial Ensembles

Jun 17, 2020