Picture for Dan Busbridge

Dan Busbridge

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

Add code
Sep 06, 2024
Viaarxiv icon

Poly-View Contrastive Learning

Add code
Mar 08, 2024
Viaarxiv icon

Bootstrap Your Own Variance

Add code
Dec 06, 2023
Viaarxiv icon

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation

Add code
Sep 07, 2023
Viaarxiv icon

How to Scale Your EMA

Add code
Jul 27, 2023
Viaarxiv icon

The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning

Add code
Jul 20, 2023
Viaarxiv icon

DUET: 2D Structured and Approximately Equivariant Representations

Add code
Jun 30, 2023
Viaarxiv icon

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

Add code
Mar 11, 2023
Viaarxiv icon

Elastic Weight Consolidation Improves the Robustness of Self-Supervised Learning Methods under Transfer

Add code
Oct 28, 2022
Viaarxiv icon

Position Prediction as an Effective Pretraining Strategy

Add code
Jul 15, 2022
Figure 1 for Position Prediction as an Effective Pretraining Strategy
Figure 2 for Position Prediction as an Effective Pretraining Strategy
Figure 3 for Position Prediction as an Effective Pretraining Strategy
Figure 4 for Position Prediction as an Effective Pretraining Strategy
Viaarxiv icon