Picture for Gregor Bachmann

Gregor Bachmann

Interpolated-MLPs: Controllable Inductive Bias

Add code
Oct 12, 2024
Viaarxiv icon

The pitfalls of next-token prediction

Add code
Mar 11, 2024
Viaarxiv icon

A Language Model's Guide Through Latent Space

Add code
Feb 22, 2024
Viaarxiv icon

How Good is a Single Basin?

Add code
Feb 05, 2024
Viaarxiv icon

Disentangling Linear Mode-Connectivity

Add code
Dec 15, 2023
Viaarxiv icon

Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies

Add code
Nov 06, 2023
Figure 1 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 2 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 3 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 4 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Viaarxiv icon

Scaling MLPs: A Tale of Inductive Bias

Add code
Jun 23, 2023
Viaarxiv icon

Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes

Add code
Jun 04, 2023
Viaarxiv icon

CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes

Add code
Apr 12, 2023
Viaarxiv icon

Random Teachers are Good Teachers

Add code
Feb 23, 2023
Viaarxiv icon