Kyle Matoba

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Nov 27, 2023
Via arXiv

Accurate Extrinsic Prediction of Physical Systems Using Transformers

Oct 20, 2022
Via arXiv

Flatten the Curve: Efficiently Training Low-Curvature Neural Networks

Jun 14, 2022
Via arXiv

The Theoretical Expressiveness of Maxpooling

Mar 02, 2022
Via arXiv

Challenges for Using Impact Regularizers to Avoid Negative Side Effects

Feb 23, 2021
Via arXiv