Picture for Bernhard Bermeitinger

Bernhard Bermeitinger

Is More Data Worth the Cost? Dataset Scaling Laws in a Tiny Attention-Only Decoder

Add code
Apr 10, 2026
Viaarxiv icon

Efficient Neural Network Training via Subset Pretraining

Add code
Oct 21, 2024
Figure 1 for Efficient Neural Network Training via Subset Pretraining
Figure 2 for Efficient Neural Network Training via Subset Pretraining
Figure 3 for Efficient Neural Network Training via Subset Pretraining
Figure 4 for Efficient Neural Network Training via Subset Pretraining
Viaarxiv icon

Reducing the Transformer Architecture to a Minimum

Add code
Oct 17, 2024
Figure 1 for Reducing the Transformer Architecture to a Minimum
Figure 2 for Reducing the Transformer Architecture to a Minimum
Figure 3 for Reducing the Transformer Architecture to a Minimum
Figure 4 for Reducing the Transformer Architecture to a Minimum
Viaarxiv icon

Make Deep Networks Shallow Again

Add code
Sep 15, 2023
Figure 1 for Make Deep Networks Shallow Again
Figure 2 for Make Deep Networks Shallow Again
Figure 3 for Make Deep Networks Shallow Again
Figure 4 for Make Deep Networks Shallow Again
Viaarxiv icon

Number of Attention Heads vs Number of Transformer-Encoders in Computer Vision

Add code
Sep 15, 2022
Figure 1 for Number of Attention Heads vs Number of Transformer-Encoders in Computer Vision
Figure 2 for Number of Attention Heads vs Number of Transformer-Encoders in Computer Vision
Figure 3 for Number of Attention Heads vs Number of Transformer-Encoders in Computer Vision
Figure 4 for Number of Attention Heads vs Number of Transformer-Encoders in Computer Vision
Viaarxiv icon

Training Neural Networks in Single vs Double Precision

Add code
Sep 15, 2022
Figure 1 for Training Neural Networks in Single vs Double Precision
Figure 2 for Training Neural Networks in Single vs Double Precision
Figure 3 for Training Neural Networks in Single vs Double Precision
Viaarxiv icon

Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics

Add code
Jan 07, 2020
Figure 1 for Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics
Figure 2 for Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics
Figure 3 for Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics
Figure 4 for Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics
Viaarxiv icon

Representational Capacity of Deep Neural Networks -- A Computing Study

Add code
Jul 19, 2019
Figure 1 for Representational Capacity of Deep Neural Networks -- A Computing Study
Figure 2 for Representational Capacity of Deep Neural Networks -- A Computing Study
Figure 3 for Representational Capacity of Deep Neural Networks -- A Computing Study
Viaarxiv icon

Singular Value Decomposition and Neural Networks

Add code
Jun 27, 2019
Figure 1 for Singular Value Decomposition and Neural Networks
Figure 2 for Singular Value Decomposition and Neural Networks
Figure 3 for Singular Value Decomposition and Neural Networks
Figure 4 for Singular Value Decomposition and Neural Networks
Viaarxiv icon

Object Classification in Images of Neoclassical Artifacts Using Deep Learning

Add code
Oct 13, 2017
Viaarxiv icon