Picture for Bobby He

Bobby He

Understanding and Minimising Outlier Features in Neural Network Training

Add code
May 29, 2024
Figure 1 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 2 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 3 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 4 for Understanding and Minimising Outlier Features in Neural Network Training
Viaarxiv icon

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends

Add code
Mar 12, 2024
Figure 1 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 2 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 3 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 4 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Viaarxiv icon

Recurrent Distance-Encoding Neural Networks for Graph Representation Learning

Add code
Dec 03, 2023
Figure 1 for Recurrent Distance-Encoding Neural Networks for Graph Representation Learning
Figure 2 for Recurrent Distance-Encoding Neural Networks for Graph Representation Learning
Figure 3 for Recurrent Distance-Encoding Neural Networks for Graph Representation Learning
Figure 4 for Recurrent Distance-Encoding Neural Networks for Graph Representation Learning
Viaarxiv icon

Simplifying Transformer Blocks

Add code
Nov 03, 2023
Figure 1 for Simplifying Transformer Blocks
Figure 2 for Simplifying Transformer Blocks
Figure 3 for Simplifying Transformer Blocks
Figure 4 for Simplifying Transformer Blocks
Viaarxiv icon

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit

Add code
Jun 30, 2023
Figure 1 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 2 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 3 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 4 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Viaarxiv icon

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Add code
Feb 20, 2023
Figure 1 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 2 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 3 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 4 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Viaarxiv icon

UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography

Add code
Feb 22, 2022
Figure 1 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 2 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 3 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 4 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Viaarxiv icon

Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning

Add code
Oct 22, 2021
Figure 1 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 2 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 3 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 4 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Viaarxiv icon

Stable ResNet

Add code
Oct 24, 2020
Figure 1 for Stable ResNet
Figure 2 for Stable ResNet
Figure 3 for Stable ResNet
Figure 4 for Stable ResNet
Viaarxiv icon

Bayesian Deep Ensembles via the Neural Tangent Kernel

Add code
Jul 11, 2020
Figure 1 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 2 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 3 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 4 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Viaarxiv icon