Picture for Michael Mahoney

Michael Mahoney

Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior

Add code
Jun 01, 2023
Viaarxiv icon

GACT: Activation Compressed Training for General Architectures

Add code
Jun 28, 2022
Figure 1 for GACT: Activation Compressed Training for General Architectures
Figure 2 for GACT: Activation Compressed Training for General Architectures
Figure 3 for GACT: Activation Compressed Training for General Architectures
Figure 4 for GACT: Activation Compressed Training for General Architectures
Viaarxiv icon

AutoIP: A United Framework to Integrate Physics into Gaussian Processes

Add code
Feb 24, 2022
Figure 1 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 2 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 3 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 4 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Viaarxiv icon

LocalNewton: Reducing Communication Bottleneck for Distributed Learning

Add code
May 16, 2021
Figure 1 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 2 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 3 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 4 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Viaarxiv icon

Rethinking Batch Normalization in Transformers

Add code
Mar 17, 2020
Figure 1 for Rethinking Batch Normalization in Transformers
Figure 2 for Rethinking Batch Normalization in Transformers
Figure 3 for Rethinking Batch Normalization in Transformers
Figure 4 for Rethinking Batch Normalization in Transformers
Viaarxiv icon

PyHessian: Neural Networks Through the Lens of the Hessian

Add code
Jan 02, 2020
Figure 1 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 2 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 3 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 4 for PyHessian: Neural Networks Through the Lens of the Hessian
Viaarxiv icon

ANODEV2: A Coupled Neural ODE Evolution Framework

Add code
Jun 10, 2019
Figure 1 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 2 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 3 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 4 for ANODEV2: A Coupled Neural ODE Evolution Framework
Viaarxiv icon

HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision

Add code
Apr 29, 2019
Figure 1 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 2 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 3 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 4 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Viaarxiv icon

Trust Region Based Adversarial Attack on Neural Networks

Add code
Dec 16, 2018
Figure 1 for Trust Region Based Adversarial Attack on Neural Networks
Figure 2 for Trust Region Based Adversarial Attack on Neural Networks
Figure 3 for Trust Region Based Adversarial Attack on Neural Networks
Figure 4 for Trust Region Based Adversarial Attack on Neural Networks
Viaarxiv icon

Parameter Re-Initialization through Cyclical Batch Size Schedules

Add code
Dec 04, 2018
Figure 1 for Parameter Re-Initialization through Cyclical Batch Size Schedules
Figure 2 for Parameter Re-Initialization through Cyclical Batch Size Schedules
Figure 3 for Parameter Re-Initialization through Cyclical Batch Size Schedules
Figure 4 for Parameter Re-Initialization through Cyclical Batch Size Schedules
Viaarxiv icon