Picture for Andrew Tulloch

Andrew Tulloch

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

MTrainS: Improving DLRM training efficiency using heterogeneous memories

Add code
Apr 19, 2023
Figure 1 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 2 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 3 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 4 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Viaarxiv icon

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Add code
Apr 15, 2021
Figure 1 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 2 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 3 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 4 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Viaarxiv icon

Mixed-Precision Embedding Using a Cache

Add code
Oct 23, 2020
Figure 1 for Mixed-Precision Embedding Using a Cache
Figure 2 for Mixed-Precision Embedding Using a Cache
Figure 3 for Mixed-Precision Embedding Using a Cache
Figure 4 for Mixed-Precision Embedding Using a Cache
Viaarxiv icon

Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition

Add code
Nov 19, 2019
Figure 1 for Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition
Figure 2 for Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition
Figure 3 for Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition
Figure 4 for Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition
Viaarxiv icon

Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

Add code
Nov 29, 2018
Figure 1 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 2 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 3 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 4 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Viaarxiv icon

On Periodic Functions as Regularizers for Quantization of Neural Networks

Add code
Nov 24, 2018
Figure 1 for On Periodic Functions as Regularizers for Quantization of Neural Networks
Figure 2 for On Periodic Functions as Regularizers for Quantization of Neural Networks
Figure 3 for On Periodic Functions as Regularizers for Quantization of Neural Networks
Figure 4 for On Periodic Functions as Regularizers for Quantization of Neural Networks
Viaarxiv icon

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Add code
Apr 30, 2018
Figure 1 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 2 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 3 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 4 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Viaarxiv icon

High performance ultra-low-precision convolutions on mobile devices

Add code
Dec 06, 2017
Figure 1 for High performance ultra-low-precision convolutions on mobile devices
Viaarxiv icon