Picture for Youwei Zhuo

Youwei Zhuo

Heterogeneity-Aware Asynchronous Decentralized Training

Add code
Sep 17, 2019
Figure 1 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 2 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 3 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 4 for Heterogeneity-Aware Asynchronous Decentralized Training
Viaarxiv icon

Hop: Heterogeneity-Aware Decentralized Training

Add code
Feb 07, 2019
Figure 1 for Hop: Heterogeneity-Aware Decentralized Training
Figure 2 for Hop: Heterogeneity-Aware Decentralized Training
Figure 3 for Hop: Heterogeneity-Aware Decentralized Training
Figure 4 for Hop: Heterogeneity-Aware Decentralized Training
Viaarxiv icon

HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array

Add code
Jan 07, 2019
Figure 1 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 2 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 3 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 4 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Viaarxiv icon

E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs

Add code
Dec 12, 2018
Figure 1 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 2 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 3 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 4 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Viaarxiv icon

CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices

Add code
Aug 29, 2017
Figure 1 for CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Figure 2 for CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Figure 3 for CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Figure 4 for CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Viaarxiv icon