Picture for Yuchen Hao

Yuchen Hao

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

Add code
Jul 05, 2024
Viaarxiv icon

Wukong: Towards a Scaling Law for Large-Scale Recommendation

Add code
Mar 08, 2024
Viaarxiv icon

Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

Add code
Mar 07, 2024
Figure 1 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 2 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 3 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 4 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Viaarxiv icon

Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

Add code
Nov 14, 2023
Figure 1 for Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale
Figure 2 for Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale
Figure 3 for Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale
Figure 4 for Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale
Viaarxiv icon

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Add code
Apr 21, 2023
Viaarxiv icon

DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction

Add code
Mar 11, 2022
Figure 1 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 2 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 3 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 4 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Viaarxiv icon

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Add code
Apr 15, 2021
Figure 1 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 2 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 3 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 4 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Viaarxiv icon