Picture for Yanli Zhao

Yanli Zhao

Wukong: Towards a Scaling Law for Large-Scale Recommendation

Add code
Mar 08, 2024
Viaarxiv icon

Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

Add code
Mar 07, 2024
Figure 1 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 2 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 3 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 4 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Viaarxiv icon

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Add code
Apr 21, 2023
Viaarxiv icon

PyTorch Distributed: Experiences on Accelerating Data Parallel Training

Add code
Jun 28, 2020
Figure 1 for PyTorch Distributed: Experiences on Accelerating Data Parallel Training
Figure 2 for PyTorch Distributed: Experiences on Accelerating Data Parallel Training
Figure 3 for PyTorch Distributed: Experiences on Accelerating Data Parallel Training
Figure 4 for PyTorch Distributed: Experiences on Accelerating Data Parallel Training
Viaarxiv icon