Picture for Zhenglei Zhou

Zhenglei Zhou

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

Add code
Apr 15, 2024
Viaarxiv icon

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Add code
Jan 09, 2024
Viaarxiv icon

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Add code
Dec 19, 2023
Viaarxiv icon