TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Add code
Feb 20, 2023

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: