Picture for Anze Xie

Anze Xie

LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Add code
Oct 05, 2023
Viaarxiv icon