Authors
Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou
Abstract
nnScaler enables domain experts to construct custom search spaces for parallel DNN training via three primitives, achieving up to a 3.5x speedup over existing solutions such as DeepSpeed, Megatron-LM, and Alpa.