Authors
Youhui Bai, Cheng Li, Quan Zhou, Jun Yi, Ping Gong, Feng Yan, Ruichuan Chen, Yinlong Xu
Abstract
Tensor Homomorphic Compression (THC) enables compressed gradients to be aggregated directly, without decompressing them first, while maintaining compression efficiency. This significantly reduces the communication bottleneck in data-parallel DNN training.
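To make the idea of aggregating compressed gradients concrete, the following is a minimal sketch, not the authors' scheme. It assumes a uniform quantizer whose step size `scale` is shared by all workers; the function names and the toy gradients are hypothetical and chosen only for illustration. Because every worker uses the same step, summing the integer codes and decompressing once gives the same result as decompressing each worker's codes and then summing.

```python
from typing import List


def quantize(grad: List[float], scale: float) -> List[int]:
    """Map each value to an integer code using a step size shared by all workers."""
    return [round(g / scale) for g in grad]


def aggregate_codes(codes: List[List[int]]) -> List[int]:
    """Sum the integer codes element-wise, without decompressing them."""
    return [sum(column) for column in zip(*codes)]


def dequantize(codes: List[int], scale: float) -> List[float]:
    """Recover approximate gradient values from integer codes."""
    return [c * scale for c in codes]


# Hypothetical shared quantization step and toy per-worker gradients.
scale = 0.25
worker_grads = [
    [0.5, -1.0, 0.25],
    [0.25, 0.5, -0.75],
]

codes = [quantize(g, scale) for g in worker_grads]

# Path 1: aggregate in the compressed domain, then decompress once.
summed = dequantize(aggregate_codes(codes), scale)

# Path 2: decompress every worker's codes, then aggregate.
baseline = [sum(column) for column in zip(*[dequantize(c, scale) for c in codes])]

print(summed)
print(baseline)
```

In this sketch the aggregation step only adds integers, so an aggregator such as a parameter server would never need to run the decompression routine on individual workers' data. Real schemes must also handle quantization error and the overflow of summed codes, which this example ignores.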