Understanding Gradient Compression in Distributed Training
As deep learning models continue to grow in size, distributed training has become essential for keeping training times manageable. However, synchronizing gradients between nodes introduces communication overhead that can become a significant bottleneck. This is where gradient compression comes into play.
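
To make the idea concrete, here is a minimal sketch of one common compression scheme, top-k sparsification, where each worker transmits only the largest-magnitude gradient entries instead of the full dense gradient. The function names (`topk_compress`, `topk_decompress`) and the 1% ratio are illustrative assumptions for this post, not the API of any particular library.

```python
import torch

def topk_compress(grad: torch.Tensor, ratio: float = 0.01):
    """Keep only the largest-magnitude `ratio` fraction of gradient entries.

    Returns the kept values and their flat indices, which is all a worker
    would need to send instead of the dense gradient.
    """
    flat = grad.flatten()
    k = max(1, int(flat.numel() * ratio))
    # Select the k entries with the largest absolute value.
    _, indices = torch.topk(flat.abs(), k)
    return flat[indices], indices

def topk_decompress(values: torch.Tensor, indices: torch.Tensor, shape):
    """Rebuild a dense (mostly zero) gradient from the compressed form."""
    flat = torch.zeros(torch.Size(shape).numel(), dtype=values.dtype)
    flat[indices] = values
    return flat.reshape(shape)

# Example: a 1M-entry gradient shrinks to roughly 1% of its values.
grad = torch.randn(1000, 1000)
values, indices = topk_compress(grad, ratio=0.01)
restored = topk_decompress(values, indices, grad.shape)
```

In a real system the values and indices would be what gets sent over the network, and the receiving side would decompress before applying the update; the sketch above only illustrates the compression step itself.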
