Modeling the Impact of Fiber Latency on Compute-Communication Overlap in Geo-Distributed Multi-Datacenter AI Training

2026-05-18Performance

PerformanceDistributed, Parallel, and Cluster Computing
AI summary

AI summary is being generated…

Authors
Ioannis Papavasileiou, Sairam Prabhakar, Indu Kant Deo, Sergejs Makovejs
Abstract
We use discrete-event simulation to quantify the impact of fiber latency on the efficacy of geo-distributed AI model training with data parallelism. We conclude that the optimum distances between two AI clusters is 10-100km, over which hollow-core fiber enables 25% higher compute-communication overlap.