Skip to main content

Showing 1–1 of 1 results for author: Molero, E C

.
  1. arXiv:2309.16214  [pdf, other

    cs.DC cs.NI

    Canary: Congestion-Aware In-Network Allreduce Using Dynamic Trees

    Authors: Daniele De Sensi, Edgar Costa Molero, Salvatore Di Girolamo, Laurent Vanbever, Torsten Hoefler

    Abstract: The allreduce operation is an essential building block for many distributed applications, ranging from the training of deep learning models to scientific computing. In an allreduce operation, data from multiple hosts is aggregated together and then broadcasted to each host participating in the operation. Allreduce performance can be improved by a factor of two by aggregating the data directly in t… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    ACM Class: C.2.1; C.2.2; C.2.4; C.5.1