Skip to main content

Showing 1–3 of 3 results for author: Flügel, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01067  [pdf, other

    cs.LG cs.AI cs.DC

    AB-Training: A Communication-Efficient Approach for Distributed Low-Rank Learning

    Authors: Daniel Coquelin, Katherina Flügel, Marie Weiel, Nicholas Kiefer, Muhammed Öz, Charlotte Debus, Achim Streit, Markus Götz

    Abstract: Communication bottlenecks severely hinder the scalability of distributed neural network training, particularly in high-performance computing (HPC) environments. We introduce AB-training, a novel data-parallel method that leverages low-rank representations and independent training groups to significantly reduce communication overhead. Our experiments demonstrate an average reduction in network traf… ▽ More

    Submitted 30 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2401.08505  [pdf, other

    cs.LG cs.AI

    Harnessing Orthogonality to Train Low-Rank Neural Networks

    Authors: Daniel Coquelin, Katharina Flügel, Marie Weiel, Nicholas Kiefer, Charlotte Debus, Achim Streit, Markus Götz

    Abstract: This study explores the learning dynamics of neural networks by analyzing the singular value decomposition (SVD) of their weights throughout training. Our investigation reveals that an orthogonal basis within each multidimensional weight's SVD representation stabilizes during training. Building upon this, we introduce Orthogonality-Informed Adaptive Low-Rank (OIALR) training, a novel training meth… ▽ More

    Submitted 22 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2304.13372  [pdf, other

    cs.LG cs.AI

    Feed-Forward Optimization With Delayed Feedback for Neural Networks

    Authors: Katharina Flügel, Daniel Coquelin, Marie Weiel, Charlotte Debus, Achim Streit, Markus Götz

    Abstract: Backpropagation has long been criticized for being biologically implausible, relying on concepts that are not viable in natural learning processes. This paper proposes an alternative approach to solve two core issues, i.e., weight transport and update locking, for biological plausibility and computational efficiency. We introduce Feed-Forward with delayed Feedback (F$^3$), which improves upon prio… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.