Skip to main content

Showing 1–2 of 2 results for author: Pumir, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02052  [pdf, other

    cs.LG stat.ML

    PETRA: Parallel End-to-end Training with Reversible Architectures

    Authors: Stéphane Rivaud, Louis Fournier, Thomas Pumir, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon

    Abstract: Reversible architectures have been shown to be capable of performing on par with their non-reversible architectures, being applied in deep learning for memory savings and generative modeling. In this work, we show how reversible architectures can solve challenges in parallelizing deep model training. We introduce PETRA, a novel alternative to backpropagation for parallelizing gradient computations… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:1806.03763  [pdf, other

    stat.ML cs.LG math.OC

    Smoothed analysis of the low-rank approach for smooth semidefinite programs

    Authors: Thomas Pumir, Samy Jelassi, Nicolas Boumal

    Abstract: We consider semidefinite programs (SDPs) of size n with equality constraints. In order to overcome scalability issues, Burer and Monteiro proposed a factorized approach based on optimizing over a matrix Y of size $n$ by $k$ such that $X = YY^*$ is the SDP variable. The advantages of such formulation are twofold: the dimension of the optimization variable is reduced and positive semidefiniteness is… ▽ More

    Submitted 27 November, 2018; v1 submitted 10 June, 2018; originally announced June 2018.