Skip to main content

Showing 1–3 of 3 results for author: Bréchet, P

.
  1. arXiv:2303.03027  [pdf, other

    stat.ML cs.LG

    Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss

    Authors: Pierre Bréchet, Katerina Papagiannouli, **g An, Guido Montúfar

    Abstract: We consider a deep matrix factorization model of covariance matrices trained with the Bures-Wasserstein distance. While recent works have made advances in the study of the optimization problem for overparametrized low-rank matrix approximation, much emphasis has been placed on discriminative settings and the square loss. In contrast, our model considers another type of loss and connects with the g… ▽ More

    Submitted 13 July, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 42 pages, 3 figures, accepted at ICML 2023

  2. arXiv:2102.09671  [pdf, other

    cs.LG stat.ML

    When Are Solutions Connected in Deep Networks?

    Authors: Quynh Nguyen, Pierre Brechet, Marco Mondelli

    Abstract: The question of how and why the phenomenon of mode connectivity occurs in training deep neural networks has gained remarkable attention in the research community. From a theoretical perspective, two possible explanations have been proposed: (i) the loss function has connected sublevel sets, and (ii) the solutions found by stochastic gradient descent are dropout stable. While these explanations pro… ▽ More

    Submitted 21 October, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at NeurIPS 2021

  3. arXiv:1912.02160  [pdf, other

    cs.LG stat.ML

    Informative GANs via Structured Regularization of Optimal Transport

    Authors: Pierre Bréchet, Tao Wu, Thomas Möllenhoff, Daniel Cremers

    Abstract: We tackle the challenge of disentangled representation learning in generative adversarial networks (GANs) from the perspective of regularized optimal transport (OT). Specifically, a smoothed OT loss gives rise to an implicit transportation plan between the latent space and the data space. Based on this theoretical observation, we exploit a structured regularization on the transportation plan to en… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: Presented at the Optimal Transport and Machine Learning Workshop, NeurIPS 2019