Skip to main content

Showing 1–2 of 2 results for author: Menegaux, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.10933  [pdf, other

    cs.LG

    Self-Attention in Colors: Another Take on Encoding Graph Structure in Transformers

    Authors: Romain Menegaux, Emmanuel Jehanno, Margot Selosse, Julien Mairal

    Abstract: We introduce a novel self-attention mechanism, which we call CSA (Chromatic Self-Attention), which extends the notion of attention scores to attention _filters_, independently modulating the feature channels. We showcase CSA in a fully-attentional graph Transformer CGT (Chromatic Graph Transformer) which integrates both graph structural information and edge features, completely bypassing the need… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  2. arXiv:2302.02904  [pdf, other

    cs.LG math.OC

    Rethinking Gauss-Newton for learning over-parameterized models

    Authors: Michael Arbel, Romain Menegaux, Pierre Wolinski

    Abstract: This work studies the global convergence and implicit bias of Gauss Newton's (GN) when optimizing over-parameterized one-hidden layer networks in the mean-field regime. We first establish a global convergence result for GN in the continuous-time limit exhibiting a faster convergence rate compared to GD due to improved conditioning. We then perform an empirical study on a synthetic regression task… ▽ More

    Submitted 12 December, 2023; v1 submitted 6 February, 2023; originally announced February 2023.