Skip to main content

Showing 1–9 of 9 results for author: Refinetti, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.05440  [pdf, other

    cs.LG

    Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization

    Authors: Ravi Srinivasan, Francesca Mignacco, Martino Sorbaro, Maria Refinetti, Avi Cooper, Gabriel Kreiman, Giorgia Dellaferrera

    Abstract: "Forward-only" algorithms, which train neural networks while avoiding a backward pass, have recently gained attention as a way of solving the biologically unrealistic aspects of backpropagation. Here, we first address compelling challenges related to the "forward-only" rules, which include reducing the performance gap with backpropagation and providing an analytical understanding of their dynamics… ▽ More

    Submitted 22 March, 2024; v1 submitted 10 February, 2023; originally announced February 2023.

  2. arXiv:2211.11567  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Neural networks trained with SGD learn distributions of increasing complexity

    Authors: Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt

    Abstract: The ability of deep neural networks to generalise well even when they interpolate their training data has been explained using various "simplicity biases". These theories postulate that neural networks avoid overfitting by first learning simple functions, say a linear classifier, before learning more complex, non-linear functions. Meanwhile, data structure is also recognised as a key ingredient fo… ▽ More

    Submitted 26 May, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Source code available at https://github.com/sgoldt/dist_inc_comp

    Journal ref: ICML 2023

  3. arXiv:2202.04509  [pdf, other

    cs.LG stat.ML

    Optimal learning rate schedules in high-dimensional non-convex optimization problems

    Authors: Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli

    Abstract: Learning rate schedules are ubiquitously used to speed up and improve optimisation. Many different policies have been introduced on an empirical basis, and theoretical analyses have been developed for convex settings. However, in many realistic problems the loss-landscape is high-dimensional and non convex -- a case for which results are scarce. In this paper we present a first analytical study of… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  4. arXiv:2201.13383  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension

    Authors: Bruno Loureiro, Cédric Gerbelot, Maria Refinetti, Gabriele Sicuro, Florent Krzakala

    Abstract: From the sampling of data to the initialisation of parameters, randomness is ubiquitous in modern Machine Learning practice. Understanding the statistical fluctuations engendered by the different sources of randomness in prediction is therefore key to understanding robust generalisation. In this manuscript we develop a quantitative and rigorous theory for the study of fluctuations in an ensemble o… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 17 pages + Appendix

    Journal ref: Proceedings of the 39th International Conference on Machine Learning (ICML). PMLR 162:14283-14314, 2022

  5. arXiv:2201.02115  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    The dynamics of representation learning in shallow, non-linear autoencoders

    Authors: Maria Refinetti, Sebastian Goldt

    Abstract: Autoencoders are the simplest neural network for unsupervised learning, and thus an ideal framework for studying feature learning. While a detailed understanding of the dynamics of linear autoencoders has recently been obtained, the study of non-linear autoencoders has been hindered by the technical difficulty of handling training data with non-trivial correlations - a fundamental prerequisite for… ▽ More

    Submitted 16 June, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:18499-18519 (2022)

  6. arXiv:2102.11742  [pdf, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech stat.ML

    Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

    Authors: Maria Refinetti, Sebastian Goldt, Florent Krzakala, Lenka Zdeborová

    Abstract: A recent series of theoretical works showed that the dynamics of neural networks with a certain initialisation are well-captured by kernel methods. Concurrent empirical work demonstrated that kernel methods can come close to the performance of neural networks on some image classification tasks. These results raise the question of whether neural networks only learn successfully if kernels also lear… ▽ More

    Submitted 10 June, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: The accompanying code for this paper is available at https://github.com/mariaref/rfvs2lnn_GMM_online

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  7. arXiv:2011.12428  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG cs.NE

    Align, then memorise: the dynamics of learning with feedback alignment

    Authors: Maria Refinetti, Stéphane d'Ascoli, Ruben Ohana, Sebastian Goldt

    Abstract: Direct Feedback Alignment (DFA) is emerging as an efficient and biologically plausible alternative to the ubiquitous backpropagation algorithm for training deep neural networks. Despite relying on random feedback weights for the backward pass, DFA successfully trains state-of-the-art models such as Transformers. On the other hand, it notoriously fails to train convolutional networks. An understand… ▽ More

    Submitted 10 June, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: The accompanying code for this paper is available at https://github.com/sdascoli/dfa-dynamics

    Journal ref: Proceedings of the 38th International Conference on Machine Learning (ICML), PMLR 139, 2021

  8. arXiv:2009.09422  [pdf, other

    q-bio.PE cond-mat.stat-mech cs.AI cs.LG

    Epidemic mitigation by statistical inference from contact tracing data

    Authors: Antoine Baker, Indaco Biazzo, Alfredo Braunstein, Giovanni Catania, Luca Dall'Asta, Alessandro Ingrosso, Florent Krzakala, Fabio Mazza, Marc Mézard, Anna Paola Muntoni, Maria Refinetti, Stefano Sarao Mannelli, Lenka Zdeborová

    Abstract: Contact-tracing is an essential tool in order to mitigate the impact of pandemic such as the COVID-19. In order to achieve efficient and scalable contact-tracing in real time, digital devices can play an important role. While a lot of attention has been paid to analyzing the privacy and ethical risks of the associated mobile applications, so far much less research has been devoted to optimizing th… ▽ More

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: 21 pages, 7 figures

    ACM Class: G.3; G.4; I.2.11; J.3

    Journal ref: PNAS 2021 Vol. 118 No. 32 e2106548118

  9. arXiv:2003.01054  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Double Trouble in Double Descent : Bias and Variance(s) in the Lazy Regime

    Authors: Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

    Abstract: Deep neural networks can achieve remarkable generalization performances while interpolating the training data perfectly. Rather than the U-curve emblematic of the bias-variance trade-off, their test error often follows a "double descent" - a mark of the beneficial role of overparametrization. In this work, we develop a quantitative theory for this phenomenon in the so-called lazy learning regime o… ▽ More

    Submitted 3 April, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 29 pages, 12 figures