
Showing 1–9 of 9 results for author: Saglietti, L

  1. arXiv:2406.01589

    stat.ML cond-mat.dis-nn cs.LG q-bio.NC

    Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks

    Authors: Stefano Sarao Mannelli, Yaraslau Ivashinka, Andrew Saxe, Luca Saglietti

    Abstract: A wide range of empirical and theoretical works have shown that overparameterisation can amplify the performance of neural networks. According to the lottery ticket hypothesis, overparameterised networks have an increased chance of containing a sub-network that is well-initialised to solve the task at hand. A more parsimonious approach, inspired by animal learning, consists in guiding the learner…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2205.15935

    cs.LG cond-mat.dis-nn stat.ML

    Bias-inducing geometries: an exactly solvable data model with fairness implications

    Authors: Stefano Sarao Mannelli, Federica Gerace, Negar Rostamzadeh, Luca Saglietti

    Abstract: Machine learning (ML) may be oblivious to human bias but it is not immune to its perpetuation. Marginalisation and iniquitous group representation are often traceable in the very data used for training, and may be reflected or even enhanced by the learning models. In the present work, we aim at clarifying the role played by data geometry in the emergence of ML bias. We introduce an exactly solvabl…

    Submitted 13 November, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 9 pages + methods + SI

  3. arXiv:2106.08068

    cs.LG cond-mat.dis-nn stat.ML

    An Analytical Theory of Curriculum Learning in Teacher-Student Networks

    Authors: Luca Saglietti, Stefano Sarao Mannelli, Andrew Saxe

    Abstract: In humans and animals, curriculum learning -- presenting data in a curated order -- is critical to rapid learning and effective pedagogy. Yet in machine learning, curricula are not widely used and empirically often yield only moderate benefits. This stark difference in the importance of curriculum raises a fundamental theoretical question: when and why does curriculum learning help? In this work,…

    Submitted 12 October, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to NeurIPS 2022

  4. arXiv:1810.11738

    cs.LG stat.ML

    Gaussian Process Prior Variational Autoencoders

    Authors: Francesco Paolo Casale, Adrian V Dalca, Luca Saglietti, Jennifer Listgarten, Nicolo Fusi

    Abstract: Variational autoencoders (VAE) are a powerful and widely-used class of models to learn complex data distributions in an unsupervised fashion. One important limitation of VAEs is the prior assumption that latent sample representations are independent and identically distributed. However, for many important datasets, such as time-series of images, this assumption is too strong: accounting for covari…

    Submitted 26 November, 2018; v1 submitted 27 October, 2018; originally announced October 2018.

    Comments: Accepted at 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montréal, Canada

  5. arXiv:1710.09825

    cond-mat.dis-nn cs.LG cs.NE stat.ML

    On the role of synaptic stochasticity in training low-precision neural networks

    Authors: Carlo Baldassi, Federica Gerace, Hilbert J. Kappen, Carlo Lucibello, Luca Saglietti, Enzo Tartaglione, Riccardo Zecchina

    Abstract: Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performanc…

    Submitted 19 March, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: 7 pages + 14 pages of supplementary material

    Journal ref: Phys. Rev. Lett. 120, 268103 (2018)

  6. arXiv:1605.06444

    stat.ML cond-mat.dis-nn cs.LG

    Unreasonable Effectiveness of Learning Neural Networks: From Accessible States and Robust Ensembles to Basic Algorithmic Schemes

    Authors: Carlo Baldassi, Christian Borgs, Jennifer Chayes, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: In artificial neural networks, learning from data is a computationally demanding task in which a large number of connection weights are iteratively tuned through stochastic-gradient-based heuristic processes over a cost-function. It is not well understood how learning occurs in these systems, in particular how they avoid getting trapped in configurations with poor computational performance. Here w…

    Submitted 6 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: 31 pages (14 main text, 18 appendix), 12 figures (6 main text, 6 appendix)

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 113(48):E7655-E7662, 2016

  7. arXiv:1602.04129

    cond-mat.dis-nn q-bio.NC stat.ML

    Learning may need only a few bits of synaptic precision

    Authors: Carlo Baldassi, Federica Gerace, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: Learning in neural networks poses peculiar challenges when using discretized rather than continuous synaptic states. The choice of discrete synapses is motivated by biological reasoning and experiments, and possibly by hardware implementation considerations as well. In this paper we extend a previous large deviations analysis which unveiled the existence of peculiar dense regions in the space of s…

    Submitted 27 May, 2016; v1 submitted 12 February, 2016; originally announced February 2016.

    Comments: 38 pages (main text: 16 pages), 5 figures; http://link.aps.org/doi/10.1103/PhysRevE.93.052313

    Journal ref: Phys. Rev. E 93, 052313 (2016)

  8. arXiv:1511.05634

    cond-mat.dis-nn stat.ML

    Local entropy as a measure for sampling solutions in Constraint Satisfaction Problems

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We introduce a novel Entropy-driven Monte Carlo (EdMC) strategy to efficiently sample solutions of random Constraint Satisfaction Problems (CSPs). First, we extend a recent result that, using a large-deviation analysis, shows that the geometry of the space of solutions of the Binary Perceptron Learning Problem (a prototypical CSP), contains regions of very high-density of solutions. Despite being…

    Submitted 25 February, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: 46 pages (main text: 22), 7 figures. This is an author-created, un-copyedited version of an article published in Journal of Statistical Mechanics: Theory and Experiment. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at http://dx.doi.org/10.1088/1742-5468/2016/02/023301

    ACM Class: G.1.6; I.2.M

    Journal ref: J. Stat. Mech. 2016 (2) 023301

  9. arXiv:1509.05753

    cond-mat.dis-nn q-bio.NC stat.ML

    Subdominant Dense Clusters Allow for Simple Learning and High Computational Performance in Neural Networks with Discrete Synapses

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We show that discrete synaptic weights can be efficiently used for learning in large scale neural systems, and lead to unanticipated computational performance. We focus on the representative case of learning random patterns with binary synapses in single layer networks. The standard statistical analysis shows that this problem is exponentially dominated by isolated solutions that are extremely har…

    Submitted 18 September, 2015; originally announced September 2015.

    Comments: 11 pages, 4 figures (main text: 5 pages, 3 figures; Supplemental Material: 6 pages, 1 figure)

    Journal ref: Phys. Rev. Lett. 115, 128101 (2015); http://journals.aps.org/prl/abstract/10.1103/PhysRevLett.115.128101