Skip to main content

Showing 1–7 of 7 results for author: Ingrosso, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.03260  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Feature learning in finite-width Bayesian deep linear networks with multiple outputs and convolutional layers

    Authors: Federico Bassetti, Marco Gherardi, Alessandro Ingrosso, Mauro Pastore, Pietro Rotondo

    Abstract: Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript, we provide rigorous results for the statistics of functions implemented by the aforementioned class of networks, thus moving closer to a complete characterizatio… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    MSC Class: 62E20; 62E15; 82B44

  2. arXiv:2211.11567  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Neural networks trained with SGD learn distributions of increasing complexity

    Authors: Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt

    Abstract: The ability of deep neural networks to generalise well even when they interpolate their training data has been explained using various "simplicity biases". These theories postulate that neural networks avoid overfitting by first learning simple functions, say a linear classifier, before learning more complex, non-linear functions. Meanwhile, data structure is also recognised as a key ingredient fo… ▽ More

    Submitted 26 May, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Source code available at https://github.com/sgoldt/dist_inc_comp

    Journal ref: ICML 2023

  3. arXiv:2202.00565  [pdf, other

    cond-mat.dis-nn q-bio.NC stat.ML

    Data-driven emergence of convolutional structure in neural networks

    Authors: Alessandro Ingrosso, Sebastian Goldt

    Abstract: Exploiting data invariances is crucial for efficient learning in both artificial and biological neural circuits. Understanding how neural networks can discover appropriate representations capable of harnessing the underlying symmetries of their inputs is thus crucial in machine learning and neuroscience. Convolutional neural networks, for example, were designed to exploit translation symmetry and… ▽ More

    Submitted 18 August, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Main text: 19 pages, 4 figures; Supplementary Material: 4 pages, 4 figures

    Journal ref: Proceedings of the National Academy of Science vol 119 (40) e2201854119 (2022)

  4. arXiv:1605.06444  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Unreasonable Effectiveness of Learning Neural Networks: From Accessible States and Robust Ensembles to Basic Algorithmic Schemes

    Authors: Carlo Baldassi, Christian Borgs, Jennifer Chayes, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: In artificial neural networks, learning from data is a computationally demanding task in which a large number of connection weights are iteratively tuned through stochastic-gradient-based heuristic processes over a cost-function. It is not well understood how learning occurs in these systems, in particular how they avoid getting trapped in configurations with poor computational performance. Here w… ▽ More

    Submitted 6 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: 31 pages (14 main text, 18 appendix), 12 figures (6 main text, 6 appendix)

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 113(48):E7655-E7662, 2016

  5. arXiv:1602.01889  [pdf, ps, other

    q-bio.NC stat.ML

    Discovering Neuronal Cell Types and Their Gene Expression Profiles Using a Spatial Point Process Mixture Model

    Authors: Furong Huang, Animashree Anandkumar, Christian Borgs, Jennifer Chayes, Ernest Fraenkel, Michael Hawrylycz, Ed Lein, Alessandro Ingrosso, Srinivas Turaga

    Abstract: Cataloging the neuronal cell types that comprise circuitry of individual brain regions is a major goal of modern neuroscience and the BRAIN initiative. Single-cell RNA sequencing can now be used to measure the gene expression profiles of individual neurons and to categorize neurons based on their gene expression profiles. While the single-cell techniques are extremely powerful and hold great promi… ▽ More

    Submitted 10 June, 2016; v1 submitted 4 February, 2016; originally announced February 2016.

  6. arXiv:1511.05634  [pdf, ps, other

    cond-mat.dis-nn stat.ML

    Local entropy as a measure for sampling solutions in Constraint Satisfaction Problems

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We introduce a novel Entropy-driven Monte Carlo (EdMC) strategy to efficiently sample solutions of random Constraint Satisfaction Problems (CSPs). First, we extend a recent result that, using a large-deviation analysis, shows that the geometry of the space of solutions of the Binary Perceptron Learning Problem (a prototypical CSP), contains regions of very high-density of solutions. Despite being… ▽ More

    Submitted 25 February, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: 46 pages (main text: 22), 7 figures. This is an author-created, un-copyedited version of an article published in Journal of Statistical Mechanics: Theory and Experiment. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at http://dx.doi.org/10.1088/1742-5468/2016/02/023301

    ACM Class: G.1.6; I.2.M

    Journal ref: J. Stat. Mech. 2016 (2) 023301

  7. arXiv:1509.05753  [pdf, other

    cond-mat.dis-nn q-bio.NC stat.ML

    Subdominant Dense Clusters Allow for Simple Learning and High Computational Performance in Neural Networks with Discrete Synapses

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We show that discrete synaptic weights can be efficiently used for learning in large scale neural systems, and lead to unanticipated computational performance. We focus on the representative case of learning random patterns with binary synapses in single layer networks. The standard statistical analysis shows that this problem is exponentially dominated by isolated solutions that are extremely har… ▽ More

    Submitted 18 September, 2015; originally announced September 2015.

    Comments: 11 pages, 4 figures (main text: 5 pages, 3 figures; Supplemental Material: 6 pages, 1 figure)

    Journal ref: Physical Review Letters, 15, 128101 (2015) url=http://journals.aps.org/prl/abstract/10.1103/PhysRevLett.115.128101