Skip to main content

Showing 1–10 of 10 results for author: Lucibello, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2302.02951  [pdf, other

    cond-mat.stat-mech stat.ML

    Noise-cleaning the precision matrix of fMRI time series

    Authors: Miguel Ibáñez-Berganza, Carlo Lucibello, Francesca Santucci, Tommaso Gili, Andrea Gabrielli

    Abstract: We present a comparison between various algorithms of inference of covariance and precision matrices in small datasets of real vectors, of the typical length and dimension of human brain activity time series retrieved by functional Magnetic Resonance Imaging (fMRI). Assuming a Gaussian model underlying the neural activity, the problem consists in denoising the empirically observed matrices in orde… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures (of which 12 pages, 3 figures in the main text)

  2. arXiv:2110.14583  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Deep learning via message passing algorithms based on belief propagation

    Authors: Carlo Lucibello, Fabrizio Pittorino, Gabriele Perugini, Riccardo Zecchina

    Abstract: Message-passing algorithms based on the Belief Propagation (BP) equations constitute a well-known distributed computational scheme. It is exact on tree-like graphical models and has also proven to be effective in many problems defined on graphs with loops (from inference to optimization, from signal processing to clustering). The BP-based scheme is fundamentally different from stochastic gradient… ▽ More

    Submitted 15 March, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Journal ref: Mach. Learn.: Sci. Technol. 3 035005 (2022)

  3. arXiv:2006.07897  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Entropic gradient descent algorithms and wide flat minima

    Authors: Fabrizio Pittorino, Carlo Lucibello, Christoph Feinauer, Gabriele Perugini, Carlo Baldassi, Elizaveta Demyanenko, Riccardo Zecchina

    Abstract: The properties of flat minima in the empirical risk landscape of neural networks have been debated for some time. Increasing evidence suggests they possess better generalization capabilities with respect to sharp ones. First, we discuss Gaussian mixture classification models and show analytically that there exist Bayes optimal pointwise estimators which correspond to minimizers belonging to wide f… ▽ More

    Submitted 15 November, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: ICLR 2021 camera-ready

  4. arXiv:1911.06756  [pdf, other

    cond-mat.dis-nn stat.ML

    Clustering of solutions in the symmetric binary perceptron

    Authors: Carlo Baldassi, Riccardo Della Vecchia, Carlo Lucibello, Riccardo Zecchina

    Abstract: The geometrical features of the (non-convex) loss landscape of neural network models are crucial in ensuring successful optimization and, most importantly, the capability to generalize well. While minimizers' flatness consistently correlates with good generalization, there has been little rigorous work in exploring the condition of existence of such minimizers, even in toy models. Here we consider… ▽ More

    Submitted 11 May, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Journal ref: J. Stat. Mech. (2020) 073303

  5. arXiv:1902.00177  [pdf, other

    stat.ML cs.LG

    Critical initialisation in continuous approximations of binary neural networks

    Authors: George Stamatescu, Federica Gerace, Carlo Lucibello, Ian Fuss, Langford B. White

    Abstract: The training of stochastic neural network models with binary ($\pm1$) weights and activations via continuous surrogate networks is investigated. We derive new surrogates using a novel derivation based on writing the stochastic neural network as a Markov chain. This derivation also encompasses existing variants of the surrogates presented in the literature. Following this, we theoretically study th… ▽ More

    Submitted 9 April, 2020; v1 submitted 31 January, 2019; originally announced February 2019.

    Journal ref: ICLR 2020

  6. arXiv:1710.09825  [pdf, other

    cond-mat.dis-nn cs.LG cs.NE stat.ML

    On the role of synaptic stochasticity in training low-precision neural networks

    Authors: Carlo Baldassi, Federica Gerace, Hilbert J. Kappen, Carlo Lucibello, Luca Saglietti, Enzo Tartaglione, Riccardo Zecchina

    Abstract: Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performanc… ▽ More

    Submitted 19 March, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: 7 pages + 14 pages of supplementary material

    Journal ref: Phys. Rev. Lett. 120, 268103 (2018)

  7. arXiv:1605.06444  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Unreasonable Effectiveness of Learning Neural Networks: From Accessible States and Robust Ensembles to Basic Algorithmic Schemes

    Authors: Carlo Baldassi, Christian Borgs, Jennifer Chayes, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: In artificial neural networks, learning from data is a computationally demanding task in which a large number of connection weights are iteratively tuned through stochastic-gradient-based heuristic processes over a cost-function. It is not well understood how learning occurs in these systems, in particular how they avoid getting trapped in configurations with poor computational performance. Here w… ▽ More

    Submitted 6 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: 31 pages (14 main text, 18 appendix), 12 figures (6 main text, 6 appendix)

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 113(48):E7655-E7662, 2016

  8. arXiv:1602.04129  [pdf, ps, other

    cond-mat.dis-nn q-bio.NC stat.ML

    Learning may need only a few bits of synaptic precision

    Authors: Carlo Baldassi, Federica Gerace, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: Learning in neural networks poses peculiar challenges when using discretized rather then continuous synaptic states. The choice of discrete synapses is motivated by biological reasoning and experiments, and possibly by hardware implementation considerations as well. In this paper we extend a previous large deviations analysis which unveiled the existence of peculiar dense regions in the space of s… ▽ More

    Submitted 27 May, 2016; v1 submitted 12 February, 2016; originally announced February 2016.

    Comments: 38 pages (main text: 16 pages), 5 figures; http://link.aps.org/doi/10.1103/PhysRevE.93.052313

    Journal ref: Phys. Rev. E 93, 052313 (2016)

  9. arXiv:1511.05634  [pdf, ps, other

    cond-mat.dis-nn stat.ML

    Local entropy as a measure for sampling solutions in Constraint Satisfaction Problems

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We introduce a novel Entropy-driven Monte Carlo (EdMC) strategy to efficiently sample solutions of random Constraint Satisfaction Problems (CSPs). First, we extend a recent result that, using a large-deviation analysis, shows that the geometry of the space of solutions of the Binary Perceptron Learning Problem (a prototypical CSP), contains regions of very high-density of solutions. Despite being… ▽ More

    Submitted 25 February, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: 46 pages (main text: 22), 7 figures. This is an author-created, un-copyedited version of an article published in Journal of Statistical Mechanics: Theory and Experiment. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at http://dx.doi.org/10.1088/1742-5468/2016/02/023301

    ACM Class: G.1.6; I.2.M

    Journal ref: J. Stat. Mech. 2016 (2) 023301

  10. arXiv:1509.05753  [pdf, other

    cond-mat.dis-nn q-bio.NC stat.ML

    Subdominant Dense Clusters Allow for Simple Learning and High Computational Performance in Neural Networks with Discrete Synapses

    Authors: Carlo Baldassi, Alessandro Ingrosso, Carlo Lucibello, Luca Saglietti, Riccardo Zecchina

    Abstract: We show that discrete synaptic weights can be efficiently used for learning in large scale neural systems, and lead to unanticipated computational performance. We focus on the representative case of learning random patterns with binary synapses in single layer networks. The standard statistical analysis shows that this problem is exponentially dominated by isolated solutions that are extremely har… ▽ More

    Submitted 18 September, 2015; originally announced September 2015.

    Comments: 11 pages, 4 figures (main text: 5 pages, 3 figures; Supplemental Material: 6 pages, 1 figure)

    Journal ref: Physical Review Letters, 15, 128101 (2015) url=http://journals.aps.org/prl/abstract/10.1103/PhysRevLett.115.128101