Showing 1–23 of 23 results for author: Malatesta, E M

  1. arXiv:2407.07572  [pdf, other]

    q-bio.NC cond-mat.dis-nn

    Impact of dendritic non-linearities on the computational capabilities of neurons

    Authors: Clarissa Lauditi, Enrico M. Malatesta, Fabrizio Pittorino, Carlo Baldassi, Nicolas Brunel, Riccardo Zecchina

    Abstract: Multiple neurophysiological experiments have shown that dendritic non-linearities can have a strong influence on synaptic input integration. In this work we model a single neuron as a two-layer computational unit with non-overlapping sign-constrained synaptic weights and a biologically plausible form of dendritic non-linearity, which is analytically tractable using statistical physics methods. Usi…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 35 pages, 11 figures

  2. arXiv:2407.05658  [pdf, other]

    cond-mat.dis-nn cs.LG cs.NE

    Random Features Hopfield Networks generalize retrieval to previously unseen examples

    Authors: Silvio Kalaj, Clarissa Lauditi, Gabriele Perugini, Carlo Lucibello, Enrico M. Malatesta, Matteo Negri

    Abstract: It has been recently shown that a learning transition happens when a Hopfield Network stores examples generated as superpositions of random features, where new attractors corresponding to such features appear in the model. In this work we reveal that the network also develops attractors corresponding to previously unseen examples generated with the same set of features. We explain this surprising…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2405.18191  [pdf, other]

    hep-th

    Instantons in $φ^4$ Theories: Transseries, Virial Theorems and Numerical Aspects

    Authors: Ludovico T. Giorgini, Ulrich D. Jentschura, Enrico M. Malatesta, Tommaso Rizzo, Jean Zinn-Justin

    Abstract: We discuss numerical aspects of instantons in two- and three-dimensional $φ^4$ theories with an internal $O(N)$ symmetry group, the so-called $N$-vector model. Combining asymptotic transseries expansions for large argument with convergence acceleration techniques, we obtain high-precision values for certain integrals of the instanton that naturally occur in loop corrections around instanton config…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 11 pages; RevTeX

  4. arXiv:2401.12610  [pdf, other]

    cs.LG cond-mat.dis-nn math.PR math.ST

    The twin peaks of learning neural networks

    Authors: Elizaveta Demyanenko, Christoph Feinauer, Enrico M. Malatesta, Luca Saglietti

    Abstract: Recent works demonstrated the existence of a double-descent phenomenon for the generalization error of neural networks, where highly overparameterized models escape overfitting and achieve good test performance, at odds with the standard bias-variance trade-off described by statistical learning theory. In the present work, we explore a link between this phenomenon and the increase of complexity an…

    Submitted 1 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 37 pages, 31 figures

  5. arXiv:2309.09240  [pdf, other]

    cond-mat.dis-nn cs.LG math.PR math.ST

    High-dimensional manifold of solutions in neural networks: insights from statistical physics

    Authors: Enrico M. Malatesta

    Abstract: In these pedagogic notes I review the statistical mechanics approach to neural networks, focusing on the paradigmatic example of the perceptron architecture with binary and continuous weights, in the classification setting. I will review Gardner's approach based on the replica method and the derivation of the SAT/UNSAT transition in the storage setting. Then, I discuss some recent works that unveil…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 22 pages, 9 figures, based on a set of lectures given at the "School of the Italian Society of Statistical Physics", IMT, Lucca

  6. arXiv:2305.10623  [pdf, other]

    cond-mat.dis-nn cs.LG math.PR math.ST

    The star-shaped space of solutions of the spherical negative perceptron

    Authors: Brandon Livio Annesi, Clarissa Lauditi, Carlo Lucibello, Enrico M. Malatesta, Gabriele Perugini, Fabrizio Pittorino, Luca Saglietti

    Abstract: Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where zero-energy paths between pairs of distant solutions can be constructed. Here we consider the spherical negative perceptron, a prototypical non-convex neural network model framed as a continuous constraint satisfaction problem. We introduce a genera…

    Submitted 5 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 27 pages, 16 figures, comments are welcome

  7. arXiv:2304.13871  [pdf, other]

    cond-mat.dis-nn cs.LG math.PR math.ST

    Typical and atypical solutions in non-convex neural networks with discrete and continuous weights

    Authors: Carlo Baldassi, Enrico M. Malatesta, Gabriele Perugini, Riccardo Zecchina

    Abstract: We study the binary and continuous negative-margin perceptrons as simple non-convex neural network models learning random rules and associations. We analyze the geometry of the landscape of solutions in both models and find important similarities and differences. Both models exhibit subdominant minimizers which are extremely flat and wide. These minimizers coexist with a background of dominant sol…

    Submitted 24 July, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 34 pages, 13 figures

  8. arXiv:2111.12765  [pdf, other]

    hep-th cond-mat.dis-nn cond-mat.stat-mech

    Correlation Functions of the Anharmonic Oscillator: Numerical Verification of Two-Loop Corrections to the Large-Order Behavior

    Authors: Ludovico T. Giorgini, Ulrich D. Jentschura, Enrico M. Malatesta, Giorgio Parisi, Tommaso Rizzo, Jean Zinn-Justin

    Abstract: Recently, the large-order behavior of correlation functions of the $O(N)$-anharmonic oscillator has been analyzed by us in [L. T. Giorgini et al., Phys. Rev. D 101, 125001 (2020)]. Two-loop corrections about the instanton configurations were obtained for the partition function, and the two-point and four-point functions, and the derivative of the two-point function at zero momentum transfer. Here,…

    Submitted 1 December, 2021; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: 18 pages, 4 figures

  9. arXiv:2110.00683  [pdf, other]

    cs.LG cond-mat.dis-nn math.PR stat.ML

    Learning through atypical "phase transitions" in overparameterized neural networks

    Authors: Carlo Baldassi, Clarissa Lauditi, Enrico M. Malatesta, Rosalba Pacelli, Gabriele Perugini, Riccardo Zecchina

    Abstract: Current deep neural networks are highly overparameterized (up to billions of connection weights) and nonlinear. Yet they can fit data almost perfectly through variants of gradient descent algorithms and achieve unexpected levels of prediction accuracy without overfitting. These are formidable results that defy predictions of statistical learning and pose conceptual challenges for non-convex optimi…

    Submitted 11 June, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: 28 pages, 14 figures

  10. arXiv:2107.01163  [pdf, other]

    cond-mat.dis-nn cs.LG math-ph math.PR

    Unveiling the structure of wide flat minima in neural networks

    Authors: Carlo Baldassi, Clarissa Lauditi, Enrico M. Malatesta, Gabriele Perugini, Riccardo Zecchina

    Abstract: The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In particular, the fact that learning algorithms based on simple variants of gradient methods are able to find near-optimal minima of highly nonconvex loss functions is an unexpected feature of neural networks. Moreover, such algorithms are able…

    Submitted 14 February, 2022; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: 15 pages, 8 figures

  11. arXiv:2010.14761  [pdf, other]

    cs.LG cond-mat.dis-nn math.ST

    Wide flat minima and optimal generalization in classifying high-dimensional Gaussian mixtures

    Authors: Carlo Baldassi, Enrico M. Malatesta, Matteo Negri, Riccardo Zecchina

    Abstract: We analyze the connection between minimizers with good generalizing properties and high local entropy regions of a threshold-linear classifier in Gaussian mixtures with the mean squared error loss function. We show that there exist configurations that achieve the Bayes-optimal generalization error, even in the case of unbalanced clusters. We explore analytically the error-counting loss landscape i…

    Submitted 17 November, 2020; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: 19 pages, 4 figures. arXiv admin note: text overlap with arXiv:2006.07897

  12. Two-Loop Corrections to the Large-Order Behavior of Correlation Functions in the One-Dimensional N-Vector Model

    Authors: L. T. Giorgini, U. D. Jentschura, E. M. Malatesta, G. Parisi, T. Rizzo, J. Zinn-Justin

    Abstract: For a long time, the predictive limits of perturbative quantum field theory have been limited by our inability to carry out loop calculations to arbitrarily high order, which become increasingly complex as the order of perturbation theory is increased. This problem is exacerbated by the fact that perturbation series derived from loop diagram (Feynman diagram) calculations represent asymptotic (div…

    Submitted 2 June, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: 27 pages; RevTeX

    Journal ref: Phys. Rev. D 101, 125001 (2020)

  13. arXiv:1907.07578  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    Properties of the geometry of solutions and capacity of multi-layer neural networks with Rectified Linear Units activations

    Authors: Carlo Baldassi, Enrico M. Malatesta, Riccardo Zecchina

    Abstract: Rectified Linear Units (ReLU) have become the main model for the neural units in current deep learning systems. This choice was originally suggested as a way to compensate for the so-called vanishing gradient problem which can undercut stochastic gradient descent (SGD) learning in networks composed of multiple layers. Here we provide analytical results on the effects of ReLUs on the capacity…

    Submitted 3 May, 2024; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: 11 pages, 3 figures

    Journal ref: Phys. Rev. Lett. 123, 170602 (2019)

  14. arXiv:1905.08529  [pdf, other]

    cond-mat.dis-nn math-ph

    Fluctuations in the random-link matching problem

    Authors: Enrico M. Malatesta, Giorgio Parisi, Gabriele Sicuro

    Abstract: Using the replica approach and the cavity method, we study the fluctuations of the optimal cost in the random-link matching problem. By means of replica arguments, we derive the exact expression of its variance. Moreover, we study the large deviation function, deriving its expression in two different ways, namely using both the replica method and the cavity method.

    Submitted 30 August, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 9 pages, 3 figures

    Journal ref: Phys. Rev. E 100, 032102 (2019)

  15. arXiv:1902.00455  [pdf]

    cond-mat.dis-nn

    Random Combinatorial Optimization Problems: Mean Field and Finite-Dimensional Results

    Authors: Enrico M. Malatesta

    Abstract: This PhD thesis is organized as follows. In the first two chapters I will review some basic notions of statistical physics of disordered systems, such as random graph theory, the mean-field approximation, spin glasses and combinatorial optimization. The replica method will also be introduced and applied to the Sherrington-Kirkpatrick model, one of the simplest mean-field models of spin-glasses. Th…

    Submitted 12 October, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 211 pages

  16. Average optimal cost for the Euclidean TSP in one dimension

    Authors: Sergio Caracciolo, Andrea Di Gioacchino, Enrico M. Malatesta, Carlo Vanoni

    Abstract: The traveling-salesman problem is one of the most studied combinatorial optimization problems, because of the simplicity of its statement and the difficulty of its solution. We study the traveling salesman problem when the positions of the cities are chosen at random in the unit interval and the cost associated with the travel between two cities is their distance raised to an arbitrary power…

    Submitted 26 February, 2019; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: 14 pages, 8 figures

    Journal ref: J. Phys. A: Math. Theor. 52 (2019) 264003

  17. Selberg integrals in 1D random Euclidean optimization problems

    Authors: Sergio Caracciolo, Andrea Di Gioacchino, Enrico M. Malatesta, Luca G. Molinari

    Abstract: We consider a set of Euclidean optimization problems in one dimension, where the cost function associated with the pair of points $x$ and $y$ is the Euclidean distance between them raised to an arbitrary power $p\ge1$, and the points are chosen at random with flat measure. We derive the exact average cost for the random assignment problem, for any number of points, by using Selberg's integrals. Some vari…

    Submitted 6 May, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: 9 pages, 2 figures

    Journal ref: J. Stat. Mech. (2019) 063401

  18. Exact value for the average optimal cost of bipartite traveling-salesman and 2-factor problems in two dimensions

    Authors: Riccardo Capelli, Sergio Caracciolo, Andrea Di Gioacchino, Enrico M. Malatesta

    Abstract: We show that the average cost for the traveling-salesman problem in two dimensions, which is the archetypal problem in combinatorial optimization, in the bipartite case, is simply related to the average cost of the assignment problem with the same Euclidean, increasing, convex weights. In this way we extend a result already known in one dimension where exact solutions are available. The recently d…

    Submitted 27 September, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: 5 pages, 3 figures

    Journal ref: Phys. Rev. E 98, 030101 (2018)

  19. Plastic number and possible optimal solutions for an Euclidean 2-matching in one dimension

    Authors: Sergio Caracciolo, Andrea Di Gioacchino, Enrico M. Malatesta

    Abstract: In this work we consider the problem of finding the minimum-weight loop cover of an undirected graph. This combinatorial optimization problem is called 2-matching and can be seen as a relaxation of the traveling salesman problem since one does not have the unique loop condition. We consider this problem both on the complete bipartite and complete graph embedded in a one-dimensional interval, the w…

    Submitted 25 August, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: 19 pages, 5 figures

    Journal ref: J. Stat. Mech. (2018) 083402

  20. The Random Fractional Matching Problem

    Authors: Carlo Lucibello, Enrico M. Malatesta, Giorgio Parisi, Gabriele Sicuro

    Abstract: We consider two formulations of the random-link fractional matching problem, a relaxed version of the more standard random-link (integer) matching problem. In one formulation, we allow each node to be linked to itself in the optimal matching configuration. In the other one, on the contrary, such a link is forbidden. Both problems have the same asymptotic average optimal cost of the random-link mat…

    Submitted 4 May, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: 24 pages, 3 figures

    Journal ref: J. Stat. Mech. (2018) 053301

  21. arXiv:1802.01545  [pdf, other]

    cond-mat.dis-nn math.CO

    Solution for a bipartite Euclidean traveling-salesman problem in one dimension

    Authors: Sergio Caracciolo, Andrea Di Gioacchino, Marco Gherardi, Enrico M. Malatesta

    Abstract: The traveling salesman problem is one of the most studied combinatorial optimization problems, because of the simplicity of its statement and the difficulty of its solution. We characterize the optimal cycle for every convex and increasing cost function when the points are thrown independently and with an identical probability distribution in a compact interval. We compute the average optimal cost…

    Submitted 17 May, 2018; v1 submitted 5 February, 2018; originally announced February 2018.

    Comments: 9 pages, 5 figures

    Journal ref: Phys. Rev. E 97, 052109 (2018)

  22. Two-Loop Corrections to Large Order Behavior of $\varphi^4$ Theory

    Authors: Enrico M. Malatesta, Giorgio Parisi, Tommaso Rizzo

    Abstract: We consider the large order behavior of the perturbative expansion of the scalar $\varphi^4$ field theory in terms of a perturbative expansion around an instanton solution. We have computed the series of the free energy up to two-loop order in two and three dimensions. Topologically there is only an additional Feynman diagram with respect to the previously known one-dimensional case, but a careful…

    Submitted 16 May, 2018; v1 submitted 14 April, 2017; originally announced April 2017.

    Comments: 12 pages, 2 figures

    Journal ref: Nuclear Physics B, Volume 922, 2017, Pages 293-318

  23. Finite-size corrections in the random assignment problem

    Authors: Sergio Caracciolo, Matteo P. D'Achille, Enrico M. Malatesta, Gabriele Sicuro

    Abstract: We analytically derive, in the context of the replica formalism, the first finite-size corrections to the average optimal cost in the random assignment problem for a quite generic distribution law for the costs. We show that, when moving from a power-law distribution to a $Γ$ distribution, the leading correction changes both in sign and in its scaling properties. We also examine the behavior of th…

    Submitted 17 May, 2017; v1 submitted 20 February, 2017; originally announced February 2017.

    Comments: 16 pages, 4 figures

    Journal ref: Phys. Rev. E 95, 052129 (2017)