Showing 1–10 of 10 results for author: Sclocchi, A

  1. arXiv:2402.16991  [pdf, other]

    stat.ML cond-mat.dis-nn cs.CV cs.LG

    A Phase Transition in Diffusion Models Reveals the Hierarchical Nature of Data

    Authors: Antonio Sclocchi, Alessandro Favero, Matthieu Wyart

    Abstract: Understanding the structure of real data is paramount in advancing modern deep-learning methodologies. Natural data such as images are believed to be composed of features organised in a hierarchical and combinatorial manner, which neural networks capture during learning. Recent advancements show that diffusion models can generate high-quality images, hinting at their ability to capture this underl…

    Submitted 4 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 21 pages, 16 figures

  2. arXiv:2309.10688  [pdf, other]

    cs.LG cond-mat.dis-nn stat.ML

    On the different regimes of Stochastic Gradient Descent

    Authors: Antonio Sclocchi, Matthieu Wyart

    Abstract: Modern deep networks are trained with stochastic gradient descent (SGD) whose key hyperparameters are the number of data considered at each step or batch size $B$, and the step size or learning rate $η$. For small $B$ and large $η$, SGD corresponds to a stochastic evolution of the parameters, whose noise amplitude is governed by the "temperature" $T\equiv η/B$. Yet this description is observed t…

    Submitted 27 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Main: 8 pages, 4 figures; Appendix: 15 pages, 11 figures

    Journal ref: Proceedings of the National Academy of Sciences 121.9 (2024): e2316301121

  3. arXiv:2301.13703  [pdf, other]

    cs.LG cond-mat.dis-nn

    Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning

    Authors: Antonio Sclocchi, Mario Geiger, Matthieu Wyart

    Abstract: Understanding when the noise in stochastic gradient descent (SGD) affects generalization of deep neural networks remains a challenge, complicated by the fact that networks can operate in distinct training regimes. Here we study how the magnitude of this noise $T$ affects performance as the size of the training set $P$ and the scale of initialization $α$ are varied. For gradient descent, $α$ is a k…

    Submitted 30 May, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: 25 pages, 21 figures, added analysis in feature-learning

  4. arXiv:2202.03348  [pdf, other]

    cs.LG cond-mat.stat-mech

    Failure and success of the spectral bias prediction for Kernel Ridge Regression: the case of low-dimensional data

    Authors: Umberto M. Tomasini, Antonio Sclocchi, Matthieu Wyart

    Abstract: Recently, several theories including the replica method made predictions for the generalization error of Kernel Ridge Regression. In some regimes, they predict that the method has a 'spectral bias': decomposing the true function $f^*$ on the eigenbasis of the kernel, it fits well the coefficients associated with the O(P) largest eigenvalues, where $P$ is the size of the training set. This predicti…

    Submitted 16 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 34 pages, 11 figures

  5. arXiv:2106.08581  [pdf, other]

    cond-mat.dis-nn cond-mat.stat-mech

    High dimensional optimization under non-convex excluded volume constraints

    Authors: Antonio Sclocchi, Pierfrancesco Urbani

    Abstract: We consider high dimensional random optimization problems where the dynamical variables are subjected to non-convex excluded volume constraints. We focus on the case in which the cost function is a simple quadratic cost and the excluded volume constraints are modeled by a perceptron constraint satisfaction problem. We show that depending on the density of constraints, one can have different situat…

    Submitted 22 December, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: 7 pages, 3 figures

  6. arXiv:2010.10253  [pdf, other]

    cond-mat.dis-nn cond-mat.stat-mech

    Proliferation of non-linear excitations in the piecewise-linear perceptron

    Authors: Antonio Sclocchi, Pierfrancesco Urbani

    Abstract: We investigate the properties of local minima of the energy landscape of a continuous non-convex optimization problem, the spherical perceptron with piecewise linear cost function and show that they are critical, marginally stable and displaying a set of pseudogaps, singularities and non-linear excitations whose properties appear to be in the same universality class of jammed packings of hard sphe…

    Submitted 14 December, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 14 pages, 7 figures

    Journal ref: SciPost Phys. 10, 013 (2021)

  7. arXiv:2010.02158  [pdf, other]

    cond-mat.dis-nn cond-mat.stat-mech

    Surfing on minima of isostatic landscapes: avalanches and unjamming transition

    Authors: Silvio Franz, Antonio Sclocchi, Pierfrancesco Urbani

    Abstract: Recently, we showed that optimization problems, both in infinite as well as in finite dimensions, for continuous variables and soft excluded volume constraints, can display entire isostatic phases where local minima of the cost function are marginally stable configurations endowed with non-linear excitations [1,2]. In this work we describe an athermal adiabatic algorithm to explore with continuity…

    Submitted 10 December, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 22 pages, 13 figures

    Journal ref: J. Stat. Mech. (2021) 023208

  8. Critical energy landscape of linear soft spheres

    Authors: Silvio Franz, Antonio Sclocchi, Pierfrancesco Urbani

    Abstract: We show that soft spheres interacting with a linear ramp potential when overcompressed beyond the jamming point fall in an amorphous solid phase which is critical, mechanically marginally stable and share many features with the jamming point itself. In the whole phase, the relevant local minima of the potential energy landscape display an isostatic contact network of perfectly touching spheres who…

    Submitted 12 June, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 12 pages, 10 figures, Submitted to SciPost Physics

    Journal ref: SciPost Phys. 9, 012 (2020)

  9. Critical jammed phase of the linear perceptron

    Authors: Silvio Franz, Antonio Sclocchi, Pierfrancesco Urbani

    Abstract: Criticality in statistical physics naturally emerges at isolated points in the phase diagram. Jamming of spheres is not an exception: varying density, it is the critical point that separates the unjammed phase where spheres do not overlap and the jammed phase where they cannot be arranged without overlaps. The same remains true in more general constraint satisfaction problems with continuous varia…

    Submitted 5 August, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: 10 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 123, 115702 (2019)

  10. arXiv:1611.05085  [pdf, other]

    cond-mat.mes-hall quant-ph

    Topology of a dissipative spin: dynamical Chern number, bath induced non-adiabaticity and a quantum dynamo effect

    Authors: Loic Henriet, Antonio Sclocchi, Peter P. Orth, Karyn Le Hur

    Abstract: We analyze the topological deformations of a spin-1/2 in an effective magnetic field induced by an ohmic quantum dissipative environment at zero temperature. From Bethe Ansatz results and a variational approach, we confirm that the Chern number is preserved in the delocalized phase for $α<1$. We report a divergence of the Berry curvature at the equator when $α_c=1$ that appears at the localization…

    Submitted 15 November, 2016; originally announced November 2016.

    Journal ref: Phys. Rev. B 95, 054307 (2017)