Showing 1–7 of 7 results for author: Marsili, M

Searching in archive stat.
  1. arXiv:2407.01656  [pdf, other]

    cs.LG cond-mat.dis-nn physics.data-an stat.ML

    Statistical signatures of abstraction in deep neural networks

    Authors: Carlo Orientale Caputo, Matteo Marsili

    Abstract: We study how abstract representations emerge in a Deep Belief Network (DBN) trained on benchmark datasets. Our analysis targets the principles of learning in the early stages of information processing, starting from the "primordial soup" of the under-sampling regime. As the data is processed by deeper and deeper layers, features are detected and removed, transferring more and more "context-invaria…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages, 5 figures

  2. arXiv:2202.00339  [pdf, other]

    cs.LG cond-mat.dis-nn physics.data-an stat.ML

    Quantifying Relevance in Learning and Inference

    Authors: Matteo Marsili, Yasser Roudi

    Abstract: Learning is a distinctive feature of intelligent behaviour. High-throughput experimental data and Big Data promise to open new windows on complex systems such as cells, the brain or our societies. Yet, the puzzling success of Artificial Intelligence and Machine Learning shows that we still have a poor conceptual understanding of learning. These applications push statistical inference into uncharte…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: review article, 63 pages, 14 figures

  3. arXiv:2112.09420  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    A random energy approach to deep learning

    Authors: Rongrong Xie, Matteo Marsili

    Abstract: We study a generic ensemble of deep belief networks which is parametrized by the distribution of energy levels of the hidden states of each layer. We show that, within a random energy approach, statistical dependence can propagate from the visible to deep layers only if each layer is tuned close to the critical point during learning. As a consequence, efficiently trained learning machines are char…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 16 pages, 4 figures

  4. arXiv:1903.00386  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG physics.data-an

    On the complexity of logistic regression models

    Authors: Nicola Bulso, Matteo Marsili, Yasser Roudi

    Abstract: We investigate the complexity of logistic regression models, which is defined by counting the number of indistinguishable distributions that the model can represent (Balasubramanian, 1997). We find that the complexity of logistic models with binary inputs depends not only on the number of parameters but also on the distribution of inputs in a non-trivial way which standard treatments of complex…

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: 29 pages, 6 figures, The supplementary material is an ancillary file and can be downloaded from a link on the right

  5. arXiv:1809.00652  [pdf, other]

    stat.ME physics.data-an

    Minimum Description Length codes are critical

    Authors: Ryan John Cubero, Matteo Marsili, Yasser Roudi

    Abstract: In the Minimum Description Length (MDL) principle, learning from the data is equivalent to an optimal coding problem. We show that the codes that achieve optimal compression in MDL are critical in a very precise sense. First, when they are taken as generative models of samples, they generate samples with broad empirical distributions and with a high value of the relevance, defined as the entropy o…

    Submitted 2 October, 2018; v1 submitted 3 September, 2018; originally announced September 2018.

    Comments: 23 pages, 5 figures; Corrected the author name, revised Section 2.2 (Large Deviations of the Universal Codes Exhibit Phase Transitions), corrected Eq. (89)

    Journal ref: Entropy 2018, 20(10)

  6. arXiv:1702.07549  [pdf, other]

    cond-mat.dis-nn stat.ML

    The Stochastic complexity of spin models: Are pairwise models really simple?

    Authors: Alberto Beretta, Claudia Battistin, Clélia de Mulatier, Iacopo Mastromatteo, Matteo Marsili

    Abstract: Models can be simple for different reasons: because they yield a simple and computationally efficient interpretation of a generic dataset (e.g. in terms of pairwise dependences) - as in statistical learning - or because they capture the essential ingredients of a specific phenomenon - as e.g. in physics - leading to non-trivial falsifiable predictions. In information theory and Bayesian inference,…

    Submitted 11 April, 2018; v1 submitted 24 February, 2017; originally announced February 2017.

  7. arXiv:1603.00952  [pdf, other]

    stat.ML cond-mat.dis-nn

    Sparse model selection in the highly under-sampled regime

    Authors: Nicola Bulso, Matteo Marsili, Yasser Roudi

    Abstract: We propose a method for recovering the structure of a sparse undirected graphical model when very few samples are available. The method decides on the presence or absence of bonds between pairs of variables by considering one pair at a time and using a closed-form formula, analytically derived by calculating the posterior probability for every possible model explaining a two-body system using Je…

    Submitted 2 January, 2017; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: 54 pages, 26 figures

    Journal ref: J. Stat. Mech. (2016) 093404