Showing 1–3 of 3 results for author: Koch, E d M

  1. arXiv:2012.03531  [pdf, other]

    cs.LG cs.AI stat.ML

    Why Unsupervised Deep Networks Generalize

    Authors: Anita de Mello Koch, Ellen de Mello Koch, Robert de Mello Koch

    Abstract: Promising resolutions of the generalization puzzle observe that the actual number of parameters in a deep network is much smaller than naive estimates suggest. The renormalization group is a compelling example of a problem which has very few parameters, despite the fact that naive estimates suggest otherwise. Our central hypothesis is that the mechanisms behind the renormalization group are also a…

    Submitted 7 December, 2020; originally announced December 2020.

  2. arXiv:2002.02664  [pdf, other]

    cs.LG cond-mat.stat-mech physics.comp-ph stat.ML

    Short sighted deep learning

    Authors: Ellen de Mello Koch, Anita de Mello Koch, Nicholas Kastanos, Ling Cheng

    Abstract: A theory explaining how deep learning works is yet to be developed. Previous work suggests that deep learning performs a coarse graining, similar in spirit to the renormalization group (RG). This idea has been explored in the setting of a local (nearest neighbor interactions) Ising spin lattice. We extend the discussion to the setting of a long range spin lattice. Markov Chain Monte Carlo (MCMC) s…

    Submitted 7 February, 2020; originally announced February 2020.

    Journal ref: Phys. Rev. E 102, 013307 (2020)

  3. arXiv:1906.05212  [pdf, other]

    cs.LG cond-mat.stat-mech physics.comp-ph stat.ML

    Is Deep Learning a Renormalization Group Flow?

    Authors: Ellen de Mello Koch, Robert de Mello Koch, Ling Cheng

    Abstract: Although there has been a rapid development of practical applications, theoretical explanations of deep learning are in their infancy. Deep learning performs a sophisticated coarse graining. Since coarse graining is a key ingredient of the renormalization group (RG), RG may provide a useful theoretical framework directly relevant to deep learning. In this study we pursue this possibility. A statis…

    Submitted 10 June, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/ACCESS.2020.3000901, IEEE Access