Skip to main content

Showing 1–3 of 3 results for author: Frye, C G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2003.10397  [pdf, other

    cs.LG cs.NE stat.ML

    Critical Point-Finding Methods Reveal Gradient-Flat Regions of Deep Network Losses

    Authors: Charles G. Frye, James Simon, Neha S. Wadia, Andrew Ligeralde, Michael R. DeWeese, Kristofer E. Bouchard

    Abstract: Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-based optimization algorithms converge to approximately the same performance from many random initial points. One thread of work has focused on explaining this phenomenon by characterizing the local curvature near critical points of the loss function, where the gradients are near zero, and demonstratin… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 18 pages, 5 figures

  2. arXiv:1906.05273  [pdf, ps, other

    math.OC cs.LG

    Critical Point Finding with Newton-MR by Analogy to Computing Square Roots

    Authors: Charles G Frye

    Abstract: Understanding of the behavior of algorithms for resolving the optimization problem (hereafter shortened to OP) of optimizing a differentiable loss function (OP1), is enhanced by knowledge of the critical points of that loss function, i.e. the points where the gradient is 0. Here, we describe a solution to the problem of finding critical points by proposing and solving three optimization problems:… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 8 pages, 0 figures

  3. arXiv:1901.10603  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Numerically Recovering the Critical Points of a Deep Linear Autoencoder

    Authors: Charles G. Frye, Neha S. Wadia, Michael R. DeWeese, Kristofer E. Bouchard

    Abstract: Numerically locating the critical points of non-convex surfaces is a long-standing problem central to many fields. Recently, the loss surfaces of deep neural networks have been explored to gain insight into outstanding questions in optimization, generalization, and network architecture design. However, the degree to which recently-proposed methods for numerically recovering critical points actuall… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.