Showing 1–9 of 9 results for author: Petersen, P C

  1. arXiv:2404.14875

    cs.LG math.OC

    Regularized Gauss-Newton for Optimizing Overparameterized Neural Networks

    Authors: Adeyemi D. Adeoye, Philipp Christian Petersen, Alberto Bemporad

    Abstract: The generalized Gauss-Newton (GGN) optimization method incorporates curvature estimates into its solution steps, and provides a good approximation to the Newton method for large-scale optimization problems. GGN has been found particularly interesting for practical training of deep neural networks, not only for its impressive convergence speed, but also for its close relation with neural tangent ke…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 27 pages, 9 figures, 2 tables

  2. arXiv:2404.04549

    cs.NE cs.LG math.FA stat.ML

    Efficient Learning Using Spiking Neural Networks Equipped With Affine Encoders and Decoders

    Authors: A. Martina Neuman, Philipp Christian Petersen

    Abstract: We study the learning problem associated with spiking neural networks. Specifically, we consider hypothesis sets of spiking neural networks with affine temporal encoders and decoders and simple spiking neurons having only positive synaptic weights. We demonstrate that the positivity of the weights continues to enable a wide range of expressivity results, including rate-optimal approximation of smo…

    Submitted 6 April, 2024; originally announced April 2024.

  3. arXiv:2301.13867

    cs.LG cs.AI cs.CL

    Mathematical Capabilities of ChatGPT

    Authors: Simon Frieder, Luca Pinchetti, Alexis Chevalier, Ryan-Rhys Griffiths, Tommaso Salvatori, Thomas Lukasiewicz, Philipp Christian Petersen, Julius Berner

    Abstract: We investigate the mathematical capabilities of two iterations of ChatGPT (released 9-January-2023 and 30-January-2023) and of GPT-4 by testing them on publicly available datasets, as well as hand-crafted ones, using a novel methodology. In contrast to formal mathematics, where large databases of formal proofs are available (e.g., the Lean Mathematical Library), current datasets of natural-languag…

    Submitted 20 July, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Added further evaluations on another ChatGPT version and on GPT-4. The GHOSTS and miniGHOSTS datasets are available at https://github.com/xyfrieder/science-GHOSTS

    Journal ref: NeurIPS 2023 Datasets and Benchmarks

  4. arXiv:2212.09507

    cs.LG math.FA stat.ML

    VC dimensions of group convolutional neural networks

    Authors: Philipp Christian Petersen, Anna Sepliarskaia

    Abstract: We study the generalization capacity of group convolutional neural networks. We identify precise estimates for the VC dimensions of simple sets of group convolutional neural networks. In particular, we find that for infinite groups and appropriately chosen convolutional kernels, already two-parameter families of convolutional neural networks have an infinite VC dimension, despite being invariant t…

    Submitted 19 December, 2022; originally announced December 2022.

    MSC Class: 68T07; 68Q32; 68T05

  5. arXiv:2210.00805

    cs.LG math.FA stat.ML

    Limitations of neural network training due to numerical instability of backpropagation

    Authors: Clemens Karner, Vladimir Kazeev, Philipp Christian Petersen

    Abstract: We study the training of deep neural networks by gradient descent where floating-point arithmetic is used to compute the gradients. In this framework and under realistic assumptions, we demonstrate that it is highly unlikely to find ReLU neural networks that maintain, in the course of training with gradient descent, superlinearly many affine pieces with respect to their number of layers. In virtua…

    Submitted 15 November, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    MSC Class: 65G50; 68T07; 41A25; 68T09

  6. arXiv:2206.00934

    math.NA math.AP stat.ML

    Deep neural networks can stably solve high-dimensional, noisy, non-linear inverse problems

    Authors: Andrés Felipe Lerma Pineda, Philipp Christian Petersen

    Abstract: We study the problem of reconstructing solutions of inverse problems when only noisy measurements are available. We assume that the problem can be modeled with an infinite-dimensional forward operator that is not continuously invertible. Then, we restrict this forward operator to finite-dimensional spaces so that the inverse is Lipschitz continuous. For the inverse operator, we demonstrate that th…

    Submitted 20 October, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

    MSC Class: 35R30; 41A25; 68T05

  7. Exponential ReLU Neural Network Approximation Rates for Point and Edge Singularities

    Authors: Carlo Marcati, Joost A. A. Opschoor, Philipp C. Petersen, Christoph Schwab

    Abstract: We prove exponential expressivity with stable ReLU Neural Networks (ReLU NNs) in $H^1(Ω)$ for weighted analytic function classes in certain polytopal domains $Ω$, in space dimension $d=2,3$. Functions in these classes are locally analytic on open subdomains $D\subset Ω$, but may exhibit isolated point singularities in the interior of $Ω$ or corner and edge singularities at the boundary…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Found Comput Math (2022)

    MSC Class: 35Q40; 41A25; 41A46; 65N30

    Journal ref: Found. Comput. Math. 23 (2023), no. 3, 1043-1127

  8. arXiv:1901.05744

    cs.LG stat.ML

    The Oracle of DLphi

    Authors: Dominik Alfke, Weston Baines, Jan Blechschmidt, Mauricio J. del Razo Sarmina, Amnon Drory, Dennis Elbrächter, Nando Farchmin, Matteo Gambara, Silke Glas, Philipp Grohs, Peter Hinz, Danijel Kivaranovic, Christian Kümmerle, Gitta Kutyniok, Sebastian Lunz, Jan Macdonald, Ryan Malthaner, Gregory Naisat, Ariel Neufeld, Philipp Christian Petersen, Rafael Reisenhofer, Jun-Da Sheng, Laura Thesing, Philipp Trunschke, Johannes von Lindheim, et al. (2 additional authors not shown)

    Abstract: We present a novel technique based on deep learning and set theory which yields exceptional classification and prediction results. Having access to a sufficiently large amount of labelled training data, our methodology is capable of predicting the labels of the test data almost always even if the training data is entirely unrelated to the test data. In other words, we prove in a specific setting t…

    Submitted 27 January, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    MSC Class: 68T05; 82C32

  9. arXiv:1810.12835

    math.FA math.AP

    Gamma-convergence of a shearlet-based Ginzburg--Landau energy

    Authors: Philipp Christian Petersen, Endre Süli

    Abstract: We introduce two shearlet-based Ginzburg--Landau energies, based on the continuous and the discrete shearlet transform. The energies result from replacing the elastic energy term of a classical Ginzburg--Landau energy by the weighted $L^2$-norm of a shearlet transform. The asymptotic behaviour of sequences of these energies is analysed within the framework of $Γ$-convergence and the limit energy i…

    Submitted 27 November, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    MSC Class: 42C40; 65T60; 49M25