Skip to main content

Showing 1–3 of 3 results for author: Rauchensteiner, M

.
  1. arXiv:2211.04589  [pdf, other

    cs.LG stat.ML

    Finite Sample Identification of Wide Shallow Neural Networks with Biases

    Authors: Massimo Fornasier, Timo Klock, Marco Mondelli, Michael Rauchensteiner

    Abstract: Artificial neural networks are functions depending on a finite number of parameters typically encoded as weights and biases. The identification of the parameters of the network from finite samples of input-output pairs is often referred to as the \emph{teacher-student model}, and this model has represented a popular framework for understanding training and generalization. Even if the problem is NP… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    MSC Class: 65D15; 68T07; 90C26

  2. arXiv:2101.07150  [pdf, other

    cs.LG

    Stable Recovery of Entangled Weights: Towards Robust Identification of Deep Neural Networks from Minimal Samples

    Authors: Christian Fiedler, Massimo Fornasier, Timo Klock, Michael Rauchensteiner

    Abstract: In this paper we approach the problem of unique and stable identifiability of generic deep artificial neural networks with pyramidal shape and smooth activation functions from a finite number of input-output samples. More specifically we introduce the so-called entangled weights, which compose weights of successive layers intertwined with suitable diagonal and invertible matrices depending on the… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

    MSC Class: 65D15; 68T07; 90C26

  3. arXiv:1907.00485  [pdf, other

    cs.LG cs.IT stat.ML

    Robust and Resource Efficient Identification of Two Hidden Layer Neural Networks

    Authors: Massimo Fornasier, Timo Klock, Michael Rauchensteiner

    Abstract: We address the structure identification and the uniform approximation of two fully nonlinear layer neural networks of the type $f(x)=1^T h(B^T g(A^T x))$ on $\mathbb R^d$ from a small number of query samples. We approach the problem by sampling actively finite difference approximations to Hessians of the network. Gathering several approximate Hessians allows reliably to approximate the matrix subs… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.