Skip to main content

Showing 1–4 of 4 results for author: Leonardi, G P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.09184  [pdf, other

    stat.ML cs.LG

    A Two-Scale Complexity Measure for Deep Learning Models

    Authors: Massimiliano Datres, Gian Paolo Leonardi, Alessio Figalli, David Sutter

    Abstract: We introduce a novel capacity measure 2sED for statistical models based on the effective dimension. The new quantity provably bounds the generalization error under mild assumptions on the model. Furthermore, simulations on standard data sets and popular model architectures show that 2sED correlates well with the training error. For Markovian models, we show how to efficiently approximate 2sED from… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  2. arXiv:2203.11323  [pdf, other

    cs.LG cs.CV

    Training Quantised Neural Networks with STE Variants: the Additive Noise Annealing Algorithm

    Authors: Matteo Spallanzani, Gian Paolo Leonardi, Luca Benini

    Abstract: Training quantised neural networks (QNNs) is a non-differentiable optimisation problem since weights and features are output by piecewise constant functions. The standard solution is to apply the straight-through estimator (STE), using different functions during the inference and gradient computation steps. Several STE variants have been proposed in the literature aiming to maximise the task accur… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  3. arXiv:2011.01858  [pdf, ps, other

    cs.LG math.OC

    Analytical aspects of non-differentiable neural networks

    Authors: Gian Paolo Leonardi, Matteo Spallanzani

    Abstract: Research in computational deep learning has directed considerable efforts towards hardware-oriented optimisations for deep neural networks, via the simplification of the activation functions, or the quantization of both activations and weights. The resulting non-differentiability (or even discontinuity) of the networks poses some challenging problems, especially in connection with the learning pro… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    MSC Class: 68T07; 41A25 (Primary) 65K10 (Secondary)

  4. arXiv:1905.10452  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Additive Noise Annealing and Approximation Properties of Quantized Neural Networks

    Authors: Matteo Spallanzani, Lukas Cavigelli, Gian Paolo Leonardi, Marko Bertogna, Luca Benini

    Abstract: We present a theoretical and experimental investigation of the quantization problem for artificial neural networks. We provide a mathematical definition of quantized neural networks and analyze their approximation capabilities, showing in particular that any Lipschitz-continuous map defined on a hypercube can be uniformly approximated by a quantized neural network. We then focus on the regularizat… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.