Skip to main content

Showing 1–6 of 6 results for author: Petzka, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.02337  [pdf, other

    cs.LG

    FAM: Relative Flatness Aware Minimization

    Authors: Linara Adilova, Amr Abourayya, Jianning Li, Amin Dada, Henning Petzka, Jan Egger, Jens Kleesiek, Michael Kamp

    Abstract: Flatness of the loss curve around a model at hand has been shown to empirically correlate with its generalization ability. Optimizing for flatness has been proposed as early as 1994 by Hochreiter and Schmidthuber, and was followed by more recent successful sharpness-aware optimization techniques. Their widespread adoption in practice, though, is dubious because of the lack of theoretically grounde… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40 th International Conference on Machine Learning, Honolulu, Hawaii, USA. 2023

  2. arXiv:2203.01035  [pdf, other

    cs.LG

    Discriminating Against Unrealistic Interpolations in Generative Adversarial Networks

    Authors: Henning Petzka, Ted Kronvall, Cristian Sminchisescu

    Abstract: Interpolations in the latent space of deep generative models is one of the standard tools to synthesize semantically meaningful mixtures of generated samples. As the generator function is non-linear, commonly used linear interpolations in the latent space do not yield the shortest paths in the sample space, resulting in non-smooth interpolations. Recent work has therefore equipped the latent space… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: The first two authors made equal contribution

  3. arXiv:2001.00939  [pdf, other

    cs.LG stat.ML

    Relative Flatness and Generalization

    Authors: Henning Petzka, Michael Kamp, Linara Adilova, Cristian Sminchisescu, Mario Boley

    Abstract: Flatness of the loss curve is conjectured to be connected to the generalization ability of machine learning models, in particular neural networks. While it has been empirically observed that flatness measures consistently correlate strongly with generalization, it is still an open theoretical problem why and under which circumstances flatness is connected to generalization, in particular in light… ▽ More

    Submitted 4 November, 2021; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: The first two authors made equal contribution; Accepted for publication at NeurIPS 2021; arXiv admin note: substantial text overlap with arXiv:1912.00058

  4. arXiv:1912.00058  [pdf, other

    cs.LG stat.ML

    A Reparameterization-Invariant Flatness Measure for Deep Neural Networks

    Authors: Henning Petzka, Linara Adilova, Michael Kamp, Cristian Sminchisescu

    Abstract: The performance of deep neural networks is often attributed to their automated, task-related feature construction. It remains an open question, though, why this leads to solutions with good generalization, even in cases where the number of parameters is larger than the number of samples. Back in the 90s, Hochreiter and Schmidhuber observed that flatness of the loss surface around a local minimum c… ▽ More

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 14 pages; accepted at Workshop "Science meets Engineering of Deep Learning", 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  5. arXiv:1812.06486  [pdf, other

    cs.LG stat.ML

    Non-attracting Regions of Local Minima in Deep and Wide Neural Networks

    Authors: Henning Petzka, Cristian Sminchisescu

    Abstract: Understanding the loss surface of neural networks is essential for the design of models with predictable performance and their success in applications. Experimental results suggest that sufficiently deep and wide neural networks are not negatively impacted by suboptimal local minima. Despite recent progress, the reason for this outcome is not fully understood. Could deep networks have very few, if… ▽ More

    Submitted 31 August, 2020; v1 submitted 16 December, 2018; originally announced December 2018.

  6. arXiv:1709.08894  [pdf, other

    stat.ML cs.LG

    On the regularization of Wasserstein GANs

    Authors: Henning Petzka, Asja Fischer, Denis Lukovnicov

    Abstract: Since their invention, generative adversarial networks (GANs) have become a popular approach for learning to model a distribution of real (unlabeled) data. Convergence problems during training are overcome by Wasserstein GANs which minimize the distance between the model and the empirical distribution in terms of a different metric, but thereby introduce a Lipschitz constraint into the optimizatio… ▽ More

    Submitted 5 March, 2018; v1 submitted 26 September, 2017; originally announced September 2017.

    Comments: Published as a conference paper at ICLR 2018. * Henning Petzka and Asja Fischer contributed equally to this work (11 pages +13 pages appendix)