Skip to main content

Showing 1–2 of 2 results for author: Pavasovic, K L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.18455  [pdf, other

    cs.LG stat.ML

    Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent

    Authors: Krunoslav Lehman Pavasovic, Alain Durmus, Umut Simsekli

    Abstract: A recent line of empirical studies has demonstrated that SGD might exhibit a heavy-tailed behavior in practical settings, and the heaviness of the tails might correlate with the overall performance. In this paper, we investigate the emergence of such heavy tails. Previous works on this problem only considered, up to our knowledge, online (also called single-pass) SGD, in which the emergence of hea… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: In Neural Information Processing Systems (NeurIPS), Spotlight Presentation, 2023

  2. arXiv:2210.13319  [pdf, other

    cs.LG stat.ML

    MARS: Meta-Learning as Score Matching in the Function Space

    Authors: Krunoslav Lehman Pavasovic, Jonas Rothfuss, Andreas Krause

    Abstract: Meta-learning aims to extract useful inductive biases from a set of related datasets. In Bayesian meta-learning, this is typically achieved by constructing a prior distribution over neural network parameters. However, specifying families of computationally viable prior distributions over the high-dimensional neural network parameters is difficult. As a result, existing approaches resort to meta-le… ▽ More

    Submitted 10 June, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: In International Conference on Learning Representations (ICLR), 2023