Skip to main content

Showing 1–5 of 5 results for author: Nickl, P

.
  1. arXiv:2405.01165  [pdf, other

    cs.SI cs.HC

    How A/B testing changes the dynamics of information spreading on a social network

    Authors: Matteo Ottaviani, Stefan M. Herzog, Pietro Leonardo Nickl, Philipp Lorenz-Spreen

    Abstract: A/B testing methodology is generally performed by private companies to increase user engagement and satisfaction about online features. Their usage is far from being transparent and may undermine user autonomy (e.g. polarizing individual opinions, mis- and dis- information spreading). For our analysis we leverage a crucial case study dataset (i.e. Upworthy) where news headlines were allocated to u… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2402.17641  [pdf, other

    cs.LG cs.AI cs.CL math.OC stat.ML

    Variational Learning is Effective for Large Deep Networks

    Authors: Yuesong Shen, Nico Daheim, Bai Cong, Peter Nickl, Gian Maria Marconi, Clement Bazan, Rio Yokota, Iryna Gurevych, Daniel Cremers, Mohammad Emtiyaz Khan, Thomas Möllenhoff

    Abstract: We give extensive empirical evidence against the common belief that variational learning is ineffective for large neural networks. We show that an optimizer called Improved Variational Online Newton (IVON) consistently matches or outperforms Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam but its predictive uncertaint… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Published at International Conference on Machine Learning (ICML), 2024. The first two authors contributed equally. Code is available here: https://github.com/team-approx-bayes/ivon

  3. arXiv:2310.19273  [pdf, other

    cs.LG cs.AI stat.ML

    The Memory Perturbation Equation: Understanding Model's Sensitivity to Data

    Authors: Peter Nickl, Lu Xu, Dharmesh Tailor, Thomas Möllenhoff, Mohammad Emtiyaz Khan

    Abstract: Understanding model's sensitivity to its training data is crucial but can also be challenging and costly, especially during training. To simplify such issues, we present the Memory-Perturbation Equation (MPE) which relates model's sensitivity to perturbation in its training data. Derived using Bayesian principles, the MPE unifies existing sensitivity measures, generalizes them to a wide-variety of… ▽ More

    Submitted 16 January, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  4. arXiv:2211.01120  [pdf, other

    cs.LG cs.AI cs.RO

    Variational Hierarchical Mixtures for Probabilistic Learning of Inverse Dynamics

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Well-calibrated probabilistic regression models are a crucial learning component in robotics applications as datasets grow rapidly and tasks become more complex. Unfortunately, classical regression models are usually either probabilistic kernel machines with a flexible structure that does not scale gracefully with data or deterministic and vastly scalable automata, albeit with a restrictive parame… ▽ More

    Submitted 10 September, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2011.05217

  5. arXiv:2011.05217  [pdf, other

    cs.LG cs.RO

    A Variational Infinite Mixture for Probabilistic Inverse Dynamics Learning

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Probabilistic regression techniques in control and robotics applications have to fulfill different criteria of data-driven adaptability, computational efficiency, scalability to high dimensions, and the capacity to deal with different modalities in the data. Classical regressors usually fulfill only a subset of these properties. In this work, we extend seminal work on Bayesian nonparametric mixtur… ▽ More

    Submitted 30 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.