Skip to main content

Showing 1–8 of 8 results for author: Mlodozeniec, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18457  [pdf, other

    cs.LG stat.ML

    Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across so… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

  2. arXiv:2405.18328  [pdf, other

    cs.LG stat.ML

    Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

    Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between co… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Advances in Approximate Bayesian Inference 2024

  3. arXiv:2403.01946  [pdf, other

    cs.LG

    A Generative Model of Symmetry Transformations

    Authors: James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from grou… ▽ More

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2402.04384  [pdf, other

    cs.LG stat.ML

    Denoising Diffusion Probabilistic Models in Six Simple Steps

    Authors: Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) are a very popular class of deep generative model that have been successfully applied to a diverse range of problems including image and video generation, protein and material synthesis, weather forecasting, and neural surrogates of partial differential equations. Despite their ubiquity it is hard to find an introduction to DDPMs which is simple, co… ▽ More

    Submitted 10 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  5. arXiv:2310.15047  [pdf, other

    cs.LG cs.AI

    Implicit meta-learning may lead language models to trust more reliable sources

    Authors: Dmitrii Krasheninnikov, Egor Krasheninnikov, Bruno Mlodozeniec, Tegan Maharaj, David Krueger

    Abstract: We demonstrate that LLMs may learn indicators of document usefulness and modulate their updates accordingly. We introduce random strings ("tags") as indicators of usefulness in a synthetic fine-tuning dataset. Fine-tuning on this dataset leads to implicit meta-learning (IML): in further fine-tuning, the model updates to make more use of text that is tagged as useful. We perform a thorough empirica… ▽ More

    Submitted 15 May, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  6. arXiv:2304.14766  [pdf, other

    cs.LG stat.ML

    Hyperparameter Optimization through Neural Network Partitioning

    Authors: Bruno Mlodozeniec, Matthias Reisser, Christos Louizos

    Abstract: Well-tuned hyperparameters are crucial for obtaining good generalization behavior in neural networks. They can enforce appropriate inductive biases, regularize the model and improve performance -- especially in the presence of limited data. In this work, we propose a simple and efficient way for optimizing hyperparameters inspired by the marginal likelihood, an optimization objective that requires… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Published as a conference paper at ICLR 2023

  7. arXiv:2302.01170  [pdf, other

    stat.ML cond-mat.stat-mech cs.LG physics.chem-ph

    Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics

    Authors: Leon Klein, Andrew Y. K. Foong, Tor Erlend Fjelde, Bruno Mlodozeniec, Marc Brockschmidt, Sebastian Nowozin, Frank Noé, Ryota Tomioka

    Abstract: Molecular dynamics (MD) simulation is a widely used technique to simulate molecular systems, most commonly at the all-atom resolution where equations of motion are integrated with timesteps on the order of femtoseconds ($1\textrm{fs}=10^{-15}\textrm{s}$). MD is often used to compute equilibrium properties, which requires sampling from an equilibrium distribution such as the Boltzmann distribution.… ▽ More

    Submitted 1 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  8. arXiv:1905.00076  [pdf, other

    stat.ML cs.LG

    Ensemble Distribution Distillation

    Authors: Andrey Malinin, Bruno Mlodozeniec, Mark Gales

    Abstract: Ensembles of models often yield improvements in system performance. These ensemble approaches have also been empirically shown to yield robust measures of uncertainty, and are capable of distinguishing between different \emph{forms} of uncertainty. However, ensembles come at a computational and memory cost which may be prohibitive for many applications. There has been significant work done on the… ▽ More

    Submitted 25 November, 2019; v1 submitted 30 April, 2019; originally announced May 2019.