Showing 1–12 of 12 results for author: Janz, D

  1. arXiv:2311.08376  [pdf, ps, other]

    stat.ML cs.LG

    Ensemble sampling for linear bandits: small ensembles suffice

    Authors: David Janz, Alexander E. Litvak, Csaba Szepesvári

    Abstract: We provide the first useful and rigorous analysis of ensemble sampling for the stochastic linear bandit setting. In particular, we show that, under standard assumptions, for a $d$-dimensional stochastic linear bandit with an interaction horizon $T$, ensemble sampling with an ensemble of size of order $d \log T$ incurs regret at most of the order $(d \log T)^{5/2} \sqrt{T}$. Ours is…

    Submitted 6 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  2. arXiv:2311.07565  [pdf, other]

    cs.LG stat.ML

    Exploration via linearly perturbed loss minimisation

    Authors: David Janz, Shuai Liu, Alex Ayoub, Csaba Szepesvári

    Abstract: We introduce exploration via linear loss perturbations (EVILL), a randomised exploration method for structured stochastic bandit problems that works by solving for the minimiser of a linearly perturbed regularised negative log-likelihood function. We show that, for the case of generalised linear bandits, EVILL reduces to perturbed history exploration (PHE), a method where exploration is done by tr…

    Submitted 6 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  3. arXiv:2310.20581  [pdf, other]

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when done right -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra…

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  4. arXiv:2306.11589  [pdf, other]

    cs.LG stat.ML

    Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

    Authors: Jihao Andreas Lin, Javier Antorán, Shreyas Padhy, David Janz, José Miguel Hernández-Lobato, Alexander Terenin

    Abstract: Gaussian processes are a powerful framework for quantifying uncertainty and for sequential decision-making but are limited by the requirement of solving linear systems. In general, this has a cubic cost in dataset size and is sensitive to conditioning. We explore stochastic gradient algorithms as a computationally efficient method of approximately solving these linear systems: we develop low-varia…

    Submitted 15 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  5. arXiv:2210.04994  [pdf, other]

    stat.ML cs.AI cs.LG

    Sampling-based inference for large linear models, with application to linearised Laplace

    Authors: Javier Antorán, Shreyas Padhy, Riccardo Barbano, Eric Nalisnick, David Janz, José Miguel Hernández-Lobato

    Abstract: Large-scale linear models are ubiquitous throughout machine learning, with contemporary application as surrogate models for neural network uncertainty quantification; that is, the linearised Laplace method. Alas, the computational cost associated with Bayesian linear models constrains this method's application to small networks, small output spaces and small datasets. We address this limitation by…

    Submitted 16 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2023. This latest arXiv version is extended with a demonstration of the proposed methods on the ImageNet dataset

  6. arXiv:2206.08900  [pdf, other]

    stat.ML cs.AI cs.LG

    Adapting the Linearised Laplace Model Evidence for Modern Deep Learning

    Authors: Javier Antorán, David Janz, James Urquhart Allingham, Erik Daxberger, Riccardo Barbano, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: The linearised Laplace method for estimating model uncertainty has received renewed attention in the Bayesian deep learning community. The method provides reliable error bars and admits a closed-form expression for the model evidence, allowing for scalable selection of model hyperparameters. In this work, we examine the assumptions behind this method, particularly in conjunction with model selecti…

    Submitted 8 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Paper appearing at ICML 2022

  7. arXiv:2001.10396  [pdf, other]

    cs.LG stat.ML

    Bandit optimisation of functions in the Matérn kernel RKHS

    Authors: David Janz, David R. Burt, Javier González

    Abstract: We consider the problem of optimising functions in the reproducing kernel Hilbert space (RKHS) of a Matérn kernel with smoothness parameter $ν$ over the domain $[0,1]^d$ under noisy bandit feedback. Our contribution, the $π$-GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all $ν>1$ and $d \geq 1$. Empirical validation suggests better performance and drastical…

    Submitted 26 February, 2023; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: Included an erratum highlighting an omission in the proof of lemma 1 and pointing to a fix in the author's thesis; the omission does not affect the main result

  8. arXiv:1810.06530  [pdf, other]

    cs.LG stat.ML

    Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning

    Authors: David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek

    Abstract: Posterior sampling for reinforcement learning (PSRL) is an effective method for balancing exploration and exploitation in reinforcement learning. Randomised value functions (RVF) can be viewed as a promising approach to scaling PSRL. However, we show that most contemporary algorithms combining RVF with neural network function approximation do not possess the properties which make PSRL effective, a…

    Submitted 3 December, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Camera ready version, NeurIPS 2019

  9. arXiv:1807.00412  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Learning to Drive in a Day

    Authors: Alex Kendall, Jeffrey Hawke, David Janz, Przemyslaw Mazur, Daniele Reda, John-Mark Allen, Vinh-Dieu Lam, Alex Bewley, Amar Shah

    Abstract: We demonstrate the first application of deep reinforcement learning to autonomous driving. From randomly initialised parameters, our model is able to learn a policy for lane following in a handful of training episodes using a single monocular image as input. We provide a general and easy to obtain reward: the distance travelled by the vehicle without the safety driver taking control. We use a cont…

    Submitted 11 September, 2018; v1 submitted 1 July, 2018; originally announced July 2018.

    Comments: Further results and demo videos can be viewed at: https://wayve.ai/blog/l2diad

  10. arXiv:1712.01664  [pdf, other]

    stat.ML cs.LG

    Learning a Generative Model for Validity in Complex Discrete Structures

    Authors: David Janz, Jos van der Westhuizen, Brooks Paige, Matt J. Kusner, José Miguel Hernández-Lobato

    Abstract: Deep generative models have been successfully used to learn representations for high-dimensional discrete spaces by representing discrete objects as sequences and employing powerful sequence-based deep models. Unfortunately, these sequence-based models often produce invalid sequences: sequences which do not represent any underlying discrete structure; invalid sequences hinder the utility of such m…

    Submitted 1 November, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: Conference paper at ICLR 2018. Code available online

  11. arXiv:1708.04465  [pdf, ps, other]

    stat.ML cs.LG

    Actively Learning what makes a Discrete Sequence Valid

    Authors: David Janz, Jos van der Westhuizen, José Miguel Hernández-Lobato

    Abstract: Deep learning techniques have been hugely successful for traditional supervised and unsupervised machine learning problems. In large part, these techniques solve continuous optimization problems. Recently, however, discrete generative deep learning models have been successfully used to efficiently search high-dimensional discrete spaces. These methods work by representing discrete objects as sequen…

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 6 pages, 2 figures

  12. arXiv:1611.06863  [pdf, other]

    stat.ML cs.LG

    Probabilistic structure discovery in time series data

    Authors: David Janz, Brooks Paige, Tom Rainforth, Jan-Willem van de Meent, Frank Wood

    Abstract: Existing methods for structure discovery in time series data construct interpretable, compositional kernels for Gaussian process regression models. While the learned Gaussian process model provides posterior mean and variance estimates, typically the structure is learned via a greedy optimization procedure. This restricts the space of possible solutions and leads to over-confident uncertainty esti…

    Submitted 21 November, 2016; originally announced November 2016.