Skip to main content

Showing 1–10 of 10 results for author: van Amersfoort, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2207.07411  [pdf, other

    cs.LG stat.ML

    Plex: Towards Reliability using Pretrained Large Model Extensions

    Authors: Dustin Tran, Jeremiah Liu, Michael W. Dusenberry, Du Phan, Mark Collier, Jie Ren, Kehang Han, Zi Wang, Zelda Mariet, Huiyi Hu, Neil Band, Tim G. J. Rudner, Karan Singhal, Zachary Nado, Joost van Amersfoort, Andreas Kirsch, Rodolphe Jenatton, Nithum Thain, Honglin Yuan, Kelly Buchanan, Kevin Murphy, D. Sculley, Yarin Gal, Zoubin Ghahramani, Jasper Snoek , et al. (1 additional authors not shown)

    Abstract: A recent trend in artificial intelligence is the use of pretrained models for language and vision tasks, which have achieved extraordinary performance but also puzzling failures. Probing these models' abilities in diverse ways is therefore critical to the field. In this paper, we explore the reliability of models, where we define a reliable model as one that not only achieves strong predictive per… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Code available at https://goo.gle/plex-code

  2. arXiv:2111.02275  [pdf, other

    cs.LG stat.ML

    Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

    Authors: Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort, Andreas Kirsch, Uri Shalit, Yarin Gal

    Abstract: Estimating personalized treatment effects from high-dimensional observational data is essential in situations where experimental designs are infeasible, unethical, or expensive. Existing approaches rely on fitting deep models on outcomes observed for treated and control populations. However, when measuring individual outcomes is costly, as is the case of a tumor biopsy, a sample-efficient strategy… ▽ More

    Submitted 1 February, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 24 pages, 8 Figures, 5 tables, NeurIPS 2021

  3. arXiv:2106.02469  [pdf, other

    cs.LG stat.ML

    Can convolutional ResNets approximately preserve input distances? A frequency analysis perspective

    Authors: Lewis Smith, Joost van Amersfoort, Haiwen Huang, Stephen Roberts, Yarin Gal

    Abstract: ResNets constrained to be bi-Lipschitz, that is, approximately distance preserving, have been a crucial component of recently proposed techniques for deterministic uncertainty quantification in neural models. We show that theoretical justifications for recent regularisation schemes trying to enforce such a constraint suffer from a crucial flaw -- the theoretical link between the regularisation sch… ▽ More

    Submitted 17 June, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Main paper 10 pages including references, appendix 10 pages. 7 figures and 6 tables including appendix

  4. arXiv:2102.11582  [pdf, other

    cs.LG stat.ML

    Deep Deterministic Uncertainty: A Simple Baseline

    Authors: Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

    Abstract: Reliable uncertainty from deterministic single-forward pass models is sought after because conventional methods of uncertainty quantification are computationally expensive. We take two complex single-forward-pass uncertainty approaches, DUQ and SNGP, and examine whether they mainly rely on a well-regularized feature space. Crucially, without using their more complex methods for estimating uncertai… ▽ More

    Submitted 28 January, 2022; v1 submitted 23 February, 2021; originally announced February 2021.

  5. arXiv:2102.11409  [pdf, other

    cs.LG stat.ML

    On Feature Collapse and Deep Kernel Learning for Single Forward Pass Uncertainty

    Authors: Joost van Amersfoort, Lewis Smith, Andrew Jesson, Oscar Key, Yarin Gal

    Abstract: Inducing point Gaussian process approximations are often considered a gold standard in uncertainty estimation since they retain many of the properties of the exact GP and scale to large datasets. A major drawback is that they have difficulty scaling to high dimensional inputs. Deep Kernel Learning (DKL) promises a solution: a deep feature extractor transforms the inputs over which an inducing poin… ▽ More

    Submitted 7 March, 2022; v1 submitted 22 February, 2021; originally announced February 2021.

  6. arXiv:2007.00389  [pdf, other

    cs.LG stat.ML

    Single Shot Structured Pruning Before Training

    Authors: Joost van Amersfoort, Milad Alizadeh, Sebastian Farquhar, Nicholas Lane, Yarin Gal

    Abstract: We introduce a method to speed up training by 2x and inference by 3x in deep neural networks using structured pruning applied before training. Unlike previous works on pruning before training which prune individual weights, our work develops a methodology to remove entire channels and hidden units with the explicit aim of speeding up training and inference. We introduce a compute-aware scoring mec… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  7. arXiv:2003.02037  [pdf, other

    cs.LG stat.ML

    Uncertainty Estimation Using a Single Deep Deterministic Neural Network

    Authors: Joost van Amersfoort, Lewis Smith, Yee Whye Teh, Yarin Gal

    Abstract: We propose a method for training a deterministic deep model that can find and reject out of distribution data points at test time with a single forward pass. Our approach, deterministic uncertainty quantification (DUQ), builds upon ideas of RBF networks. We scale training in these with a novel loss function and centroid updating scheme and match the accuracy of softmax models. By enforcing detecta… ▽ More

    Submitted 29 June, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

  8. arXiv:1906.08158  [pdf, other

    cs.LG stat.ML

    BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning

    Authors: Andreas Kirsch, Joost van Amersfoort, Yarin Gal

    Abstract: We develop BatchBALD, a tractable approximation to the mutual information between a batch of points and model parameters, which we use as an acquisition function to select multiple informative points jointly for the task of deep Bayesian active learning. BatchBALD is a greedy linear-time $1 - \frac{1}{e}$-approximate algorithm amenable to dynamic programming and efficient caching. We compare Batch… ▽ More

    Submitted 28 October, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

  9. arXiv:1902.03876  [pdf, other

    cs.LG cs.IR stat.ML

    Deep Hashing using Entropy Regularised Product Quantisation Network

    Authors: Jo Schlemper, Jose Caballero, Andy Aitken, Joost van Amersfoort

    Abstract: In large scale systems, approximate nearest neighbour search is a crucial algorithm to enable efficient data retrievals. Recently, deep learning-based hashing algorithms have been proposed as a promising paradigm to enable data dependent schemes. Often their efficacy is only demonstrated on data sets with fixed, limited numbers of classes. In practical scenarios, those labels are not always availa… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

  10. arXiv:1412.6581  [pdf, other

    stat.ML cs.LG cs.NE

    Variational Recurrent Auto-Encoders

    Authors: Otto Fabius, Joost R. van Amersfoort

    Abstract: In this paper we propose a model that combines the strengths of RNNs and SGVB: the Variational Recurrent Auto-Encoder (VRAE). Such a model can be used for efficient, large scale unsupervised learning on time series data, map** the time series data to a latent vector representation. The model is generative, such that data can be generated from samples of the latent space. An important contributio… ▽ More

    Submitted 15 June, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Accepted at ICLR workshop track