Skip to main content

Showing 1–16 of 16 results for author: Farquhar, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2304.08151  [pdf, other

    cs.LG stat.ML

    Prediction-Oriented Bayesian Active Learning

    Authors: Freddie Bickford Smith, Andreas Kirsch, Sebastian Farquhar, Yarin Gal, Adam Foster, Tom Rainforth

    Abstract: Information-theoretic approaches to active learning have traditionally focused on maximising the information gathered about the model parameters, most commonly by optimising the BALD score. We highlight that this can be suboptimal from the perspective of predictive performance. For example, BALD lacks a notion of an input distribution and so is prone to prioritise data of limited relevance. To add… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Published at AISTATS 2023

  2. arXiv:2301.05062  [pdf, other

    cs.LG cs.AI stat.ML

    Tracr: Compiled Transformers as a Laboratory for Interpretability

    Authors: David Lindner, János Kramár, Sebastian Farquhar, Matthew Rahtz, Thomas McGrath, Vladimir Mikulik

    Abstract: We show how to "compile" human-readable programs into standard decoder-only transformer models. Our compiler, Tracr, generates models with known structure. This structure can be used to design experiments. For example, we use it to study "superposition" in transformers that execute multi-step algorithms. Additionally, the known structure of Tracr-compiled models can serve as ground-truth for evalu… ▽ More

    Submitted 3 November, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Presented at NeurIPS 2023 (Spotlight)

  3. arXiv:2211.06291  [pdf, other

    cs.LG cs.AI stat.ML

    Do Bayesian Neural Networks Need To Be Fully Stochastic?

    Authors: Mrinank Sharma, Sebastian Farquhar, Eric Nalisnick, Tom Rainforth

    Abstract: We investigate the benefit of treating all the parameters in a Bayesian neural network stochastically and find compelling theoretical and empirical evidence that this standard construction may be unnecessary. To this end, we prove that expressive predictive distributions require only small amounts of stochasticity. In particular, partially stochastic networks with only $n$ stochastic biases are un… ▽ More

    Submitted 20 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Published at AISTATS2023 (Oral)

  4. arXiv:2211.06139  [pdf, other

    stat.ML cs.LG

    Understanding Approximation for Bayesian Inference in Neural Networks

    Authors: Sebastian Farquhar

    Abstract: Bayesian inference has theoretical attractions as a principled framework for reasoning about beliefs. However, the motivations of Bayesian inference which claim it to be the only 'rational' kind of reasoning do not apply in practice. They create a binary split in which all approximate inference is equally 'irrational'. Instead, we should ask ourselves how to define a spectrum of more- and less-rat… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted as a thesis satisfying the requirements of a D.Phil at the Universty of Oxford

  5. arXiv:2204.10018  [pdf, other

    cs.AI stat.ML

    Path-Specific Objectives for Safer Agent Incentives

    Authors: Sebastian Farquhar, Ryan Carey, Tom Everitt

    Abstract: We present a general framework for training safe agents whose naive incentives are unsafe. As an example, manipulative or deceptive behaviour can improve rewards but should be avoided. Most approaches fail here: agents maximize expected return by any means necessary. We formally describe settings with 'delicate' parts of the state which should not be used as a means to an end. We then train agents… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Presented at AAAI 2022

  6. arXiv:2202.06881  [pdf, other

    cs.LG stat.ML

    Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

    Authors: Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

    Abstract: We propose Active Surrogate Estimators (ASEs), a new method for label-efficient model evaluation. Evaluating model performance is a challenging and important problem when labels are expensive. ASEs address this active testing problem using a surrogate-based estimation approach that interpolates the errors of points with unknown labels, rather than forming a Monte Carlo estimator. ASEs actively lea… ▽ More

    Submitted 18 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted for publication at NeurIPS 2022

  7. arXiv:2106.12059  [pdf, other

    cs.LG stat.ML

    Stochastic Batch Acquisition: A Simple Baseline for Deep Active Learning

    Authors: Andreas Kirsch, Sebastian Farquhar, Parmida Atighehchian, Andrew Jesson, Frederic Branchaud-Charron, Yarin Gal

    Abstract: We examine a simple stochastic strategy for adapting well-known single-point acquisition functions to allow batch active learning. Unlike acquiring the top-K points from the pool set, score- or rank-based sampling takes into account that acquisition scores change as new data are acquired. This simple strategy for adapting standard single-sample acquisition strategies can even perform just as well… ▽ More

    Submitted 19 September, 2023; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: TMLR Paper: https://openreview.net/forum?id=vcHwQyNBjW

  8. arXiv:2103.05331  [pdf, other

    stat.ML cs.LG

    Active Testing: Sample-Efficient Model Evaluation

    Authors: Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

    Abstract: We introduce a new framework for sample-efficient model evaluation that we call active testing. While approaches like active learning reduce the number of labels needed for model training, existing literature largely ignores the cost of labeling test data, typically unrealistically assuming large test sets for model evaluation. This creates a disconnect to real applications, where test labels are… ▽ More

    Submitted 14 June, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: Published at the 38th International Conference on Machine Learning (ICML 2021)

  9. arXiv:2101.11665  [pdf, other

    stat.ML cs.LG

    On Statistical Bias In Active Learning: How and When To Fix It

    Authors: Sebastian Farquhar, Yarin Gal, Tom Rainforth

    Abstract: Active learning is a powerful tool when labelling data is expensive, but it introduces a bias because the training data no longer follows the population distribution. We formalize this bias and investigate the situations in which it can be harmful and sometimes even helpful. We further introduce novel corrective weights to remove bias when doing so is beneficial. Through this, our work not only pr… ▽ More

    Submitted 31 May, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Published at ICLR 2021 (Spotlight)

  10. arXiv:2007.00389  [pdf, other

    cs.LG stat.ML

    Single Shot Structured Pruning Before Training

    Authors: Joost van Amersfoort, Milad Alizadeh, Sebastian Farquhar, Nicholas Lane, Yarin Gal

    Abstract: We introduce a method to speed up training by 2x and inference by 3x in deep neural networks using structured pruning applied before training. Unlike previous works on pruning before training which prune individual weights, our work develops a methodology to remove entire channels and hidden units with the explicit aim of speeding up training and inference. We introduce a compute-aware scoring mec… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  11. arXiv:2002.03704  [pdf, other

    cs.LG stat.ML

    Liberty or Depth: Deep Bayesian Neural Nets Do Not Need Complex Weight Posterior Approximations

    Authors: Sebastian Farquhar, Lewis Smith, Yarin Gal

    Abstract: We challenge the longstanding assumption that the mean-field approximation for variational inference in Bayesian neural networks is severely restrictive, and show this is not the case in deep networks. We prove several results indicating that deep mean-field variational weight posteriors can induce similar distributions in function-space to those induced by shallower networks with complex weight p… ▽ More

    Submitted 10 March, 2021; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Advances In Neural Information Processing Systems. 2020

  12. arXiv:1912.10481  [pdf, other

    stat.ML cs.LG eess.IV

    A Systematic Comparison of Bayesian Deep Learning Robustness in Diabetic Retinopathy Tasks

    Authors: Angelos Filos, Sebastian Farquhar, Aidan N. Gomez, Tim G. J. Rudner, Zachary Kenton, Lewis Smith, Milad Alizadeh, Arnoud de Kroon, Yarin Gal

    Abstract: Evaluation of Bayesian deep learning (BDL) methods is challenging. We often seek to evaluate the methods' robustness and scalability, assessing whether new tools give `better' uncertainty estimates than old ones. These evaluations are paramount for practitioners when choosing BDL tools on-top of which they build their applications. Current popular evaluations of BDL methods, such as the UCI experi… ▽ More

    Submitted 22 December, 2019; originally announced December 2019.

  13. arXiv:1907.00865  [pdf, other

    stat.ML cs.LG

    Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning

    Authors: Sebastian Farquhar, Michael Osborne, Yarin Gal

    Abstract: We propose Radial Bayesian Neural Networks (BNNs): a variational approximate posterior for BNNs which scales well to large models while maintaining a distribution over weight-space with full support. Other scalable Bayesian deep learning methods, like MC dropout or deep ensembles, have discrete support-they assign zero probability to almost all of the weight-space. Unlike these discrete support me… ▽ More

    Submitted 31 May, 2021; v1 submitted 1 July, 2019; originally announced July 2019.

    Journal ref: AI Stats, PMLR 108:1352-1362, 2020

  14. arXiv:1902.06497  [pdf, other

    stat.ML cs.LG

    Differentially Private Continual Learning

    Authors: Sebastian Farquhar, Yarin Gal

    Abstract: Catastrophic forgetting can be a significant problem for institutions that must delete historic data for privacy reasons. For example, hospitals might not be able to retain patient data permanently. But neural networks trained on recent data alone will tend to forget lessons learned on old data. We present a differentially private continual learning framework based on variational inference. We est… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: Presented at the Privacy in Machine Learning and AI workshop at ICML 2018

  15. arXiv:1902.06494  [pdf, other

    stat.ML cs.LG

    A Unifying Bayesian View of Continual Learning

    Authors: Sebastian Farquhar, Yarin Gal

    Abstract: Some machine learning applications require continual learning - where data comes in a sequence of datasets, each is used for training and then permanently discarded. From a Bayesian perspective, continual learning seems straightforward: Given the model posterior one would simply use this as the prior for the next task. However, exact posterior evaluation is intractable with many models, especially… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: Presented at the Bayesian Deep Learning Workshop at Neural Information Processing Systems December 2018

  16. arXiv:1805.09733  [pdf, other

    stat.ML cs.LG

    Towards Robust Evaluations of Continual Learning

    Authors: Sebastian Farquhar, Yarin Gal

    Abstract: Experiments used in current continual learning research do not faithfully assess fundamental challenges of learning continually. Instead of assessing performance on challenging and representative experiment designs, recent research has focused on increased dataset difficulty, while still using flawed experiment set-ups. We examine standard evaluations and show why these evaluations make some conti… ▽ More

    Submitted 26 June, 2019; v1 submitted 24 May, 2018; originally announced May 2018.