Showing 1–21 of 21 results for author: Jesson, A

  1. arXiv:2406.07457  [pdf, other]

    cs.LG stat.ML

    Estimating the Hallucination Rate of Generative AI

    Authors: Andrew Jesson, Nicolas Beltran-Velez, Quentin Chu, Sweta Karlekar, Jannik Kossen, Yarin Gal, John P. Cunningham, David Blei

    Abstract: This work is about estimating the hallucination rate for in-context learning (ICL) with Generative AI. In ICL, a conditional generative model (CGM) is prompted with a dataset and asked to make a prediction based on that dataset. The Bayesian interpretation of ICL assumes that the CGM is calculating a posterior predictive distribution over an unknown Bayesian model of a latent parameter and data. W…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2312.04064  [pdf, other]

    q-bio.QM cs.LG stat.ME

    DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design

    Authors: Clare Lyle, Arash Mehrjou, Pascal Notin, Andrew Jesson, Stefan Bauer, Yarin Gal, Patrick Schwab

    Abstract: The discovery of therapeutics to treat genetically-driven pathologies relies on identifying genes involved in the underlying disease mechanisms. Existing approaches search over the billions of potential interventions to maximize the expected influence on the target phenotype. However, to reduce the risk of failure in future stages of trials, practical experiment design aims to find a set of interv…

    Submitted 7 December, 2023; originally announced December 2023.

    Journal ref: International Conference on Machine Learning, 2023

  3. arXiv:2306.15058  [pdf, other]

    cs.LG stat.ML

    BatchGFN: Generative Flow Networks for Batch Active Learning

    Authors: Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

    Abstract: We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

  4. arXiv:2306.01460  [pdf, other]

    cs.LG

    ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

    Authors: Andrew Jesson, Chris Lu, Gunshi Gupta, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal

    Abstract: This paper introduces an effective and practical step toward approximate Bayesian inference in on-policy actor-critic deep reinforcement learning. This step manifests as three simple modifications to the Asynchronous Advantage Actor-Critic (A3C) algorithm: (1) applying a ReLU function to advantage estimates, (2) spectral normalization of actor-critic weights, and (3) incorporating dropout as a Bay…

    Submitted 24 November, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  5. arXiv:2304.10577  [pdf, other]

    cs.LG stat.ML

    B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding

    Authors: Miruna Oprescu, Jacob Dorn, Marah Ghoummaid, Andrew Jesson, Nathan Kallus, Uri Shalit

    Abstract: Estimating heterogeneous treatment effects from observational data is a crucial task across many fields, helping policy and decision-makers take better actions. There has been recent progress on robust and efficient methods for estimating the conditional average treatment effect (CATE) function, but these methods often do not take into account the risk of hidden confounding, which could arbitraril…

    Submitted 13 June, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 20 pages, 4 figures, ICML 2023

    Journal ref: PMLR 202 (2023) 26599-26618

  6. arXiv:2302.10607  [pdf, other]

    cs.LG cs.AI stat.ME

    Differentiable Multi-Target Causal Bayesian Experimental Design

    Authors: Yashas Annadani, Panagiotis Tigas, Desi R. Ivanova, Andrew Jesson, Yarin Gal, Adam Foster, Stefan Bauer

    Abstract: We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting -- a critical component for causal discovery from finite data where interventions can be costly or risky. Existing methods rely on greedy approximations to construct a batch of experiments while using black-box methods to optimize over a single target-state pair…

    Submitted 2 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Camera-ready version ICML 2023

  7. arXiv:2301.11921  [pdf, other]

    physics.data-an cs.LG physics.ao-ph

    Using uncertainty-aware machine learning models to study aerosol-cloud interactions

    Authors: Maëlys Solal, Andrew Jesson, Yarin Gal, Alyson Douglas

    Abstract: Aerosol-cloud interactions (ACI) include various effects that result from aerosols entering a cloud, and affecting cloud properties. In general, an increase in aerosol concentration results in smaller droplet sizes which leads to larger, brighter, longer-lasting clouds that reflect more sunlight and cool the Earth. The strength of the effect is however heterogeneous, meaning it depends on the surr…

    Submitted 30 November, 2022; originally announced January 2023.

  8. arXiv:2204.11206  [pdf, other]

    stat.ME cs.LG stat.ML

    Partial Identification of Dose Responses with Hidden Confounders

    Authors: Myrl G. Marmarelis, Elizabeth Haddad, Andrew Jesson, Neda Jahanshad, Aram Galstyan, Greg Ver Steeg

    Abstract: Inferring causal effects of continuous-valued treatments from observational data is a crucial task promising to better inform policy- and decision-makers. A critical assumption needed to identify these effects is that all confounding variables -- causal parents of both the treatment and the outcome -- are included as covariates. Unfortunately, given observational data alone, we cannot know with ce…

    Submitted 12 June, 2023; v1 submitted 24 April, 2022; originally announced April 2022.

  9. arXiv:2204.10022  [pdf, other]

    cs.LG stat.ML

    Scalable Sensitivity and Uncertainty Analysis for Causal-Effect Estimates of Continuous-Valued Interventions

    Authors: Andrew Jesson, Alyson Douglas, Peter Manshausen, Maëlys Solal, Nicolai Meinshausen, Philip Stier, Yarin Gal, Uri Shalit

    Abstract: Estimating the effects of continuous-valued interventions from observational data is a critically important task for climate science, healthcare, and economics. Recent work focuses on designing neural network architectures and regularization functions to allow for scalable estimation of average and individual-level dose-response curves from high-dimensional, large-sample data. Such methodologies a…

    Submitted 12 October, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: 33 pages

  10. arXiv:2203.02016  [pdf, other]

    cs.LG cs.AI stat.ML

    Interventions, Where and How? Experimental Design for Causal Models at Scale

    Authors: Panagiotis Tigas, Yashas Annadani, Andrew Jesson, Bernhard Schölkopf, Yarin Gal, Stefan Bauer

    Abstract: Causal discovery from observational and interventional data is challenging due to limited data and non-identifiability: factors that introduce uncertainty in estimating the underlying structural causal model (SCM). Selecting experiments (interventions) based on the uncertainty arising from both factors can expedite the identification of the SCM. Existing methods in experimental design for causal d…

    Submitted 21 October, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Presented at the thirty-sixth Conference on Neural Information Processing Systems (2022)

  11. arXiv:2111.02275  [pdf, other]

    cs.LG stat.ML

    Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

    Authors: Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort, Andreas Kirsch, Uri Shalit, Yarin Gal

    Abstract: Estimating personalized treatment effects from high-dimensional observational data is essential in situations where experimental designs are infeasible, unethical, or expensive. Existing approaches rely on fitting deep models on outcomes observed for treated and control populations. However, when measuring individual outcomes is costly, as is the case of a tumor biopsy, a sample-efficient strategy…

    Submitted 1 February, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 24 pages, 8 Figures, 5 tables, NeurIPS 2021

  12. arXiv:2110.15084  [pdf, other]

    physics.ao-ph cs.LG physics.data-an

    Using Non-Linear Causal Models to Study Aerosol-Cloud Interactions in the Southeast Pacific

    Authors: Andrew Jesson, Peter Manshausen, Alyson Douglas, Duncan Watson-Parris, Yarin Gal, Philip Stier

    Abstract: Aerosol-cloud interactions include a myriad of effects that all begin when aerosol enters a cloud and acts as cloud condensation nuclei (CCN). An increase in CCN results in a decrease in the mean cloud droplet size (r$_{e}$). The smaller droplet size leads to brighter, more expansive, and longer lasting clouds that reflect more incoming sunlight, thus cooling the earth. Globally, aerosol-cloud int…

    Submitted 3 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

  13. arXiv:2110.11875  [pdf, other]

    cs.LG stat.ML

    GeneDisco: A Benchmark for Experimental Design in Drug Discovery

    Authors: Arash Mehrjou, Ashkan Soleymani, Andrew Jesson, Pascal Notin, Yarin Gal, Stefan Bauer, Patrick Schwab

    Abstract: In vitro cellular experimentation with genetic interventions, using for example CRISPR technologies, is an essential step in early-stage drug discovery and target validation that serves to assess initial hypotheses about causal associations between biological mechanisms and disease pathologies. With billions of potential hypotheses to test, the experimental design space for in vitro genetic experi…

    Submitted 22 October, 2021; originally announced October 2021.

  14. arXiv:2106.12059  [pdf, other]

    cs.LG stat.ML

    Stochastic Batch Acquisition: A Simple Baseline for Deep Active Learning

    Authors: Andreas Kirsch, Sebastian Farquhar, Parmida Atighehchian, Andrew Jesson, Frederic Branchaud-Charron, Yarin Gal

    Abstract: We examine a simple stochastic strategy for adapting well-known single-point acquisition functions to allow batch active learning. Unlike acquiring the top-K points from the pool set, score- or rank-based sampling takes into account that acquisition scores change as new data are acquired. This simple strategy for adapting standard single-sample acquisition strategies can even perform just as well…

    Submitted 19 September, 2023; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: TMLR Paper: https://openreview.net/forum?id=vcHwQyNBjW

  15. arXiv:2103.04850  [pdf, other]

    cs.LG stat.ML

    Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding

    Authors: Andrew Jesson, Sören Mindermann, Yarin Gal, Uri Shalit

    Abstract: We study the problem of learning conditional average treatment effects (CATE) from high-dimensional, observational data with unobserved confounders. Unobserved confounders introduce ignorance -- a level of unidentifiability -- about an individual's response to treatment by inducing bias in CATE estimates. We present a new parametric interval estimator suited for high-dimensional data, that estimat…

    Submitted 1 February, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 19 pages, 5 figures, ICML 2021

    Journal ref: PMLR 139 (2021) 4829-4838

  16. arXiv:2102.11409  [pdf, other]

    cs.LG stat.ML

    On Feature Collapse and Deep Kernel Learning for Single Forward Pass Uncertainty

    Authors: Joost van Amersfoort, Lewis Smith, Andrew Jesson, Oscar Key, Yarin Gal

    Abstract: Inducing point Gaussian process approximations are often considered a gold standard in uncertainty estimation since they retain many of the properties of the exact GP and scale to large datasets. A major drawback is that they have difficulty scaling to high dimensional inputs. Deep Kernel Learning (DKL) promises a solution: a deep feature extractor transforms the inputs over which an inducing poin…

    Submitted 7 March, 2022; v1 submitted 22 February, 2021; originally announced February 2021.

  17. arXiv:2007.00163  [pdf, other]

    cs.LG stat.ML

    Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models

    Authors: Andrew Jesson, Sören Mindermann, Uri Shalit, Yarin Gal

    Abstract: Recommending the best course of action for an individual is a major application of individual-level causal effect estimation. This application is often needed in safety-critical domains such as healthcare, where estimating and communicating uncertainty to decision-makers is crucial. We introduce a practical approach for integrating uncertainty estimation into a class of state-of-the-art neural net…

    Submitted 22 October, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

  18. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  19. CASED: Curriculum Adaptive Sampling for Extreme Data Imbalance

    Authors: Andrew Jesson, Nicolas Guizard, Sina Hamidi Ghalehjegh, Damien Goblot, Florian Soudan, Nicolas Chapados

    Abstract: We introduce CASED, a novel curriculum sampling algorithm that facilitates the optimization of deep learning segmentation or detection models on data sets with extreme class imbalance. We evaluate the CASED learning framework on the task of lung nodule detection in chest CT. In contrast to two-stage solutions, wherein nodule candidates are first proposed by a segmentation model and refined by a se…

    Submitted 27 July, 2018; originally announced July 2018.

    Comments: 20th International Conference on Medical Image Computing and Computer Assisted Intervention 2017

  20. arXiv:1807.05344  [pdf]

    stat.ML cs.LG

    Adversarially Learned Mixture Model

    Authors: Andrew Jesson, Cécile Low-Kam, Tanya Nair, Florian Soudan, Florent Chandelier, Nicolas Chapados

    Abstract: The Adversarially Learned Mixture Model (AMM) is a generative model for unsupervised or semi-supervised data clustering. The AMM is the first adversarially optimized method to model the conditional dependence between inferred continuous and categorical latent variables. Experiments on the MNIST and SVHN datasets show that the AMM allows for semantic separation of complex data when little or no lab…

    Submitted 23 April, 2022; v1 submitted 14 July, 2018; originally announced July 2018.

  21. arXiv:1806.00852  [pdf, other]

    cs.LG cs.AI stat.ML

    On the Importance of Attention in Meta-Learning for Few-Shot Text Classification

    Authors: Xiang Jiang, Mohammad Havaei, Gabriel Chartrand, Hassan Chouaib, Thomas Vincent, Andrew Jesson, Nicolas Chapados, Stan Matwin

    Abstract: Current deep learning based text classification methods are limited by their ability to achieve fast learning and generalization when the data is scarce. We address this problem by integrating a meta-learning procedure that uses the knowledge learned across many tasks as an inductive bias towards better natural language understanding. Based on the Model-Agnostic Meta-Learning framework (MAML), we…

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: 13 pages, 4 figures, submitted to NIPS