
Showing 1–50 of 133 results for author: Blei, D

  1. arXiv:2406.07658  [pdf, other]

    cs.LG stat.ML

    Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted Trees

    Authors: Nicolas Beltran-Velez, Alessandro Antonio Grande, Achille Nazaret, Alp Kucukelbir, David Blei

    Abstract: Probabilistic prediction aims to compute predictive distributions rather than single-point predictions. These distributions enable practitioners to quantify uncertainty, compute risk, and detect outliers. However, most probabilistic methods assume parametric responses, such as Gaussian or Poisson distributions. When these assumptions fail, such models lead to bad predictions and poorly calibrated…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.07457  [pdf, other]

    cs.LG stat.ML

    Estimating the Hallucination Rate of Generative AI

    Authors: Andrew Jesson, Nicolas Beltran-Velez, Quentin Chu, Sweta Karlekar, Jannik Kossen, Yarin Gal, John P. Cunningham, David Blei

    Abstract: This work is about estimating the hallucination rate for in-context learning (ICL) with Generative AI. In ICL, a conditional generative model (CGM) is prompted with a dataset and asked to make a prediction based on that dataset. The Bayesian interpretation of ICL assumes that the CGM is calculating a posterior predictive distribution over an unknown Bayesian model of a latent parameter and data. W…

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2404.09113  [pdf, other]

    stat.ML cs.LG math.ST

    Extending Mean-Field Variational Inference via Entropic Regularization: Theory and Computation

    Authors: Bohan Wu, David Blei

    Abstract: Variational inference (VI) has emerged as a popular method for approximate inference for high-dimensional Bayesian models. In this paper, we propose a novel VI method that extends the naive mean field via entropic regularization, referred to as $Ξ$-variational inference ($Ξ$-VI). $Ξ$-VI has a close connection to the entropic optimal transport problem and benefits from the computationally efficient…

    Submitted 16 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  4. arXiv:2402.14758  [pdf, other]

    stat.ML cs.AI cs.LG stat.CO

    Batch and match: black-box variational inference with a score-based divergence

    Authors: Diana Cai, Chirag Modi, Loucas Pillaud-Vivien, Charles C. Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul

    Abstract: Most leading implementations of black-box variational inference (BBVI) are based on optimizing a stochastic evidence lower bound (ELBO). But such approaches to BBVI often converge slowly due to the high variance of their gradient estimates and their sensitivity to hyperparameters. In this work, we propose batch and match (BaM), an alternative approach to BBVI based on a score-based divergence. Not…

    Submitted 12 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 49 pages, 14 figures. To appear in the Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  5. arXiv:2401.05330  [pdf, other]

    stat.ME stat.ML

    Hierarchical Causal Models

    Authors: Eli N. Weinstein, David M. Blei

    Abstract: Scientists often want to learn about cause and effect from hierarchical data, collected from subunits nested inside units. Consider students in schools, cells in patients, or cities in states. In such settings, unit-level variables (e.g. each school's budget) may affect subunit-level variables (e.g. the test scores of each student in each school) and vice versa. To address causal questions with hi…

    Submitted 26 June, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 75 pages, 29 figures. Supplementary code: https://github.com/EWeinstein/HCM

  6. arXiv:2311.10263  [pdf, other]

    cs.LG stat.ME

    Stable Differentiable Causal Discovery

    Authors: Achille Nazaret, Justin Hong, Elham Azizi, David Blei

    Abstract: Inferring causal relationships as directed acyclic graphs (DAGs) is an important but challenging problem. Differentiable Causal Discovery (DCD) is a promising approach to this problem, framing the search as a continuous optimization. But existing DCD methods are numerically unstable, with poor performance beyond tens of variables. In this paper, we propose Stable Differentiable Causal Discovery (S…

    Submitted 27 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  7. arXiv:2307.11018  [pdf, other]

    stat.ML cs.LG

    Amortized Variational Inference: When and Why?

    Authors: Charles C. Margossian, David M. Blei

    Abstract: In a probabilistic latent variable model, factorized (or mean-field) variational inference (F-VI) fits a separate parametric distribution for each latent variable. Amortized variational inference (A-VI) instead learns a common inference function, which maps each observation to its corresponding latent variable's approximate posterior. Typically, A-VI is used as a step in the training of variationa…

    Submitted 23 May, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

  8. arXiv:2307.07849  [pdf, other]

    stat.ML cs.LG

    Variational Inference with Gaussian Score Matching

    Authors: Chirag Modi, Charles Margossian, Yuling Yao, Robert Gower, David Blei, Lawrence Saul

    Abstract: Variational inference (VI) is a method to approximate the computationally intractable posterior distributions that arise in Bayesian statistics. Typically, VI fits a simple parametric distribution to the target posterior by minimizing an appropriate objective such as the evidence lower bound (ELBO). In this work, we present a new approach to VI based on the principle of score matching, that if two…

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: Python code for the GSM-VI algorithm is available at https://github.com/modichirag/GSM-VI

  9. arXiv:2306.17775  [pdf, other]

    stat.ML cs.LG q-bio.BM

    Practical and Asymptotically Exact Conditional Sampling in Diffusion Models

    Authors: Luhuan Wu, Brian L. Trippe, Christian A. Naesseth, David M. Blei, John P. Cunningham

    Abstract: Diffusion models have been successful on a range of conditional generation tasks including molecular design and text-to-image generation. However, these achievements have primarily depended on task-specific conditional training or error-prone heuristic approximations. Ideally, a conditional generation method should provide exact samples for a broad range of conditional distributions without requir…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Code: https://github.com/blt2114/twisted_diffusion_sampler

  10. arXiv:2306.12497  [pdf, other]

    cs.LG stat.ML

    Density Uncertainty Layers for Reliable Uncertainty Estimation

    Authors: Yookoon Park, David M. Blei

    Abstract: Assessing the predictive uncertainty of deep neural networks is crucial for safety-related applications of deep learning. Although Bayesian deep learning offers a principled framework for estimating model uncertainty, the common approaches that approximate the parameter posterior often fail to deliver reliable estimates of predictive uncertainty. In this paper, we propose a novel criterion for rel…

    Submitted 4 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Published in AISTATS 2024

  11. arXiv:2306.00542  [pdf, other]

    stat.ML cs.AI cs.LG

    Nonparametric Identifiability of Causal Representations from Unknown Interventions

    Authors: Julius von Kügelgen, Michel Besserve, Liang Wendong, Luigi Gresele, Armin Kekić, Elias Bareinboim, David M. Blei, Bernhard Schölkopf

    Abstract: We study causal representation learning, the task of inferring latent causal variables and their causal relations from high-dimensional mixtures of the variables. Prior work relies on weak supervision, in the form of counterfactual pre- and post-intervention views or temporal structure; places restrictive assumptions, such as linearity, on the mixing function or latent causal model; or requires pa…

    Submitted 28 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready version; 36 pages, 4 figures

    MSC Class: 68T05; ACM Class: I.2.6

  12. arXiv:2302.12777  [pdf, other]

    stat.ME econ.EM

    On the Misspecification of Linear Assumptions in Synthetic Control

    Authors: Achille Nazaret, Claudia Shi, David M. Blei

    Abstract: The synthetic control (SC) method is a popular approach for estimating treatment effects from observational panel data. It rests on a crucial assumption that we can write the treated unit as a linear combination of the untreated units. This linearity assumption, however, can be unlikely to hold in practice and, when violated, the resulting SC estimates are incorrect. In this paper we examine two q…

    Submitted 24 February, 2023; originally announced February 2023.

  13. arXiv:2301.00537  [pdf, other]

    stat.ML cs.LG

    Posterior Collapse and Latent Variable Non-identifiability

    Authors: Yixin Wang, David M. Blei, John P. Cunningham

    Abstract: Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful re…

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: 19 pages, 4 figures; NeurIPS 2021

  14. arXiv:2209.10091  [pdf, other]

    cs.LG stat.ML

    Variational Inference for Infinitely Deep Neural Networks

    Authors: Achille Nazaret, David Blei

    Abstract: We introduce the unbounded depth neural network (UDN), an infinitely deep probabilistic model that adapts its complexity to the training data. The UDN contains an infinite sequence of hidden layers and places an unbounded prior on a truncation L, the layer from which it produces its data. Given a dataset of observations, the posterior UDN provides a conditional distribution of both the parameters…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Published at ICML 2022

  15. arXiv:2207.09535  [pdf, other]

    cs.LG stat.ML

    Forget-me-not! Contrastive Critics for Mitigating Posterior Collapse

    Authors: Sachit Menon, David Blei, Carl Vondrick

    Abstract: Variational autoencoders (VAEs) suffer from posterior collapse, where the powerful neural networks used for modeling and inference optimize the objective without meaningfully using the latent representation. We introduce inference critics that detect and incentivize against posterior collapse by requiring correspondence between latent variables and the observations. By connecting the critic's obje…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2022

  16. arXiv:2206.15433  [pdf, other]

    astro-ph.IM astro-ph.CO stat.ML

    Reconstructing the Universe with Variational self-Boosted Sampling

    Authors: Chirag Modi, Yin Li, David Blei

    Abstract: Forward modeling approaches in cosmology have made it possible to reconstruct the initial conditions at the beginning of the Universe from the observed survey data. However the high dimensionality of the parameter space still poses a challenge to explore the full posterior, with traditional algorithms such as Hamiltonian Monte Carlo (HMC) being computationally inefficient due to generating correla…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: A shorter version of this paper is accepted for spotlight presentation in Machine Learning for Astrophysics Workshop at ICML, 2022

  17. arXiv:2206.06584  [pdf, other]

    stat.ML cs.LG stat.ME

    Probabilistic Conformal Prediction Using Conditional Random Samples

    Authors: Zhendong Wang, Ruijiang Gao, Mingzhang Yin, Mingyuan Zhou, David M. Blei

    Abstract: This paper proposes probabilistic conformal prediction (PCP), a predictive inference algorithm that estimates a target variable by a discontinuous predictive set. Given inputs, PCP constructs the predictive set based on random samples from an estimated generative model. It is efficient and compatible with either explicit or implicit conditional generative models. Theoretically, we show that PCP gua…

    Submitted 20 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  18. arXiv:2202.06797  [pdf, other]

    astro-ph.GA stat.AP

    Mapping Interstellar Dust with Gaussian Processes

    Authors: Andrew C. Miller, Lauren Anderson, Boris Leistedt, John P. Cunningham, David W. Hogg, David M. Blei

    Abstract: Interstellar dust corrupts nearly every stellar observation, and accounting for it is crucial to measuring physical properties of stars. We model the dust distribution as a spatially varying latent field with a Gaussian process (GP) and develop a likelihood model and inference method that scales to millions of astronomical observations. Modeling interstellar dust is complicated by two factors. The…

    Submitted 14 February, 2022; originally announced February 2022.

  19. arXiv:2202.01841  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

    Authors: Liyi Zhang, David M. Blei, Christian A. Naesseth

    Abstract: Variational inference often minimizes the "reverse" Kullback-Leibler (KL) KL(q||p) from the approximate distribution q to the posterior p. Recent work studies the "forward" KL KL(p||q), which unlike reverse KL does not lead to variational approximations that underestimate uncertainty. This paper introduces Transport Score Climbing (TSC), a method that optimizes KL(p||q) by using Hamiltonian Monte…

    Submitted 2 September, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 14 pages, 8 figures

  20. arXiv:2112.05671  [pdf, other]

    stat.ME econ.EM

    On the Assumptions of Synthetic Control Methods

    Authors: Claudia Shi, Dhanya Sridhar, Vishal Misra, David M. Blei

    Abstract: Synthetic control (SC) methods have been widely applied to estimate the causal effect of large-scale interventions, e.g., the state-wide effect of a change in policy. The idea of synthetic controls is to approximate one unit's counterfactual outcomes using a weighted combination of some other units' observed outcomes. The motivating question of this paper is: how does the SC strategy lead to valid…

    Submitted 14 December, 2021; v1 submitted 10 December, 2021; originally announced December 2021.

  21. arXiv:2112.03493  [pdf, other]

    stat.ME

    Conformal Sensitivity Analysis for Individual Treatment Effects

    Authors: Mingzhang Yin, Claudia Shi, Yixin Wang, David M. Blei

    Abstract: Estimating an individual treatment effect (ITE) is essential to personalized decision making. However, existing methods for estimating the ITE often rely on unconfoundedness, an assumption that is fundamentally untestable with observed data. To assess the robustness of individual-level causal conclusion with unconfoundedness, this paper proposes a method for sensitivity analysis of the ITE, a way…

    Submitted 12 July, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Journal of the American Statistical Association

  22. The Posterior Predictive Null

    Authors: Gemma E. Moran, John P. Cunningham, David M. Blei

    Abstract: Bayesian model criticism is an important part of the practice of Bayesian statistics. Traditionally, model criticism methods have been based on the predictive check, an adaptation of goodness-of-fit testing to Bayesian modeling and an effective method to understand how well a model captures the distribution of the data. In modern practice, however, researchers iteratively build and develop many mo…

    Submitted 6 July, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: To appear in Bayesian Analysis

  23. Adjusting for indirectly measured confounding using large-scale propensity scores

    Authors: Linying Zhang, Yixin Wang, Martijn Schuemie, David Blei, George Hripcsak

    Abstract: Confounding remains one of the major challenges to causal inference with observational data. This problem is paramount in medicine, where we would like to answer causal questions from large observational datasets like electronic health records (EHRs) and administrative claims. Modern medical data typically contain tens of thousands of covariates. Such a large set carries hope that many of the conf…

    Submitted 8 January, 2024; v1 submitted 23 October, 2021; originally announced October 2021.

  24. arXiv:2110.10804  [pdf, other]

    stat.ML cs.LG stat.ME

    Identifiable Deep Generative Models via Sparse Decoding

    Authors: Gemma E. Moran, Dhanya Sridhar, Yixin Wang, David M. Blei

    Abstract: We develop the sparse VAE for unsupervised representation learning on high-dimensional data. The sparse VAE learns a set of latent factors (representations) which summarize the associations in the observed data features. The underlying model is sparse in that each observed feature (i.e. each dimension of the data) depends on a small subset of the latent factors. As examples, in ratings data each m…

    Submitted 17 February, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

  25. arXiv:2109.11990  [pdf, other]

    stat.ME cs.LG stat.ML

    Optimization-based Causal Estimation from Heterogenous Environments

    Authors: Mingzhang Yin, Yixin Wang, David M. Blei

    Abstract: This paper presents a new optimization approach to causal estimation. Given data that contains covariates and an outcome, which covariates are causes of the outcome, and what is the strength of the causality? In classical machine learning (ML), the goal of optimization is to maximize predictive accuracy. However, some covariates might exhibit a non-causal association with the outcome. Such spuriou…

    Submitted 10 June, 2024; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: Journal of Machine Learning Research (JMLR). Code at https://github.com/mingzhang-yin/CoCo

  26. arXiv:2106.00075  [pdf, other]

    stat.ML cs.LG stat.CO

    Variational Combinatorial Sequential Monte Carlo Methods for Bayesian Phylogenetic Inference

    Authors: Antonio Khalil Moretti, Liyi Zhang, Christian A. Naesseth, Hadiah Venner, David Blei, Itsik Pe'er

    Abstract: Bayesian phylogenetic inference is often conducted via local or sequential search over topologies and branch lengths using algorithms such as random-walk Markov chain Monte Carlo (MCMC) or Combinatorial Sequential Monte Carlo (CSMC). However, when MCMC is used for evolutionary parameter learning, convergence requires long runs with inefficient exploration of the state space. We introduce Variation…

    Submitted 17 June, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: 15 pages, 9 figures

  27. arXiv:2103.00393  [pdf, other]

    cs.LG stat.ML

    Hierarchical Inducing Point Gaussian Process for Inter-domain Observations

    Authors: Luhuan Wu, Andrew Miller, Lauren Anderson, Geoff Pleiss, David Blei, John Cunningham

    Abstract: We examine the general problem of inter-domain Gaussian Processes (GPs): problems where the GP realization and the noisy observations of that realization lie on different domains. When the mapping between those domains is linear, such as integration or differentiation, inference is still closed form. However, many of the scaling and approximation techniques that our community has developed do not…

    Submitted 24 June, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

  28. arXiv:2011.12379  [pdf, other]

    cs.LG stat.ML

    Invariant Representation Learning for Treatment Effect Estimation

    Authors: Claudia Shi, Victor Veitch, David Blei

    Abstract: The defining challenge for causal inference from observational data is the presence of `confounders', covariates that affect both treatment assignment and the outcome. To address this challenge, practitioners collect and adjust for the covariates, hoping that they adequately correct for confounding. However, including every observed covariate in the adjustment runs the risk of including `bad contr…

    Submitted 27 July, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

  29. arXiv:2005.04232  [pdf, other]

    cs.CL cs.LG stat.ML

    Text-Based Ideal Points

    Authors: Keyon Vafa, Suresh Naidu, David M. Blei

    Abstract: Ideal point models analyze lawmakers' votes to quantify their political positions, or ideal points. But votes are not the only way to express a political position. Lawmakers also give speeches, release press statements, and post tweets. In this paper, we introduce the text-based ideal point model (TBIP), an unsupervised probabilistic topic model that analyzes texts to quantify the political positi…

    Submitted 21 July, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: Appeared in Proceedings of the 2020 Conference of the Association for Computational Linguistics (ACL 2020)

  30. arXiv:2003.10374  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Markovian Score Climbing: Variational Inference with KL(p||q)

    Authors: Christian A. Naesseth, Fredrik Lindsten, David Blei

    Abstract: Modern variational inference (VI) uses stochastic gradients to avoid intractable expectations, enabling large-scale probabilistic inference in complex models. VI posits a family of approximating distributions q and then finds the member of that family that is closest to the exact posterior p. Traditionally, VI algorithms minimize the "exclusive Kullback-Leibler (KL)" KL(q || p), often for computat…

    Submitted 22 February, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

  31. arXiv:2003.05554  [pdf, other]

    stat.ML cs.LG

    Linear-time inference for Gaussian Processes on one dimension

    Authors: Jackson Loper, David Blei, John P. Cunningham, Liam Paninski

    Abstract: Gaussian Processes (GPs) provide powerful probabilistic frameworks for interpolation, forecasting, and smoothing, but have been hampered by computational scaling issues. Here we investigate data sampled on one dimension (e.g., a scalar or vector time series sampled at arbitrarily-spaced intervals), for which state-space models are popular due to their linearly-scaling computational costs. It has l…

    Submitted 12 October, 2021; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Accepted to JMLR

    MSC Class: 60G15 (Primary) 68W10; 47B34 (Secondary)

    Journal ref: The Journal of Machine Learning Research, 2021

  32. arXiv:2003.04948  [pdf, ps, other]

    stat.ML cs.LG

    Towards Clarifying the Theory of the Deconfounder

    Authors: Yixin Wang, David M. Blei

    Abstract: Wang and Blei (2019) studies multiple causal inference and proposes the deconfounder algorithm. The paper discusses theoretical requirements and presents empirical studies. Several refinements have been suggested around the theory of the deconfounder. Among these, Imai and Jiang clarified the assumption of "no unobserved single-cause confounders." Using their assumption, this paper clarifies the t…

    Submitted 10 March, 2020; originally announced March 2020.

  33. arXiv:1910.12991  [pdf, other]

    stat.ML cs.LG

    Poisson-Randomized Gamma Dynamical Systems

    Authors: Aaron Schein, Scott W. Linderman, Mingyuan Zhou, David M. Blei, Hanna Wallach

    Abstract: This paper presents the Poisson-randomized gamma dynamical system (PRGDS), a model for sequentially observed count tensors that encodes a strong inductive bias toward sparsity and burstiness. The PRGDS is based on a new motif in Bayesian latent variable modeling, an alternating chain of discrete Poisson and continuous gamma latent states that is analytically convenient and computationally tractabl…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in the Proceedings of the 32nd Advances in Neural Information Processing Systems (NeurIPS 2019)

  34. arXiv:1910.07320  [pdf, ps, other]

    stat.ML cs.LG

    The Blessings of Multiple Causes: A Reply to Ogburn et al. (2019)

    Authors: Yixin Wang, David M. Blei

    Abstract: Ogburn et al. (2019, arXiv:1910.05438) discuss "The Blessings of Multiple Causes" (Wang and Blei, 2018, arXiv:1805.06826). Many of their remarks are interesting. But they also claim that the paper has "foundational errors" and that its "premise is...incorrect." These claims are not substantiated. There are no foundational errors; the premise is correct.

    Submitted 20 December, 2019; v1 submitted 15 October, 2019; originally announced October 2019.

  35. arXiv:1910.04302  [pdf, other]

    stat.ML cs.LG stat.ME

    Prescribed Generative Adversarial Networks

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei, Michalis K. Titsias

    Abstract: Generative adversarial networks (GANs) are a powerful approach to unsupervised learning. They have achieved state-of-the-art performance in the image domain. However, GANs are limited in two ways. They often learn distributions with low support---a phenomenon known as mode collapse---and they do not guarantee the existence of a probability density, which makes evaluating generalization using predi…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: Code for this paper can be found at https://github.com/adjidieng/PresGANs

  36. Population Predictive Checks

    Authors: Gemma E. Moran, David M. Blei, Rajesh Ranganath

    Abstract: Bayesian modeling helps applied researchers articulate assumptions about their data and develop models tailored for specific applications. Thanks to good methods for approximate posterior inference, researchers can now easily build, use, and revise complicated Bayesian models for large and rich data. These capabilities, however, bring into focus the problem of model criticism. Researchers need too…

    Submitted 15 July, 2022; v1 submitted 2 August, 2019; originally announced August 2019.

  37. arXiv:1907.05545  [pdf, other]

    cs.CL stat.ML

    The Dynamic Embedded Topic Model

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. For documents collected in sequence, dynamic topic models capture how these patterns vary over time. We develop the dynamic embedded topic model (D-ETM), a generative model of documents that combines dynamic latent Dirichlet allocation (D-LDA) and word embeddings. The D-ETM models each word with a categorical distribution par…

    Submitted 10 October, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

  38. arXiv:1907.04907  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    Topic Modeling in Embedding Spaces

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. However, existing topic models fail to learn interpretable topics when working with large and heavy-tailed vocabularies. To this end, we develop the Embedded Topic Model (ETM), a generative model of documents that marries traditional topic models with word embeddings. In particular, it models each word with a categorical dist…

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: Code can be found at https://github.com/adjidieng/ETM

  39. arXiv:1906.04072  [pdf, other]

    stat.ML cs.LG stat.ME

    A Bayesian Model of Dose-Response for Cancer Drug Studies

    Authors: Wesley Tansey, Christopher Tosh, David M. Blei

    Abstract: Exploratory cancer drug studies test multiple tumor cell lines against multiple candidate drugs. The goal in each paired (cell line, drug) experiment is to map out the dose-response curve of the cell line as the dose level of the drug increases. We propose Bayesian Tensor Filtering (BTF), a hierarchical Bayesian model for dose-response modeling in multi-sample, multi-treatment cancer drug studies.…

    Submitted 22 March, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Extended to handle covariates; additional benchmarks comparing to related work

  40. arXiv:1906.02635  [pdf, other]

    cs.LG econ.EM stat.ML

    Counterfactual Inference for Consumer Choice Across Many Product Categories

    Authors: Rob Donnelly, Francisco R. Ruiz, David Blei, Susan Athey

    Abstract: This paper proposes a method for estimating consumer preferences among discrete choices, where the consumer chooses at most one product in a category, but selects from multiple categories in parallel. The consumer's utility is additive in the different categories. Her preferences about product attributes as well as her price sensitivity vary across products and are in general correlated across pro…

    Submitted 6 August, 2023; v1 submitted 6 June, 2019; originally announced June 2019.

    Journal ref: Quantitative Marketing and Economics, volume 19, pages 369-407 (2021)

  41. arXiv:1906.02120  [pdf, other]

    stat.ML cs.LG stat.ME

    Adapting Neural Networks for the Estimation of Treatment Effects

    Authors: Claudia Shi, David M. Blei, Victor Veitch

    Abstract: This paper addresses the use of neural networks for the estimation of treatment effects from observational data. Generally, estimation proceeds in two stages. First, we fit models for the expected outcome and the probability of treatment (propensity score) for each unit. Second, we plug these fitted models into a downstream estimator of the effect. Neural networks are a natural choice for the mode…

    Submitted 17 October, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  42. arXiv:1905.12793  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Multiple Causes: A Causal Graphical View

    Authors: Yixin Wang, David M. Blei

    Abstract: Unobserved confounding is a major hurdle for causal inference from observational data. Confounders---the variables that affect both the causes and the outcome---induce spurious non-causal correlations between the two. Wang & Blei (2018) lower this hurdle with "the blessings of multiple causes," where the correlation structure of multiple causes provides indirect evidence for unobserved confounding…

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 23 pages

  43. arXiv:1905.12741  [pdf, other]

    cs.LG cs.CL stat.ML

    Adapting Text Embeddings for Causal Inference

    Authors: Victor Veitch, Dhanya Sridhar, David M. Blei

    Abstract: Does adding a theorem to a paper affect its chance of acceptance? Does labeling a post with the author's gender affect the post popularity? This paper develops a method to estimate such causal effects from observational text data, adjusting for confounding features of the text such as the subject or writing quality. We assume that the text suffices for causal adjustment but that, in practice, it i…

    Submitted 25 July, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

  44. arXiv:1905.10870  [pdf, other]

    stat.ML cs.LG

    Equal Opportunity and Affirmative Action via Counterfactual Predictions

    Authors: Yixin Wang, Dhanya Sridhar, David M. Blei

    Abstract: Machine learning (ML) can automate decision-making by learning to predict decisions from historical data. However, these predictors may inherit discriminatory policies from past decisions and reproduce unfair decisions. In this paper, we propose two algorithms that adjust fitted ML predictors to make them fair. We focus on two legal notions of fairness: (a) providing equal opportunity (EO) to indi…

    Submitted 29 May, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: 18 pages

  45. arXiv:1905.10859  [pdf, other]

    stat.ML cs.LG math.ST

    Variational Bayes under Model Misspecification

    Authors: Yixin Wang, David M. Blei

    Abstract: Variational Bayes (VB) is a scalable alternative to Markov chain Monte Carlo (MCMC) for Bayesian posterior inference. Though popular, VB comes with few theoretical guarantees, most of which focus on well-specified models. However, models are rarely well-specified in practice. In this work, we study VB under model misspecification. We prove the VB posterior is asymptotically normal and centers at t…

    Submitted 11 August, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  46. arXiv:1904.02098  [pdf, other]

    stat.ML cs.LG

    The Medical Deconfounder: Assessing Treatment Effects with Electronic Health Records

    Authors: Linying Zhang, Yixin Wang, Anna Ostropolets, Jami J. Mulgrave, David M. Blei, George Hripcsak

    Abstract: The treatment effects of medications play a key role in guiding medical prescriptions. They are usually assessed with randomized controlled trials (RCTs), which are expensive. Recently, large-scale electronic health records (EHRs) have become available, opening up new opportunities for more cost-effective assessments. However, assessing a treatment effect from EHRs is challenging: it is biased by…

    Submitted 17 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

  47. arXiv:1902.04114  [pdf, other]

    stat.ML cs.LG

    Using Embeddings to Correct for Unobserved Confounding in Networks

    Authors: Victor Veitch, Yixin Wang, David M. Blei

    Abstract: We consider causal inference in the presence of unobserved confounding. We study the case where a proxy is available for the unobserved confounding in the form of a network connecting the units. For example, the link structure of a social network carries information about its members. We show how to effectively use the proxy to do causal inference. The main idea is to reduce the causal estimation…

    Submitted 31 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: An earlier version also addressed the use of text embeddings. That material has been expanded and moved to arxiv:1905.12741, "Using Text Embeddings for Causal Inference"

  48. arXiv:1812.05691  [pdf, other]

    stat.AP

    Dose-response modeling in high-throughput cancer drug screenings: An end-to-end approach

    Authors: Wesley Tansey, Kathy Li, Haoran Zhang, Scott W. Linderman, Raul Rabadan, David M. Blei, Chris H. Wiggins

    Abstract: Personalized cancer treatments based on the molecular profile of a patient's tumor are an emerging and exciting class of treatments in oncology. As genomic tumor profiling is becoming more common, targeted treatments to specific molecular alterations are gaining traction. To discover new potential therapeutics that may apply to broad classes of tumors matching some molecular pattern, experimentali…

    Submitted 22 May, 2020; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: Added biomarker discovery testing section, among other revisions

  49. arXiv:1812.00209  [pdf, other]

    stat.ML cs.LG q-bio.QM

    A Probabilistic Model of Cardiac Physiology and Electrocardiograms

    Authors: Andrew C. Miller, Ziad Obermeyer, David M. Blei, John P. Cunningham, Sendhil Mullainathan

    Abstract: An electrocardiogram (EKG) is a common, non-invasive test that measures the electrical activity of a patient's heart. EKGs contain useful diagnostic information about patient health that may be absent from other electronic health record (EHR) data. As multi-dimensional waveforms, they could be modeled using generic machine learning tools, such as a linear factor model or a variational autoencoder.…

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/97

  50. arXiv:1811.00645  [pdf, other]

    stat.ME

    The Holdout Randomization Test for Feature Selection in Black Box Models

    Authors: Wesley Tansey, Victor Veitch, Haoran Zhang, Raul Rabadan, David M. Blei

    Abstract: We propose the holdout randomization test (HRT), an approach to feature selection using black box predictive models. The HRT is a specialized version of the conditional randomization test (CRT; Candes et al., 2018) that uses data splitting for feasible computation. The HRT works with any predictive model and produces a valid $p$-value for each feature. To make the HRT more practical, we propose a…

    Submitted 22 March, 2021; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: New algorithms and simulations; accepted for publication at JCGS