Skip to main content

Showing 1–49 of 49 results for author: Ranganath, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.01210  [pdf, other

    stat.ME cs.LG stat.ML

    When accurate prediction models yield harmful self-fulfilling prophecies

    Authors: Wouter A. C. van Amsterdam, Nan van Geloven, Jesse H. Krijthe, Rajesh Ranganath, Giovanni Ciná

    Abstract: Objective: Prediction models are popular in medical research and practice. By predicting an outcome of interest for specific patients, these models may help inform difficult treatment decisions, and are often hailed as the poster children for personalized, data-driven healthcare. Many prediction models are deployed for decision support based on their prediction accuracy in validation studies. We i… ▽ More

    Submitted 8 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  2. arXiv:2310.03725  [pdf, other

    cs.LG stat.ML

    Stochastic interpolants with data-dependent couplings

    Authors: Michael S. Albergo, Mark Goldstein, Nicholas M. Boffi, Rajesh Ranganath, Eric Vanden-Eijnden

    Abstract: Generative models inspired by dynamical transport of measure -- such as flows and diffusions -- construct a continuous-time map between two probability densities. Conventionally, one of these is the target density, only accessible through samples, while the other is taken as a simple base density that is data-agnostic. In this work, using the framework of stochastic interpolants, we formalize how… ▽ More

    Submitted 15 December, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2308.12553  [pdf, other

    cs.LG stat.ML

    Don't blame Dataset Shift! Shortcut Learning due to Gradients and Cross Entropy

    Authors: Aahlad Puli, Lily Zhang, Yoav Wald, Rajesh Ranganath

    Abstract: Common explanations for shortcut learning assume that the shortcut improves prediction under the training distribution but not in the test distribution. Thus, models trained via the typical gradient-based optimization of cross-entropy, which we call default-ERM, utilize the shortcut. However, even when the stable feature determines the label in the training distribution and the shortcut does not p… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  4. arXiv:2306.01196  [pdf, other

    cs.LG cs.AI stat.ML

    An Effective Meaningful Way to Evaluate Survival Models

    Authors: Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner

    Abstract: One straightforward metric to evaluate a survival prediction model is based on the Mean Absolute Error (MAE) -- the average of the absolute difference between the time predicted by the model and the true event time, over all subjects. Unfortunately, this is challenging because, in practice, the test set includes (right) censored individuals, meaning we do not know when a censored individual actual… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  5. arXiv:2302.07261  [pdf, other

    cs.LG stat.ML

    Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions

    Authors: Raghav Singhal, Mark Goldstein, Rajesh Ranganath

    Abstract: Diffusion-based generative models (DBGMs) perturb data to a target noise distribution and reverse this process to generate samples. The choice of noising process, or inference diffusion process, affects both likelihoods and sample quality. For example, extending the inference process with auxiliary variables leads to improved sample quality. While there are many such multivariate diffusions to exp… ▽ More

    Submitted 3 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  6. arXiv:2209.07397  [pdf, other

    cs.LG cs.CY stat.ML

    From algorithms to action: improving patient care requires causality

    Authors: Wouter A. C. van Amsterdam, Pim A. de Jong, Joost J. C. Verhoeff, Tim Leiner, Rajesh Ranganath

    Abstract: In cancer research there is much interest in building and validating outcome predicting outcomes to support treatment decisions. However, because most outcome prediction models are developed and validated without regard to the causal aspects of treatment decision making, many published outcome prediction models may cause harm when used for decision making, despite being found accurate in validatio… ▽ More

    Submitted 1 April, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Journal ref: BMC Medical Informatics and Decision Making, 24(1), 2024

  7. arXiv:2208.10759  [pdf, other

    cs.LG stat.ML

    Survival Mixture Density Networks

    Authors: Xintian Han, Mark Goldstein, Rajesh Ranganath

    Abstract: Survival analysis, the art of time-to-event modeling, plays an important role in clinical treatment decisions. Recently, continuous time models built from neural ODEs have been proposed for survival analysis. However, the training of neural ODEs is slow due to the high computational complexity of neural ODE solvers. Here, we propose an efficient alternative for flexible continuous time models, cal… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Machine Learning for Healthcare 2022

  8. arXiv:2208.08579  [pdf, other

    stat.ME cs.LG stat.ML

    DIET: Conditional independence testing with marginal dependence measures of residual information

    Authors: Mukund Sudarshan, Aahlad Manas Puli, Wesley Tansey, Rajesh Ranganath

    Abstract: Conditional randomization tests (CRTs) assess whether a variable $x$ is predictive of another variable $y$, having observed covariates $z$. CRTs require fitting a large number of predictive models, which is often computationally intractable. Existing solutions to reduce the cost of CRTs typically split the dataset into a train and test portion, or rely on heuristics for interactions, both of which… ▽ More

    Submitted 11 April, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  9. Conditional average treatment effect estimation with marginally constrained models

    Authors: Wouter A. C. van Amsterdam, Rajesh Ranganath

    Abstract: Treatment effect estimates are often available from randomized controlled trials as a single average treatment effect for a certain patient population. Estimates of the conditional average treatment effect (CATE) are more useful for individualized treatment decision making, but randomized trials are often too small to estimate the CATE. Examples in medical literature make use of the relative treat… ▽ More

    Submitted 23 July, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: accepted for publication in Journal of Causal Inference, 2023

  10. arXiv:2112.00950  [pdf, other

    cs.LG stat.ML

    Quantile Filtered Imitation Learning

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: We introduce quantile filtered imitation learning (QFIL), a novel policy improvement operator designed for offline reinforcement learning. QFIL performs policy improvement by running imitation learning on a filtered version of the offline dataset. The filtering process removes $ s,a $ pairs whose estimated Q values fall below a given quantile of the pushforward distribution over values induced by… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2021

  11. arXiv:2112.00881  [pdf, other

    cs.LG stat.ML

    Learning Invariant Representations with Missing Data

    Authors: Mark Goldstein, Jörn-Henrik Jacobsen, Olina Chau, Adriel Saporta, Aahlad Puli, Rajesh Ranganath, Andrew C. Miller

    Abstract: Spurious correlations allow flexible models to predict well during training but poorly on related test distributions. Recent work has shown that models that satisfy particular independencies involving correlation-inducing \textit{nuisance} variables have guarantees on their test performance. Enforcing such independencies requires nuisances to be observed during training. However, nuisances, such a… ▽ More

    Submitted 8 June, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: CLeaR (Causal Learning and Reasoning) 2022

  12. arXiv:2111.08175  [pdf, other

    cs.LG stat.ML

    Inverse-Weighted Survival Games

    Authors: Xintian Han, Mark Goldstein, Aahlad Puli, Thomas Wies, Adler J Perotte, Rajesh Ranganath

    Abstract: Deep models trained through maximum likelihood have achieved state-of-the-art results for survival analysis. Despite this training scheme, practitioners evaluate models under other criteria, such as binary classification losses at a chosen set of time horizons, e.g. Brier score (BS) and Bernoulli log likelihood (BLL). Models trained with maximum likelihood may have poor BS or BLL since maximum lik… ▽ More

    Submitted 31 January, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Neurips 2021

  13. arXiv:2107.07436  [pdf, other

    stat.ML cs.CV cs.LG

    FastSHAP: Real-Time Shapley Value Estimation

    Authors: Neil Jethani, Mukund Sudarshan, Ian Covert, Su-In Lee, Rajesh Ranganath

    Abstract: Shapley values are widely used to explain black-box models, but they are costly to calculate because they require many model evaluations. We introduce FastSHAP, a method for estimating Shapley values in a single forward pass using a learned explainer model. FastSHAP amortizes the cost of explaining many inputs via a learning approach inspired by the Shapley value's weighted least squares character… ▽ More

    Submitted 22 March, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: ICLR 2022 Camera Ready, 20 pages, 10 figures, 3 tables

  14. arXiv:2107.00520  [pdf, other

    cs.LG stat.ML

    Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations

    Authors: Aahlad Puli, Lily H. Zhang, Eric K. Oermann, Rajesh Ranganath

    Abstract: In many prediction problems, spurious correlations are induced by a changing relationship between the label and a nuisance variable that is also correlated with the covariates. For example, in classifying animals in natural images, the background, which is a nuisance, can predict the type of animal. This nuisance-label relationship does not always hold, and the performance of a model trained under… ▽ More

    Submitted 12 February, 2023; v1 submitted 29 June, 2021; originally announced July 2021.

  15. arXiv:2106.08909  [pdf, other

    cs.LG stat.ML

    Offline RL Without Off-Policy Evaluation

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: Most prior approaches to offline reinforcement learning (RL) have taken an iterative actor-critic approach involving off-policy evaluation. In this paper we show that simply doing one step of constrained/regularized policy improvement using an on-policy Q estimate of the behavior policy performs surprisingly well. This one-step algorithm beats the previously reported results of iterative algorithm… ▽ More

    Submitted 3 December, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Thirty-fifth Conference on Neural Information Processing Systems, 2021

  16. arXiv:2103.01890  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Have We Learned to Explain?: How Interpretability Methods Can Learn to Encode Predictions in their Interpretations

    Authors: Neil Jethani, Mukund Sudarshan, Yindalon Aphinyanaphongs, Rajesh Ranganath

    Abstract: While the need for interpretable machine learning has been established, many common approaches are slow, lack fidelity, or hard to evaluate. Amortized explanation methods reduce the cost of providing interpretations by learning a global selector model that returns feature importances for a single instance of data. The selector model is trained to optimize the fidelity of the interpretations, as ev… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 15 pages, 3 figures, Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  17. arXiv:2102.08533  [pdf, other

    stat.ME cs.LG stat.ML

    Causal Estimation with Functional Confounders

    Authors: Aahlad Puli, Adler J. Perotte, Rajesh Ranganath

    Abstract: Causal inference relies on two fundamental assumptions: ignorability and positivity. We study causal inference when the true confounder value can be expressed as a function of the observed data; we call this setting estimation with functional confounders (EFC). In this setting, ignorability is satisfied, however positivity is violated, and causal inference is impossible in general. We consider two… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: 17 pages, 7 figures, 2 tables

  18. arXiv:2101.05346  [pdf, other

    cs.LG stat.ML

    X-CAL: Explicit Calibration for Survival Analysis

    Authors: Mark Goldstein, Xintian Han, Aahlad Puli, Adler J. Perotte, Rajesh Ranganath

    Abstract: Survival analysis models the distribution of time until an event of interest, such as discharge from the hospital or admission to the ICU. When a model's predicted number of events within any time interval is similar to the observed number, it is called well-calibrated. A survival model's calibration can be measured using, for instance, distributional calibration (D-CALIBRATION) [Haider et al., 20… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  19. arXiv:2009.11087  [pdf, other

    stat.ML cs.CY cs.LG

    Probabilistic Machine Learning for Healthcare

    Authors: Irene Y. Chen, Shalmali Joshi, Marzyeh Ghassemi, Rajesh Ranganath

    Abstract: Machine learning can be used to make sense of healthcare data. Probabilistic machine learning models help provide a complete picture of observed data in healthcare. In this review, we examine how probabilistic machine learning can advance healthcare. We consider challenges in the predictive model building pipeline where probabilistic models can be beneficial including calibration and missing data.… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: Annual Reviews of Biomedical Data Science 2021

  20. arXiv:2007.15835  [pdf, other

    stat.ML cs.LG stat.ME

    Deep Direct Likelihood Knockoffs

    Authors: Mukund Sudarshan, Wesley Tansey, Rajesh Ranganath

    Abstract: Predictive modeling often uses black box machine learning methods, such as deep neural networks, to achieve state-of-the-art performance. In scientific domains, the scientist often wishes to discover which features are actually important for making the predictions. These discoveries may lead to costly follow-up experiments and as such it is important that the error rate on discoveries is not too h… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

  21. arXiv:2006.15368  [pdf, other

    cs.LG stat.ML

    Offline Contextual Bandits with Overparameterized Models

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: Recent results in supervised learning suggest that while overparameterized models have the capacity to overfit, they in fact generalize quite well. We ask whether the same phenomenon occurs for offline contextual bandits. Our results are mixed. Value-based algorithms benefit from the same generalization behavior as overparameterized supervised learning, but policy-based algorithms do not. We show… ▽ More

    Submitted 16 June, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  22. arXiv:2001.03115  [pdf, other

    cs.LG stat.ML

    The Counterfactual $χ$-GAN

    Authors: Amelia J. Averitt, Natnicha Vanitchanant, Rajesh Ranganath, Adler J. Perotte

    Abstract: Causal inference often relies on the counterfactual framework, which requires that treatment assignment is independent of the outcome, known as strong ignorability. Approaches to enforcing strong ignorability in causal analyses of observational data include weighting and matching methods. Effect estimates, such as the average treatment effect (ATE), are then estimated as expectations under the rew… ▽ More

    Submitted 3 December, 2020; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: 9 pages; 3 figures; See peer-reviewed work at Journal of Biomedical Informatics

    Journal ref: JBI. 2020. PMID: 32771540

  23. arXiv:1910.14265  [pdf, other

    cs.LG stat.ML

    Energy-Inspired Models: Learning with Sampler-Induced Distributions

    Authors: Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

    Abstract: Energy-based models (EBMs) are powerful probabilistic models, but suffer from intractable sampling and density evaluation due to the partition function. As a result, inference in EBMs relies on approximate sampling algorithms, leading to a mismatch between the model and inference. Motivated by this, we consider the sampler-induced distribution as the model of interest and maximize the likelihood o… ▽ More

    Submitted 9 January, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Presented at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  24. Population Predictive Checks

    Authors: Gemma E. Moran, David M. Blei, Rajesh Ranganath

    Abstract: Bayesian modeling helps applied researchers articulate assumptions about their data and develop models tailored for specific applications. Thanks to good methods for approximate posterior inference, researchers can now easily build, use, and revise complicated Bayesian models for large and rich data. These capabilities, however, bring into focus the problem of model criticism. Researchers need too… ▽ More

    Submitted 15 July, 2022; v1 submitted 2 August, 2019; originally announced August 2019.

  25. arXiv:1907.03451  [pdf, other

    cs.LG stat.ML

    General Control Functions for Causal Effect Estimation from Instrumental Variables

    Authors: Aahlad Manas Puli, Rajesh Ranganath

    Abstract: Causal effect estimation relies on separating the variation in the outcome into parts due to the treatment and due to the confounders. To achieve this separation, practitioners often use external sources of randomness that only influence the treatment called instrumental variables (IVs). We study variables constructed from treatment and IV that help estimate effects, called control functions. We c… ▽ More

    Submitted 2 February, 2021; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 24 pages, 6 figures

  26. arXiv:1907.01463  [pdf, other

    cs.LG cs.CY stat.ML

    Reproducibility in Machine Learning for Health

    Authors: Matthew B. A. McDermott, Shirly Wang, Nikki Marinsek, Rajesh Ranganath, Marzyeh Ghassemi, Luca Foschini

    Abstract: Machine learning algorithms designed to characterize, monitor, and intervene on human health (ML4H) are expected to perform safely and reliably when operating at scale, potentially outside strict human supervision. This requirement warrants a stricter attention to issues of reproducibility than other fields of machine learning. In this work, we conduct a systematic evaluation of over 100 recentl… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: Presented at the ICLR 2019 Reproducibility in Machine Learning Workshop

  27. arXiv:1905.05163  [pdf, other

    eess.SP cs.CR cs.LG stat.ML

    Adversarial Examples for Electrocardiograms

    Authors: Xintian Han, Yuxuan Hu, Luca Foschini, Larry Chinitz, Lior Jankelson, Rajesh Ranganath

    Abstract: In recent years, the electrocardiogram (ECG) has seen a large diffusion in both medical and commercial applications, fueled by the rise of single-lead versions. Single-lead ECG can be embedded in medical devices and wearable products such as the injectable Medtronic Linq monitor, the iRhythm Ziopatch wearable monitor, and the Apple Watch Series 4. Recently, deep neural networks have been used to a… ▽ More

    Submitted 4 June, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

  28. arXiv:1904.04478  [pdf, other

    stat.ML cs.LG

    Kernelized Complete Conditional Stein Discrepancy

    Authors: Raghav Singhal, Xintian Han, Saad Lahlou, Rajesh Ranganath

    Abstract: Much of machine learning relies on comparing distributions with discrepancy measures. Stein's method creates discrepancy measures between two distributions that require only the unnormalized density of one and samples from the other. Stein discrepancies can be combined with kernels to define kernelized Stein discrepancies (KSDs). While kernels make Stein discrepancies tractable, they pose several… ▽ More

    Submitted 17 July, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

  29. arXiv:1903.03448  [pdf, other

    stat.ML cs.LG

    Support and Invertibility in Domain-Invariant Representations

    Authors: Fredrik D. Johansson, David Sontag, Rajesh Ranganath

    Abstract: Learning domain-invariant representations has become a popular approach to unsupervised domain adaptation and is often justified by invoking a particular suite of theoretical results. We argue that there are two significant flaws in such arguments. First, the results in question hold only for a fixed representation and do not account for information lost in non-invertible transformations. Second,… ▽ More

    Submitted 3 July, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

  30. arXiv:1903.02984  [pdf, other

    cs.LG stat.ML

    The Variational Predictive Natural Gradient

    Authors: Da Tang, Rajesh Ranganath

    Abstract: Variational inference transforms posterior inference into parametric optimization thereby enabling the use of latent variable models where otherwise impractical. However, variational inference can be finicky when different variational parameters control variables that are strongly correlated under the model. Traditional natural gradients based on the variational approximation fail to correct for c… ▽ More

    Submitted 29 November, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: International Conference on Machine Learning (ICML), 2019

  31. arXiv:1806.00388  [pdf

    cs.LG cs.CY stat.ML

    A Review of Challenges and Opportunities in Machine Learning for Health

    Authors: Marzyeh Ghassemi, Tristan Naumann, Peter Schulam, Andrew L. Beam, Irene Y. Chen, Rajesh Ranganath

    Abstract: Modern electronic health records (EHRs) provide data to answer clinically meaningful questions. The growing data in EHRs makes healthcare ripe for the use of machine learning. However, learning in a clinical setting presents unique challenges that complicate the use of common machine learning methodologies. For example, diseases in EHRs are poorly labeled, conditions can encompass multiple underly… ▽ More

    Submitted 5 December, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Updated version

  32. arXiv:1805.08273  [pdf, other

    stat.ML cs.LG

    Multiple Causal Inference with Latent Confounding

    Authors: Rajesh Ranganath, Adler Perotte

    Abstract: Causal inference from observational data requires assumptions. These assumptions range from measuring confounders to identifying instruments. Traditionally, causal inference assumptions have focused on estimation of effects for a single treatment. In this work, we construct techniques for estimation with multiple treatments in the presence of unobserved confounding. We develop two assumptions base… ▽ More

    Submitted 1 March, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

  33. arXiv:1805.01500  [pdf, other

    stat.ML cs.LG stat.ME

    Noisin: Unbiased Regularization for Recurrent Neural Networks

    Authors: Adji B. Dieng, Rajesh Ranganath, Jaan Altosaar, David M. Blei

    Abstract: Recurrent neural networks (RNNs) are powerful models of sequential data. They have been successfully used in domains such as text and speech. However, RNNs are susceptible to overfitting; regularization is important. In this paper we develop Noisin, a new method for regularizing RNNs. Noisin injects random noise into the hidden states of the RNN and then maximizes the corresponding marginal likeli… ▽ More

    Submitted 12 July, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the International Conference on Machine Learning, 2018

  34. arXiv:1705.11140  [pdf, other

    stat.ML stat.CO stat.ME

    Variational Sequential Monte Carlo

    Authors: Christian A. Naesseth, Scott W. Linderman, Rajesh Ranganath, David M. Blei

    Abstract: Many recent advances in large scale probabilistic inference rely on variational methods. The success of variational approaches depends on (i) formulating a flexible parametric family of distributions, and (ii) optimizing the parameters to find the member of this family that most closely approximates the exact posterior. In this paper we present a new approximating family of distributions, the vari… ▽ More

    Submitted 21 February, 2018; v1 submitted 31 May, 2017; originally announced May 2017.

  35. arXiv:1705.08931  [pdf, other

    stat.ML cs.LG stat.CO

    Proximity Variational Inference

    Authors: Jaan Altosaar, Rajesh Ranganath, David M. Blei

    Abstract: Variational inference is a powerful approach for approximate posterior inference. However, it is sensitive to initialization and can be subject to poor local optima. In this paper, we develop proximity variational inference (PVI). PVI is a new method for optimizing the variational objective that constrains subsequent iterates of the variational parameters to robustify the optimization path. Conseq… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.

    MSC Class: 68T10 ACM Class: G.3; I.5.0; I.5.1

  36. arXiv:1702.08896  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Hierarchical Implicit Models and Likelihood-Free Variational Inference

    Authors: Dustin Tran, Rajesh Ranganath, David M. Blei

    Abstract: Implicit probabilistic models are a flexible class of models defined by a simulation process for data. They form the basis for theories which encompass our understanding of the physical world. Despite this fundamental nature, the use of implicit models remains limited due to challenges in specifying complex latent structure in them, and in performing inferences in such models with large data sets.… ▽ More

    Submitted 4 November, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: Appears in Neural Information Processing Systems, 2017

  37. arXiv:1611.00328  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Variational Inference via $χ$-Upper Bound Minimization

    Authors: Adji B. Dieng, Dustin Tran, Rajesh Ranganath, John Paisley, David M. Blei

    Abstract: Variational inference (VI) is widely used as an efficient alternative to Markov chain Monte Carlo. It posits a family of approximating distributions $q$ and finds the closest member to the exact posterior $p$. Closeness is usually measured via a divergence $D(q || p)$ from $q$ to $p$. While successful, this approach also has problems. Notably, it typically leads to underestimation of the posterior… ▽ More

    Submitted 12 November, 2017; v1 submitted 1 November, 2016; originally announced November 2016.

    Comments: Neural Information Processing Systems, 2017

  38. arXiv:1610.09033  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Operator Variational Inference

    Authors: Rajesh Ranganath, Jaan Altosaar, Dustin Tran, David M. Blei

    Abstract: Variational inference is an umbrella term for algorithms which cast Bayesian inference as optimization. Classically, variational inference uses the Kullback-Leibler divergence to define the optimization. Though this divergence has been widely used, the resultant posterior approximation can suffer from undesirable statistical properties. To address this, we reexamine variational inference from its… ▽ More

    Submitted 14 March, 2018; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: Appears in Neural Information Processing Systems, 2016

  39. arXiv:1608.02158  [pdf, other

    stat.ML cs.AI stat.ME

    Deep Survival Analysis

    Authors: Rajesh Ranganath, Adler Perotte, Noémie Elhadad, David Blei

    Abstract: The electronic health record (EHR) provides an unprecedented opportunity to build actionable tools to support physicians at the point of care. In this paper, we investigate survival analysis in the context of EHR data. We introduce deep survival analysis, a hierarchical generative approach to survival analysis. It departs from previous approaches in two primary ways: (1) all observations, includin… ▽ More

    Submitted 18 September, 2016; v1 submitted 6 August, 2016; originally announced August 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

  40. arXiv:1603.00788  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    Automatic Differentiation Variational Inference

    Authors: Alp Kucukelbir, Dustin Tran, Rajesh Ranganath, Andrew Gelman, David M. Blei

    Abstract: Probabilistic modeling is iterative. A scientist posits a simple model, fits it to her data, refines it according to her analysis, and repeats. However, fitting complex models to large data is a bottleneck in this process. Deriving algorithms for new models can be both mathematically and computationally challenging, which makes it difficult to efficiently cycle through the steps. To this end, we d… ▽ More

    Submitted 2 March, 2016; originally announced March 2016.

  41. arXiv:1511.06499  [pdf, other

    stat.ML cs.LG cs.NE stat.CO

    The Variational Gaussian Process

    Authors: Dustin Tran, Rajesh Ranganath, David M. Blei

    Abstract: Variational inference is a powerful tool for approximate inference, and it has been recently applied for representation learning with deep generative models. We develop the variational Gaussian process (VGP), a Bayesian nonparametric variational family, which adapts its shape to match complex posterior distributions. The VGP generates approximate posterior samples by generating latent inputs and w… ▽ More

    Submitted 17 April, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: Appears in International Conference on Learning Representations, 2016

  42. arXiv:1511.02386  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Hierarchical Variational Models

    Authors: Rajesh Ranganath, Dustin Tran, David M. Blei

    Abstract: Black box variational inference allows researchers to easily prototype and evaluate an array of models. Recent advances allow such algorithms to scale to high dimensions. However, a central question remains: How to specify an expressive variational distribution that maintains efficient computation? To address this, we develop hierarchical variational models (HVMs). HVMs augment a variational appro… ▽ More

    Submitted 30 May, 2016; v1 submitted 7 November, 2015; originally announced November 2015.

    Comments: Appears in International Conference on Machine Learning, 2016

  43. arXiv:1509.04640  [pdf, other

    cs.LG cs.IR stat.ML

    Dynamic Poisson Factorization

    Authors: Laurent Charlin, Rajesh Ranganath, James McInerney, David M. Blei

    Abstract: Models for recommender systems use latent factors to explain the preferences and behaviors of users with respect to a set of items (e.g., movies, books, academic papers). Typically, the latent factors are assumed to be static and, given these factors, the observed preferences and behaviors of users are assumed to be generated without order. These assumptions limit the explorative and predictive ca… ▽ More

    Submitted 15 September, 2015; originally announced September 2015.

    Comments: RecSys 2015

  44. arXiv:1507.05253  [pdf, other

    stat.ML

    The Population Posterior and Bayesian Inference on Streams

    Authors: James McInerney, Rajesh Ranganath, David M. Blei

    Abstract: Many modern data analysis problems involve inferences from streaming data. However, streaming data is not easily amenable to the standard probabilistic modeling approaches, which assume that we condition on finite data. We develop population variational Bayes, a new approach for using Bayesian modeling to analyze streams of data. It approximates a new type of distribution, the population posterior… ▽ More

    Submitted 21 July, 2015; v1 submitted 19 July, 2015; originally announced July 2015.

  45. arXiv:1507.00720  [pdf, other

    stat.ML stat.ME

    Correlated Random Measures

    Authors: Rajesh Ranganath, David Blei

    Abstract: We develop correlated random measures, random measures where the atom weights can exhibit a flexible pattern of dependence, and use them to develop powerful hierarchical Bayesian nonparametric models. Hierarchical Bayesian nonparametric models are usually built from completely random measures, a Poisson-process based construction in which the atom weights are independent. Completely random measure… ▽ More

    Submitted 9 November, 2016; v1 submitted 2 July, 2015; originally announced July 2015.

  46. arXiv:1506.03431  [pdf, other

    stat.ML

    Automatic Variational Inference in Stan

    Authors: Alp Kucukelbir, Rajesh Ranganath, Andrew Gelman, David M. Blei

    Abstract: Variational inference is a scalable technique for approximate Bayesian inference. Deriving variational inference algorithms requires tedious model-specific calculations; this makes it difficult to automate. We propose an automatic variational inference algorithm, automatic differentiation variational inference (ADVI). The user only provides a Bayesian model and a dataset; nothing else. We make no… ▽ More

    Submitted 12 June, 2015; v1 submitted 10 June, 2015; originally announced June 2015.

  47. arXiv:1411.2581  [pdf, other

    stat.ML cs.LG

    Deep Exponential Families

    Authors: Rajesh Ranganath, Linpeng Tang, Laurent Charlin, David M. Blei

    Abstract: We describe \textit{deep exponential families} (DEFs), a class of latent variable models that are inspired by the hidden structures used in deep neural networks. DEFs capture a hierarchy of dependencies between latent variables, and are easily generalized to many settings through exponential families. We perform inference using recent "black box" variational inference techniques. We then evaluate… ▽ More

    Submitted 10 November, 2014; originally announced November 2014.

  48. arXiv:1411.1810  [pdf, other

    stat.ML cs.LG

    Variational Tempering

    Authors: Stephan Mandt, James McInerney, Farhan Abrol, Rajesh Ranganath, David Blei

    Abstract: Variational inference (VI) combined with data subsampling enables approximate posterior inference over large data sets, but suffers from poor local optima. We first formulate a deterministic annealing approach for the generic class of conditionally conjugate exponential family models. This approach uses a decreasing temperature parameter which deterministically deforms the objective during the cou… ▽ More

    Submitted 28 May, 2016; v1 submitted 6 November, 2014; originally announced November 2014.

    Comments: published version, 8 pages, 4 figures

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS 2016), pages 704-712

  49. arXiv:1401.0118  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Black Box Variational Inference

    Authors: Rajesh Ranganath, Sean Gerrish, David M. Blei

    Abstract: Variational inference has become a widely used method to approximate posteriors in complex latent variables models. However, deriving a variational inference algorithm generally requires significant model-specific analysis, and these efforts can hinder and deter us from quickly develo** and exploring a variety of models for a problem at hand. In this paper, we present a "black box" variational i… ▽ More

    Submitted 31 December, 2013; originally announced January 2014.