Search | arXiv e-print repository

Active Learning for Fair and Stable Online Allocations

Authors: Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani

Abstract: We explore an active learning approach for dynamic fair resource allocation problems. Unlike previous work that assumes full feedback from all agents on their allocations, we consider feedback from a select subset of agents at each epoch of the online resource allocation process. Despite this restriction, our proposed algorithms provide regret bounds that are sub-linear in number of time-periods f… ▽ More We explore an active learning approach for dynamic fair resource allocation problems. Unlike previous work that assumes full feedback from all agents on their allocations, we consider feedback from a select subset of agents at each epoch of the online resource allocation process. Despite this restriction, our proposed algorithms provide regret bounds that are sub-linear in number of time-periods for various measures that include fairness metrics commonly used in resource allocation problems and stability considerations in matching mechanisms. The key insight of our algorithms lies in adaptively identifying the most informative feedback using dueling upper and lower confidence bounds. With this strategy, we show that efficient decision-making does not require extensive feedback and produces efficient outcomes for a variety of problem classes. △ Less

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2403.02996 [pdf, ps, other]

A Convex Optimization Framework for Computing Robustness Margins of Kalman Filters

Authors: Himanshu Prabhat, Raktim Bhattacharya

Abstract: This paper proposes a novel convex optimization framework for designing robust Kalman filters that guarantee a user-specified steady-state error while maximizing process and sensor noise. The proposed framework simultaneously determines the Kalman gain and the robustness margin in terms of the process and sensor noise. This is the first paper to present such a joint formulation for Kalman filterin… ▽ More This paper proposes a novel convex optimization framework for designing robust Kalman filters that guarantee a user-specified steady-state error while maximizing process and sensor noise. The proposed framework simultaneously determines the Kalman gain and the robustness margin in terms of the process and sensor noise. This is the first paper to present such a joint formulation for Kalman filtering. The proposed methodology is validated through two distinct examples: the Clohessy-Wiltshire-Hill equations for a chaser spacecraft in an elliptical orbit and the longitudinal motion model of an F-16 aircraft. △ Less

Submitted 5 March, 2024; originally announced March 2024.

arXiv:2402.17699 [pdf, other]

Gradient-based Discrete Sampling with Automatic Cyclical Scheduling

Authors: Patrick Pynadath, Riddhiman Bhattacharya, Arun Hariharan, Ruqi Zhang

Abstract: Discrete distributions, particularly in high-dimensional deep models, are often highly multimodal due to inherent discontinuities. While gradient-based discrete sampling has proven effective, it is susceptible to becoming trapped in local modes due to the gradient information. To tackle this challenge, we propose an automatic cyclical scheduling, designed for efficient and accurate sampling in mul… ▽ More Discrete distributions, particularly in high-dimensional deep models, are often highly multimodal due to inherent discontinuities. While gradient-based discrete sampling has proven effective, it is susceptible to becoming trapped in local modes due to the gradient information. To tackle this challenge, we propose an automatic cyclical scheduling, designed for efficient and accurate sampling in multimodal discrete distributions. Our method contains three key components: (1) a cyclical step size schedule where large steps discover new modes and small steps exploit each mode; (2) a cyclical balancing schedule, ensuring ``balanced" proposals for given step sizes and high efficiency of the Markov chain; and (3) an automatic tuning scheme for adjusting the hyperparameters in the cyclical schedules, allowing adaptability across diverse datasets with minimal tuning. We prove the non-asymptotic convergence and inference guarantee for our method in general discrete distributions. Extensive experiments demonstrate the superiority of our method in sampling complex multimodal discrete distributions. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2401.06687 [pdf, other]

Proximal Causal Inference With Text Data

Authors: Jacob M. Chen, Rohit Bhattacharya, Katherine A. Keith

Abstract: Recent text-based causal methods attempt to mitigate confounding bias by estimating proxies of confounding variables that are partially or imperfectly measured from unstructured text data. These approaches, however, assume analysts have supervised labels of the confounders given text for a subset of instances, a constraint that is sometimes infeasible due to data privacy or annotation costs. In th… ▽ More Recent text-based causal methods attempt to mitigate confounding bias by estimating proxies of confounding variables that are partially or imperfectly measured from unstructured text data. These approaches, however, assume analysts have supervised labels of the confounders given text for a subset of instances, a constraint that is sometimes infeasible due to data privacy or annotation costs. In this work, we address settings in which an important confounding variable is completely unobserved. We propose a new causal inference method that uses multiple instances of pre-treatment text data, infers two proxies from two zero-shot models on the separate instances, and applies these proxies in the proximal g-formula. We prove that our text-based proxy method satisfies identification conditions required by the proximal g-formula while other seemingly reasonable proposals do not. We evaluate our method in synthetic and semi-synthetic settings and find that it produces estimates with low bias. To address untestable assumptions associated with the proximal g-formula, we further propose an odds ratio falsification heuristic. This new combination of proximal causal inference and zero-shot classifiers expands the set of text-specific causal methods available to practitioners. △ Less

Submitted 21 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

Comments: 26 pages

arXiv:2310.10393 [pdf, other]

Statistical and Causal Robustness for Causal Null Hypothesis Tests

Authors: Junhui Yang, Rohit Bhattacharya, You** Lee, Ted Westling

Abstract: Prior work applying semiparametric theory to causal inference has primarily focused on deriving estimators that exhibit statistical robustness under a prespecified causal model that permits identification of a desired causal parameter. However, a fundamental challenge is correct specification of such a model, which usually involves making untestable assumptions. Evidence factors is an approach to… ▽ More Prior work applying semiparametric theory to causal inference has primarily focused on deriving estimators that exhibit statistical robustness under a prespecified causal model that permits identification of a desired causal parameter. However, a fundamental challenge is correct specification of such a model, which usually involves making untestable assumptions. Evidence factors is an approach to combining hypothesis tests of a common causal null hypothesis under two or more candidate causal models. Under certain conditions, this yields a test that is valid if at least one of the underlying models is correct, which is a form of causal robustness. We propose a method of combining semiparametric theory with evidence factors. We develop a causal null hypothesis test based on joint asymptotic normality of K asymptotically linear semiparametric estimators, where each estimator is based on a distinct identifying functional derived from each of K candidate causal models. We show that this test provides both statistical and causal robustness in the sense that it is valid if at least one of the K proposed causal models is correct, while also allowing for slower than parametric rates of convergence in estimating nuisance functions. We demonstrate the effectiveness of our method via simulations and applications to the Framingham Heart Study and Wisconsin Longitudinal Study. △ Less

Submitted 29 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

arXiv:2310.07542 [pdf, other]

Fast Sampling and Inference via Preconditioned Langevin Dynamics

Authors: Riddhiman Bhattacharya, Tiefeng Jiang

Abstract: Sampling from distributions play a crucial role in aiding practitioners with statistical inference. However, in numerous situations, obtaining exact samples from complex distributions is infeasible. Consequently, researchers often turn to approximate sampling techniques to address this challenge. Fast approximate sampling from complicated distributions has gained much traction in the last few year… ▽ More Sampling from distributions play a crucial role in aiding practitioners with statistical inference. However, in numerous situations, obtaining exact samples from complex distributions is infeasible. Consequently, researchers often turn to approximate sampling techniques to address this challenge. Fast approximate sampling from complicated distributions has gained much traction in the last few years with considerable progress in this field. Previous work has shown that for some problems a preconditioning can make the algorithm faster. In our research, we explore the Langevin Monte Carlo (LMC) algorithm and demonstrate its effectiveness in enabling inference from the obtained samples. Additionally, we establish a convergence rate for the LMC Markov chain in total variation. Lastly, we derive non-asymptotic bounds for approximate sampling from specific target distributions in the Wasserstein distance, particularly when the preconditioning is spatially invariant. △ Less

Submitted 29 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2307.15176 [pdf, other]

RCT Rejection Sampling for Causal Estimation Evaluation

Authors: Katherine A. Keith, Sergey Feldman, David Jurgens, Jonathan Bragg, Rohit Bhattacharya

Abstract: Confounding is a significant obstacle to unbiased estimation of causal effects from observational data. For settings with high-dimensional covariates -- such as text data, genomics, or the behavioral social sciences -- researchers have proposed methods to adjust for confounding by adapting machine learning methods to the goal of causal estimation. However, empirical evaluation of these adjustment… ▽ More Confounding is a significant obstacle to unbiased estimation of causal effects from observational data. For settings with high-dimensional covariates -- such as text data, genomics, or the behavioral social sciences -- researchers have proposed methods to adjust for confounding by adapting machine learning methods to the goal of causal estimation. However, empirical evaluation of these adjustment methods has been challenging and limited. In this work, we build on a promising empirical evaluation strategy that simplifies evaluation design and uses real data: subsampling randomized controlled trials (RCTs) to create confounded observational datasets while using the average causal effects from the RCTs as ground-truth. We contribute a new sampling algorithm, which we call RCT rejection sampling, and provide theoretical guarantees that causal identification holds in the observational data to allow for valid comparisons to the ground-truth RCT. Using synthetic data, we show our algorithm indeed results in low bias when oracle estimators are evaluated on the confounded samples, which is not always the case for a previously proposed algorithm. In addition to this identification result, we highlight several finite data considerations for evaluation designers who plan to use RCT rejection sampling on their own datasets. As a proof of concept, we implement an example evaluation pipeline and walk through these finite data considerations with a novel, real-world RCT -- which we release publicly -- consisting of approximately 70k observations and text data as high-dimensional covariates. Together, these contributions build towards a broader agenda of improved empirical evaluation for causal estimation. △ Less

Submitted 31 January, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

Comments: Code and data at https://github.com/kakeith/rct_rejection_sampling

Journal ref: Transactions on Machine Learning Research (TMLR) 2023

arXiv:2306.05511 [pdf, other]

Causal Inference With Outcome-Dependent Missingness And Self-Censoring

Authors: Jacob M Chen, Daniel Malinsky, Rohit Bhattacharya

Abstract: We consider missingness in the context of causal inference when the outcome of interest may be missing. If the outcome directly affects its own missingness status, i.e., it is "self-censoring", this may lead to severely biased causal effect estimates. Miao et al. [2015] proposed the shadow variable method to correct for bias due to self-censoring; however, verifying the required model assumptions… ▽ More We consider missingness in the context of causal inference when the outcome of interest may be missing. If the outcome directly affects its own missingness status, i.e., it is "self-censoring", this may lead to severely biased causal effect estimates. Miao et al. [2015] proposed the shadow variable method to correct for bias due to self-censoring; however, verifying the required model assumptions can be difficult. Here, we propose a test based on a randomized incentive variable offered to encourage reporting of the outcome that can be used to verify identification assumptions that are sufficient to correct for both self-censoring and confounding bias. Concretely, the test confirms whether a given set of pre-treatment covariates is sufficient to block all backdoor paths between the treatment and outcome as well as all paths between the treatment and missingness indicator after conditioning on the outcome. We show that under these conditions, the causal effect is identified by using the treatment as a shadow variable, and it leads to an intuitive inverse probability weighting estimator that uses a product of the treatment and response weights. We evaluate the efficacy of our test and downstream estimator via simulations. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: 15 pages. In proceedings of the 39th Conference on Uncertainty in Artificial Intelligence

arXiv:2304.01953 [pdf, other]

Graphical Models of Entangled Missingness

Authors: Ranjani Srinivasan, Rohit Bhattacharya, Razieh Nabi, Elizabeth L. Ogburn, Ilya Shpitser

Abstract: Despite the growing interest in causal and statistical inference for settings with data dependence, few methods currently exist to account for missing data in dependent data settings; most classical missing data methods in statistics and causal inference treat data units as independent and identically distributed (i.i.d.). We develop a graphical modeling based framework for causal inference in the… ▽ More Despite the growing interest in causal and statistical inference for settings with data dependence, few methods currently exist to account for missing data in dependent data settings; most classical missing data methods in statistics and causal inference treat data units as independent and identically distributed (i.i.d.). We develop a graphical modeling based framework for causal inference in the presence of entangled missingness, defined as missingness with data dependence. We distinguish three different types of entanglements that can occur, supported by real-world examples. We give sound and complete identification results for all three settings. We show that existing missing data models may be extended to cover entanglements arising from (1) target law dependence and (2) missingness process dependence, while those arising from (3) missingness interference require a novel approach. We demonstrate the use of our entangled missingness framework on synthetic data. Finally, we discuss how, subject to a certain reinterpretation of the variables in the model, our model for missingness interference extends missing data methods to novel missing data patterns in i.i.d. settings. △ Less

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2301.11477 [pdf, other]

Ananke: A Python Package For Causal Inference Using Graphical Models

Authors: Jaron J. R. Lee, Rohit Bhattacharya, Razieh Nabi, Ilya Shpitser

Abstract: We implement Ananke: an object-oriented Python package for causal inference with graphical models. At the top of our inheritance structure is an easily extensible Graph class that provides an interface to several broadly useful graph-based algorithms and methods for visualization. We use best practices of object-oriented programming to implement subclasses of the Graph superclass that correspond t… ▽ More We implement Ananke: an object-oriented Python package for causal inference with graphical models. At the top of our inheritance structure is an easily extensible Graph class that provides an interface to several broadly useful graph-based algorithms and methods for visualization. We use best practices of object-oriented programming to implement subclasses of the Graph superclass that correspond to types of causal graphs that are popular in the current literature. This includes directed acyclic graphs for modeling causally sufficient systems, acyclic directed mixed graphs for modeling unmeasured confounding, and chain graphs for modeling data dependence and interference. Within these subclasses, we implement specialized algorithms for common statistical and causal modeling tasks, such as separation criteria for reading conditional independence, nonparametric identification, and parametric and semiparametric estimation of model parameters. Here, we present a broad overview of the package and example usage for a problem with unmeasured confounding. Up to date documentation is available at \url{https://ananke.readthedocs.io/en/latest/}. △ Less

Submitted 26 January, 2023; originally announced January 2023.

arXiv:2210.05558 [pdf, ps, other]

Causal and counterfactual views of missing data models

Authors: Razieh Nabi, Rohit Bhattacharya, Ilya Shpitser, James Robins

Abstract: It is often said that the fundamental problem of causal inference is a missing data problem -- the comparison of responses to two hypothetical treatment assignments is made difficult because for every experimental unit only one potential response is observed. In this paper, we consider the implications of the converse view: that missing data problems are a form of causal inference. We make explici… ▽ More It is often said that the fundamental problem of causal inference is a missing data problem -- the comparison of responses to two hypothetical treatment assignments is made difficult because for every experimental unit only one potential response is observed. In this paper, we consider the implications of the converse view: that missing data problems are a form of causal inference. We make explicit how the missing data problem of recovering the complete data law from the observed law can be viewed as identification of a joint distribution over counterfactual variables corresponding to values had we (possibly contrary to fact) been able to observe them. Drawing analogies with causal inference, we show how identification assumptions in missing data can be encoded in terms of graphical models defined over counterfactual and observed variables. We review recent results in missing data identification from this viewpoint. In doing so, we note interesting similarities and differences between missing data and causal identification theories. △ Less

Submitted 11 October, 2022; originally announced October 2022.

arXiv:2203.00161 [pdf, other]

On Testability of the Front-Door Model via Verma Constraints

Authors: Rohit Bhattacharya, Razieh Nabi

Abstract: The front-door criterion can be used to identify and compute causal effects despite the existence of unmeasured confounders between a treatment and outcome. However, the key assumptions -- (i) the existence of a variable (or set of variables) that fully mediates the effect of the treatment on the outcome, and (ii) which simultaneously does not suffer from similar issues of confounding as the treat… ▽ More The front-door criterion can be used to identify and compute causal effects despite the existence of unmeasured confounders between a treatment and outcome. However, the key assumptions -- (i) the existence of a variable (or set of variables) that fully mediates the effect of the treatment on the outcome, and (ii) which simultaneously does not suffer from similar issues of confounding as the treatment-outcome pair -- are often deemed implausible. This paper explores the testability of these assumptions. We show that under mild conditions involving an auxiliary variable, the assumptions encoded in the front-door model (and simple extensions of it) may be tested via generalized equality constraints a.k.a Verma constraints. We propose two goodness-of-fit tests based on this observation, and evaluate the efficacy of our proposal on real and synthetic data. We also provide theoretical and empirical comparisons to instrumental variable approaches to handling unmeasured confounding. △ Less

Submitted 16 June, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

Comments: 17 pages. In proceedings of the 38th Conference on Uncertainty in Artificial Intelligence

arXiv:2203.00132 [pdf, other]

On Testability and Goodness of Fit Tests in Missing Data Models

Authors: Razieh Nabi, Rohit Bhattacharya

Abstract: Significant progress has been made in develo** identification and estimation techniques for missing data problems where modeling assumptions can be described via a directed acyclic graph. The validity of results using such techniques rely on the assumptions encoded by the graph holding true; however, verification of these assumptions has not received sufficient attention in prior work. In this p… ▽ More Significant progress has been made in develo** identification and estimation techniques for missing data problems where modeling assumptions can be described via a directed acyclic graph. The validity of results using such techniques rely on the assumptions encoded by the graph holding true; however, verification of these assumptions has not received sufficient attention in prior work. In this paper, we provide new insights on the testable implications of three broad classes of missing data graphical models, and design goodness-of-fit tests for them. The classes of models explored are: sequential missing-at-random and missing-not-at-random models which can be used for modeling longitudinal studies with dropout/censoring, and a no self-censoring model which can be applied to cross-sectional studies and surveys. △ Less

Submitted 10 June, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

Journal ref: Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI), 2023

arXiv:2112.07110 [pdf, other]

Non-Asymptotic Analysis of Online Multiplicative Stochastic Gradient Descent

Authors: Riddhiman Bhattacharya, Tiefeng Jiang

Abstract: Past research has indicated that the covariance of the Stochastic Gradient Descent (SGD) error done via minibatching plays a critical role in determining its regularization and escape from low potential points. Motivated by some new research in this area, we prove universality results by showing that noise classes that have the same mean and covariance structure of SGD via minibatching have simila… ▽ More Past research has indicated that the covariance of the Stochastic Gradient Descent (SGD) error done via minibatching plays a critical role in determining its regularization and escape from low potential points. Motivated by some new research in this area, we prove universality results by showing that noise classes that have the same mean and covariance structure of SGD via minibatching have similar properties. We mainly consider the Multiplicative Stochastic Gradient Descent (M-SGD) algorithm as introduced in previous work, which has a much more general noise class than the SGD algorithm done via minibatching. We establish non asymptotic bounds for the M-SGD algorithm in the Wasserstein distance. We also show that the M-SGD error is approximately a scaled Gaussian distribution with mean $0$ at any fixed point of the M-SGD algorithm. △ Less

Submitted 1 March, 2023; v1 submitted 13 December, 2021; originally announced December 2021.

arXiv:2111.12962 [pdf, ps, other]

Simultaneous best linear invariant prediction of future order statistics for location-scale and scale families and associated optimality properties

Authors: Narayanaswamy Balakrishnan, Ritwik Bhattacharya

Abstract: In this article, we first derive an explicit expression for the marginal best linear invariant predictor (BLIP) of an unobserved future order statistic based on a set of early observed ordered statistics. We then derive the joint BLIPs of two future order statistics and prove that the joint predictors are trace-efficient as well as determinant-efficient linear invariant predictors. More generally,… ▽ More In this article, we first derive an explicit expression for the marginal best linear invariant predictor (BLIP) of an unobserved future order statistic based on a set of early observed ordered statistics. We then derive the joint BLIPs of two future order statistics and prove that the joint predictors are trace-efficient as well as determinant-efficient linear invariant predictors. More generally, the BLIPs are shown to possess complete mean squared predictive error matrix dominance property in the class of all linear invariant predictors of two future unobserved order statistics. Finally, these results are extended to the case of simultaneous BLIPs of any $\ell$ future order statistics. Both scale and location-scale families of distributions are considered as the parent distribution for the development of results. △ Less

Submitted 25 November, 2021; originally announced November 2021.

arXiv:2109.14859 [pdf, ps, other]

On simultaneous best linear unbiased prediction of future order statistics and associated properties

Authors: Narayanaswamy Balakrishnan, Ritwik Bhattacharya

Abstract: In this article, the joint best linear unbiased predictors (BLUPs) of two future unobserved order statistics, based on a set of observed order statistics, are developed explicitly. It is shown that these predictors are trace-efficient as well as determinant-efficient BLUPs. More generally, the BLUPs are shown to possess complete mean squared predictive error matrix dominance in the class of all li… ▽ More In this article, the joint best linear unbiased predictors (BLUPs) of two future unobserved order statistics, based on a set of observed order statistics, are developed explicitly. It is shown that these predictors are trace-efficient as well as determinant-efficient BLUPs. More generally, the BLUPs are shown to possess complete mean squared predictive error matrix dominance in the class of all linear unbiased predictors of two future unobserved order statistics. Finally, these results are extended to the case of simultaneous BLUPs of any $l$ future order statistics. △ Less

Submitted 30 September, 2021; originally announced September 2021.

arXiv:2104.03792 [pdf, ps, other]

A MCMC-type simple probabilistic approach for determining optimal progressive censoring schemes

Authors: Ritwik Bhattacharya, Narayanaswamy Balakrishnan

Abstract: We present here a simple probabilistic approach for determining an optimal progressive censoring scheme by defining a probability structure on the set of feasible solutions. Given an initial solution, the new updated solution is computed within the probabilistic structure. This approach will be especially useful when the cardinality of the set of feasible solutions is large. The validation of the… ▽ More We present here a simple probabilistic approach for determining an optimal progressive censoring scheme by defining a probability structure on the set of feasible solutions. Given an initial solution, the new updated solution is computed within the probabilistic structure. This approach will be especially useful when the cardinality of the set of feasible solutions is large. The validation of the proposed approach is demonstrated by comparing the optimal scheme with these obtained by exhaustive numerical search. △ Less

Submitted 8 April, 2021; originally announced April 2021.

arXiv:2012.11836 [pdf, ps, other]

D-optimal joint best linear unbiased prediction of order statistics

Authors: Narayanaswamy Balakrishnan, Ritwik Bhattacharya

Abstract: In life-testing experiments, it is often of interest to predict unobserved future failure times based on observed early failure times. A point best linear unbiased predictor (BLUP) has been developed in this context by Kaminsky and Nelson (1975). In this article, we develop joint BLUPs of two future failure times based on early failure times by minimizing the determinant of the variance-covariance… ▽ More In life-testing experiments, it is often of interest to predict unobserved future failure times based on observed early failure times. A point best linear unbiased predictor (BLUP) has been developed in this context by Kaminsky and Nelson (1975). In this article, we develop joint BLUPs of two future failure times based on early failure times by minimizing the determinant of the variance-covariance matrix of the predictors. The advantage of applying joint prediction is demonstrated by using a real data set. The non-existence of joint BLUPs in certain setups is also discussed. △ Less

Submitted 22 December, 2020; originally announced December 2020.

arXiv:2010.06978 [pdf, other]

Differentiable Causal Discovery Under Unmeasured Confounding

Authors: Rohit Bhattacharya, Tushar Nagarajan, Daniel Malinsky, Ilya Shpitser

Abstract: The data drawn from biological, economic, and social systems are often confounded due to the presence of unmeasured variables. Prior work in causal discovery has focused on discrete search procedures for selecting acyclic directed mixed graphs (ADMGs), specifically ancestral ADMGs, that encode ordinary conditional independence constraints among the observed variables of the system. However, confou… ▽ More The data drawn from biological, economic, and social systems are often confounded due to the presence of unmeasured variables. Prior work in causal discovery has focused on discrete search procedures for selecting acyclic directed mixed graphs (ADMGs), specifically ancestral ADMGs, that encode ordinary conditional independence constraints among the observed variables of the system. However, confounded systems also exhibit more general equality restrictions that cannot be represented via these graphs, placing a limit on the kinds of structures that can be learned using ancestral ADMGs. In this work, we derive differentiable algebraic constraints that fully characterize the space of ancestral ADMGs, as well as more general classes of ADMGs, arid ADMGs and bow-free ADMGs, that capture all equality restrictions on the observed variables. We use these constraints to cast causal discovery as a continuous optimization problem and design differentiable procedures to find the best fitting ADMG when the data comes from a confounded linear system of equations with correlated errors. We demonstrate the efficacy of our method through simulations and application to a protein expression dataset. Code implementing our methods is open-source and publicly available at https://gitlab.com/rbhatta8/dcd and will be incorporated into the Ananke package. △ Less

Submitted 24 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

Comments: Main draft: 9 pages. Appendix: 9 pages

ACM Class: G.3; J.3; F.2.2

arXiv:2008.10706 [pdf, other]

Path Dependent Structural Equation Models

Authors: Ranjani Srinivasan, Jaron Lee, Rohit Bhattacharya, Narges Ahmidi, Ilya Shpitser

Abstract: Causal analyses of longitudinal data generally assume that the qualitative causal structure relating variables remains invariant over time. In structured systems that transition between qualitatively different states in discrete time steps, such an approach is deficient on two fronts. First, time-varying variables may have state-specific causal relationships that need to be captured. Second, an in… ▽ More Causal analyses of longitudinal data generally assume that the qualitative causal structure relating variables remains invariant over time. In structured systems that transition between qualitatively different states in discrete time steps, such an approach is deficient on two fronts. First, time-varying variables may have state-specific causal relationships that need to be captured. Second, an intervention can result in state transitions downstream of the intervention different from those actually observed in the data. In other words, interventions may counterfactually alter the subsequent temporal evolution of the system. We introduce a generalization of causal graphical models, Path Dependent Structural Equation Models (PDSEMs), that can describe such systems. We show how causal inference may be performed in such models and illustrate its use in simulations and data obtained from a septoplasty surgical procedure. △ Less

Submitted 9 November, 2020; v1 submitted 24 August, 2020; originally announced August 2020.

arXiv:2004.08533 [pdf, ps, other]

Determination of Bayesian optimal warranty length under Type-II unified hybrid censoring scheme

Authors: Tanmay Sen, Ritwik Bhattacharya, Biswabrata Pradhan, Yogesh Mani Tripathi

Abstract: Determination of an appropriate warranty length for the lifetime of the product is an important issue to the manufacturer. In this article, optimal warranty length of the product for the combined free replacement and the pro-rata warranty policy is computed based on the Type-II unified hybrid censored data. A non-linear pro-rata warranty policy is proposed in this context. The optimal warranty len… ▽ More Determination of an appropriate warranty length for the lifetime of the product is an important issue to the manufacturer. In this article, optimal warranty length of the product for the combined free replacement and the pro-rata warranty policy is computed based on the Type-II unified hybrid censored data. A non-linear pro-rata warranty policy is proposed in this context. The optimal warranty length is obtained by maximizing an expected utility function. The expectation is taken with respect to the posterior predictive model for the time-to-failure data. It is observed that the non-linear pro-rata warranty policy gives a larger warranty length with maximum profit as compared to linear warranty policy. Finally, a real-data set is analyzed in order to illustrate the advantage of using non-linear pro-rata warranty policy. △ Less

Submitted 18 April, 2020; originally announced April 2020.

arXiv:2004.05308 [pdf, ps, other]

Statistical inference and Bayesian optimal life-testing plans under Type-II unified hybrid censoring scheme

Authors: Tanmay Sen, Ritwik Bhattacharya, Biswabrata Pradhan, Yogesh Mani Tripathi

Abstract: This article describes the inferential procedures and Bayesian optimal life-testing issues under Type-II unified hybrid censoring scheme. First, the explicit expressions of expected number of failures, expected duration of testing and Fisher information matrix for the unknown parameters of the underlying lifetime model are derived. Then, using these quantities, the Bayesian optimal life-testing pl… ▽ More This article describes the inferential procedures and Bayesian optimal life-testing issues under Type-II unified hybrid censoring scheme. First, the explicit expressions of expected number of failures, expected duration of testing and Fisher information matrix for the unknown parameters of the underlying lifetime model are derived. Then, using these quantities, the Bayesian optimal life-testing plans are computed in subsequent section. A cost constraint D-optimal optimization problem has been formulated and the corresponding solution algorithm is provided to obtain optimal plans. Computational procedures are illustrated through numerical examples. △ Less

Submitted 11 April, 2020; originally announced April 2020.

arXiv:2004.04872 [pdf, ps, other]

Full Law Identification In Graphical Models Of Missing Data: Completeness Results

Authors: Razieh Nabi, Rohit Bhattacharya, Ilya Shpitser

Abstract: Missing data has the potential to affect analyses conducted in all fields of scientific study, including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic… ▽ More Missing data has the potential to affect analyses conducted in all fields of scientific study, including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic graph. In this paper, we address the longstanding question of the characterization of models that are identifiable within this class of missing data distributions. We provide the first completeness result in this field of study -- necessary and sufficient graphical conditions under which, the full data distribution can be recovered from the observed data distribution. We then simultaneously address issues that may arise due to the presence of both missing data and unmeasured confounding, by extending these graphical conditions and proofs of completeness, to settings where some variables are not just missing, but completely unobserved. △ Less

Submitted 31 August, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

Comments: Camera ready version published at ICML 2020

Journal ref: Proceedings of the 37th International Conference on Machine Learning, PMLR 119, 2020

arXiv:2003.12659 [pdf, other]

Semiparametric Inference For Causal Effects In Graphical Models With Hidden Variables

Authors: Rohit Bhattacharya, Razieh Nabi, Ilya Shpitser

Abstract: Identification theory for causal effects in causal models associated with hidden variable directed acyclic graphs (DAGs) is well studied. However, the corresponding algorithms are underused due to the complexity of estimating the identifying functionals they output. In this work, we bridge the gap between identification and estimation of population-level causal effects involving a single treatment… ▽ More Identification theory for causal effects in causal models associated with hidden variable directed acyclic graphs (DAGs) is well studied. However, the corresponding algorithms are underused due to the complexity of estimating the identifying functionals they output. In this work, we bridge the gap between identification and estimation of population-level causal effects involving a single treatment and a single outcome. We derive influence function based estimators that exhibit double robustness for the identified effects in a large class of hidden variable DAGs where the treatment satisfies a simple graphical criterion; this class includes models yielding the adjustment and front-door functionals as special cases. We also provide necessary and sufficient conditions under which the statistical model of a hidden variable DAG is nonparametrically saturated and implies no equality constraints on the observed data distribution. Further, we derive an important class of hidden variable DAGs that imply observed data distributions observationally equivalent (up to equality constraints) to fully observed DAGs. In these classes of DAGs, we derive estimators that achieve the semiparametric efficiency bounds for the target of interest where the treatment satisfies our graphical criterion. Finally, we provide a sound and complete identification algorithm that directly yields a weight based estimation strategy for any identifiable effect in hidden variable causal models. △ Less

Submitted 13 October, 2022; v1 submitted 27 March, 2020; originally announced March 2020.

Comments: 76 pages

arXiv:1907.00241 [pdf, ps, other]

Identification In Missing Data Models Represented By Directed Acyclic Graphs

Authors: Rohit Bhattacharya, Razieh Nabi, Ilya Shpitser, James M. Robins

Abstract: Missing data is a pervasive problem in data analyses, resulting in datasets that contain censored realizations of a target distribution. Many approaches to inference on the target distribution using censored observed data, rely on missing data models represented as a factorization with respect to a directed acyclic graph. In this paper we consider the identifiability of the target distribution wit… ▽ More Missing data is a pervasive problem in data analyses, resulting in datasets that contain censored realizations of a target distribution. Many approaches to inference on the target distribution using censored observed data, rely on missing data models represented as a factorization with respect to a directed acyclic graph. In this paper we consider the identifiability of the target distribution within this class of models, and show that the most general identification strategies proposed so far retain a significant gap in that they fail to identify a wide class of identifiable distributions. To address this gap, we propose a new algorithm that significantly generalizes the types of manipulations used in the ID algorithm, developed in the context of causal inference, in order to obtain identification. △ Less

Submitted 29 June, 2019; originally announced July 2019.

Comments: 16 pages, published in proceedings of 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

arXiv:1907.00221 [pdf, other]

Causal Inference Under Interference And Network Uncertainty

Authors: Rohit Bhattacharya, Daniel Malinsky, Ilya Shpitser

Abstract: Classical causal and statistical inference methods typically assume the observed data consists of independent realizations. However, in many applications this assumption is inappropriate due to a network of dependences between units in the data. Methods for estimating causal effects have been developed in the setting where the structure of dependence between units is known exactly, but in practice… ▽ More Classical causal and statistical inference methods typically assume the observed data consists of independent realizations. However, in many applications this assumption is inappropriate due to a network of dependences between units in the data. Methods for estimating causal effects have been developed in the setting where the structure of dependence between units is known exactly, but in practice there is often substantial uncertainty about the precise network structure. This is true, for example, in trial data drawn from vulnerable communities where social ties are difficult to query directly. In this paper we combine techniques from the structure learning and interference literatures in causal inference, proposing a general method for estimating causal effects under data dependence when the structure of this dependence is not known a priori. We demonstrate the utility of our method on synthetic datasets which exhibit network dependence. △ Less

Submitted 29 June, 2019; originally announced July 2019.

Comments: 16 pages, published in proceedings of 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

arXiv:0805.3282 [pdf, ps, other]

doi 10.1214/074921708000000200

Nonparametric statistics on manifolds with applications to shape spaces

Authors: Abhishek Bhattacharya, Rabi Bhattacharya

Abstract: This article presents certain recent methodologies and some new results for the statistical analysis of probability distributions on manifolds. An important example considered in some detail here is the 2-D shape space of k-ads, comprising all configurations of $k$ planar landmarks ($k>2$)-modulo translation, scaling and rotation. This article presents certain recent methodologies and some new results for the statistical analysis of probability distributions on manifolds. An important example considered in some detail here is the 2-D shape space of k-ads, comprising all configurations of $k$ planar landmarks ($k>2$)-modulo translation, scaling and rotation. △ Less

Submitted 21 May, 2008; originally announced May 2008.

Comments: Published in at http://dx.doi.org/10.1214/074921708000000200 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-COLL3-IMSCOLL320 MSC Class: 62G20 (Primary) 62E20; 62H35 (Secondary)

Journal ref: IMS Collections 2008, Vol. 3, 282-301

Showing 1–27 of 27 results for author: Bhattacharya, R