Skip to main content

Showing 1–21 of 21 results for author: Pfister, N

.
  1. arXiv:2406.19986  [pdf, other

    stat.ME

    Instrumental Variable Estimation of Distributional Causal Effects

    Authors: Lucas Kook, Niklas Pfister

    Abstract: Estimating the causal effect of a treatment on the entire response distribution is an important yet challenging task. For instance, one might be interested in how a pension plan affects not only the average savings among all individuals but also how it affects the entire savings distribution. While sufficiently large randomized studies can be used to estimate such distributional causal effects, th… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Code available at https://github.com/lucaskook/dive

  2. arXiv:2404.09962  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Invariant Subspace Decomposition

    Authors: Margherita Lazzaretto, Jonas Peters, Niklas Pfister

    Abstract: We consider the task of predicting a response Y from a set of covariates X in settings where the conditional distribution of Y given X changes over time. For this to be feasible, assumptions on how the conditional distribution changes over time are required. Existing approaches assume, for example, that changes occur smoothly over time so that short-term prediction using only the recent past becom… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  3. arXiv:2403.12677  [pdf, other

    stat.ME

    Causal Change Point Detection and Localization

    Authors: Shimeng Huang, Jonas Peters, Niklas Pfister

    Abstract: Detecting and localizing change points in sequential data is of interest in many areas of application. Various notions of change points have been proposed, such as changes in mean, variance, or the linear regression coefficient. In this work, we consider settings in which a response variable $Y$ and a set of covariates $X=(X^1,\ldots,X^{d+1})$ are observed over time and aim to find changes in the… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2402.09758  [pdf, other

    stat.ME math.ST stat.ML

    Extrapolation-Aware Nonparametric Statistical Inference

    Authors: Niklas Pfister, Peter Bühlmann

    Abstract: We define extrapolation as any type of statistical inference on a conditional function (e.g., a conditional expectation or conditional quantile) evaluated outside of the support of the conditioning variable. This type of extrapolation occurs in many data analysis applications and can invalidate the resulting conclusions if not taken into account. While extrapolating is straightforward in parametri… ▽ More

    Submitted 12 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  5. arXiv:2311.18501  [pdf, other

    stat.ME math.ST stat.ML

    Perturbation-based Effect Measures for Compositional Data

    Authors: Anton Rask Lundborg, Niklas Pfister

    Abstract: Existing effect measures for compositional features are inadequate for many modern applications for two reasons. First, modern datasets with compositional covariates, for example in microbiome research, display traits such as high-dimensionality and sparsity that can be poorly modelled with traditional parametric approaches. Second, assessing -- in an unbiased way -- how summary statistics of a co… ▽ More

    Submitted 18 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  6. arXiv:2310.05805  [pdf, other

    stat.ML cs.LG

    Boosted Control Functions

    Authors: Nicola Gnecco, Jonas Peters, Sebastian Engelke, Niklas Pfister

    Abstract: Modern machine learning methods and the availability of large-scale data opened the door to accurately predict target quantities from large sets of covariates. However, existing prediction methods can perform poorly when the training and testing data are different, especially in the presence of hidden confounding. While hidden confounding is well studied for causal effect estimation (e.g., instrum… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  7. arXiv:2310.04295  [pdf, other

    cs.LG cs.AI stat.ML

    Identifying Representations for Intervention Extrapolation

    Authors: Sorawit Saengkyongam, Elan Rosenfeld, Pradeep Ravikumar, Niklas Pfister, Jonas Peters

    Abstract: The premise of identifiable and causal representation learning is to improve the current representation learning paradigm in terms of generalizability or robustness. Despite recent progress in questions of identifiability, more theoretical results demonstrating concrete advantages of these methods for downstream tasks are needed. In this paper, we consider the task of intervention extrapolation: p… ▽ More

    Submitted 5 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2024

  8. arXiv:2306.10983  [pdf, other

    stat.ML cs.LG

    Effect-Invariant Mechanisms for Policy Generalization

    Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

    Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call f… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  9. arXiv:2205.07271  [pdf, other

    stat.ML cs.LG stat.AP

    Supervised Learning and Model Analysis with Compositional Data

    Authors: Shimeng Huang, Elisabeth Ailer, Niki Kilbertus, Niklas Pfister

    Abstract: The compositionality and sparsity of high-throughput sequencing data poses a challenge for regression and classification. However, in microbiome research in particular, conditional modeling is an essential tool to investigate relationships between phenotypes and the microbiome. Existing techniques are often inadequate: they either rely on extensions of the linear log-contrast model (which adjusts… ▽ More

    Submitted 11 November, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  10. arXiv:2203.09380  [pdf, other

    stat.ME math.ST stat.ML

    Identifiability of Sparse Causal Effects using Instrumental Variables

    Authors: Niklas Pfister, Jonas Peters

    Abstract: Exogenous heterogeneity, for example, in the form of instrumental variables can help us learn a system's underlying causal structure and predict the outcome of unseen intervention experiments. In this paper, we consider linear models in which the causal effect from covariates $X$ on a response $Y$ is sparse. We provide conditions under which the causal coefficient becomes identifiable from the obs… ▽ More

    Submitted 25 April, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

  11. arXiv:2202.06052  [pdf, other

    cs.LG cs.RO eess.SY stat.ME stat.ML

    Learning by Doing: Controlling a Dynamical System using Causality, Control, and Reinforcement Learning

    Authors: Sebastian Weichwald, Søren Wengel Mogensen, Tabitha Edith Lee, Dominik Baumann, Oliver Kroemer, Isabelle Guyon, Sebastian Trimpe, Jonas Peters, Niklas Pfister

    Abstract: Questions in causality, control, and reinforcement learning go beyond the classical machine learning task of prediction under i.i.d. observations. Instead, these fields consider the problem of learning how to actively perturb a system to achieve a certain effect on a response variable. Arguably, they have complementary views on the problem: In control, one usually aims to first identify the system… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: https://learningbydoingcompetition.github.io/

  12. arXiv:2202.01864  [pdf, other

    stat.ML cs.LG stat.ME

    Exploiting Independent Instruments: Identification and Distribution Generalization

    Authors: Sorawit Saengkyongam, Leonard Henckel, Niklas Pfister, Jonas Peters

    Abstract: Instrumental variable models allow us to identify a causal function between covariates $X$ and a response $Y$, even in the presence of unobserved confounding. Most of the existing estimators assume that the error term in the response $Y$ and the hidden confounders are uncorrelated with the instruments $Z$. This is often motivated by a graphical separation, an argument that also justifies independe… ▽ More

    Submitted 22 September, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted at ICML 2022

  13. arXiv:2106.00808  [pdf, other

    cs.LG cs.AI stat.ML

    Invariant Policy Learning: A Causal Perspective

    Authors: Sorawit Saengkyongam, Nikolaj Thams, Jonas Peters, Niklas Pfister

    Abstract: Contextual bandit and reinforcement learning algorithms have been successfully used in various interactive learning systems such as online advertising, recommender systems, and dynamic pricing. However, they have yet to be widely adopted in high-stakes application domains, such as healthcare. One reason may be that existing approaches assume that the underlying mechanisms are static in the sense t… ▽ More

    Submitted 22 September, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  14. arXiv:2105.10821  [pdf, other

    stat.ME math.ST

    Statistical Testing under Distributional Shifts

    Authors: Nikolaj Thams, Sorawit Saengkyongam, Niklas Pfister, Jonas Peters

    Abstract: In this work, we introduce statistical testing under distributional shifts. We are interested in the hypothesis $P^* \in H_0$ for a target distribution $P^*$, but observe data from a different distribution $Q^*$. We assume that $P^*$ is related to $Q^*$ through a known shift $τ$ and formally introduce hypothesis testing in this setting. We propose a general testing procedure that first resamples f… ▽ More

    Submitted 29 April, 2022; v1 submitted 22 May, 2021; originally announced May 2021.

    MSC Class: 62G10 (Primary); 68Txx (Secondary)

  15. arXiv:2006.07433  [pdf, other

    stat.ME

    A causal framework for distribution generalization

    Authors: Rune Christiansen, Niklas Pfister, Martin Emil Jakobsen, Nicola Gnecco, Jonas Peters

    Abstract: We consider the problem of predicting a response $Y$ from a set of covariates $X$ when test and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain u… ▽ More

    Submitted 17 August, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 52 pages, 8 figures, 2 tables. To be published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

    MSC Class: Primary 62Gxx; secondary 62G35; 62G08; 62D20

  16. arXiv:2001.06208  [pdf, other

    stat.ME math.DS

    Causal models for dynamical systems

    Authors: Jonas Peters, Stefan Bauer, Niklas Pfister

    Abstract: A probabilistic model describes a system in its observational state. In many situations, however, we are interested in the system's response under interventions. The class of structural causal models provides a language that allows us to model the behaviour under interventions. It can been taken as a starting point to answer a plethora of causal questions, including the identification of causal ef… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

  17. arXiv:1911.01850  [pdf, other

    stat.ME stat.AP

    Stabilizing Variable Selection and Regression

    Authors: Niklas Pfister, Evan G. Williams, Jonas Peters, Ruedi Aebersold, Peter Bühlmann

    Abstract: We consider regression in which one predicts a response $Y$ with a set of predictors $X$ across different experiments or environments. This is a common setup in many data-driven scientific fields and we argue that statistical inference can benefit from an analysis that takes into account the distributional changes across environments. In particular, it is useful to distinguish between stable and u… ▽ More

    Submitted 21 May, 2021; v1 submitted 5 November, 2019; originally announced November 2019.

  18. arXiv:1810.11776  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Learning stable and predictive structures in kinetic systems: Benefits of a causal approach

    Authors: Niklas Pfister, Stefan Bauer, Jonas Peters

    Abstract: Learning kinetic systems from data is one of the core challenges in many fields. Identifying stable models is essential for the generalization capabilities of data-driven inference. We introduce a computationally efficient framework, called CausalKinetiX, that identifies structure from discrete time, noisy observations, generated from heterogeneous experiments. The algorithm assumes the existence… ▽ More

    Submitted 28 November, 2019; v1 submitted 28 October, 2018; originally announced October 2018.

  19. arXiv:1806.01094  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    Robustifying Independent Component Analysis by Adjusting for Group-Wise Stationary Noise

    Authors: Niklas Pfister, Sebastian Weichwald, Peter Bühlmann, Bernhard Schölkopf

    Abstract: We introduce coroICA, confounding-robust independent component analysis, a novel ICA algorithm which decomposes linearly mixed multivariate observations into independent components that are corrupted (and rendered dependent) by hidden group-wise stationary confounding. It extends the ordinary ICA model in a theoretically sound and explicit way to incorporate group-wise (or environment-wise) confou… ▽ More

    Submitted 30 October, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: equal contribution between Pfister and Weichwald

    Journal ref: Journal of Machine Learning Research, 20(147):1-50, 2019. ( http://www.jmlr.org/papers/v20/18-399.html )

  20. arXiv:1706.08058  [pdf, ps, other

    math.ST stat.AP stat.ME

    Invariant Causal Prediction for Sequential Data

    Authors: Niklas Pfister, Peter Bühlmann, Jonas Peters

    Abstract: We investigate the problem of inferring the causal predictors of a response $Y$ from a set of $d$ explanatory variables $(X^1,\dots,X^d)$. Classical ordinary least squares regression includes all predictors that reduce the variance of $Y$. Using only the causal predictors instead leads to models that have the advantage of remaining invariant under interventions, loosely speaking they lead to invar… ▽ More

    Submitted 28 May, 2018; v1 submitted 25 June, 2017; originally announced June 2017.

    Comments: 55 pages

    MSC Class: 62L05; 62P20; 63J05 ACM Class: G.3

  21. arXiv:1603.00285  [pdf, ps, other

    math.ST stat.ML

    Kernel-based Tests for Joint Independence

    Authors: Niklas Pfister, Peter Bühlmann, Bernhard Schölkopf, Jonas Peters

    Abstract: We investigate the problem of testing whether $d$ random variables, which may or may not be continuous, are jointly (or mutually) independent. Our method builds on ideas of the two variable Hilbert-Schmidt independence criterion (HSIC) but allows for an arbitrary number of variables. We embed the $d$-dimensional joint distribution and the product of the marginals into a reproducing kernel Hilbert… ▽ More

    Submitted 4 November, 2016; v1 submitted 1 March, 2016; originally announced March 2016.

    Comments: 67 pages