Skip to main content

Showing 1–39 of 39 results for author: von Kügelgen, J

.
  1. arXiv:2406.13371  [pdf, other

    cs.LG cs.AI stat.ML

    Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment

    Authors: Julius von Kügelgen

    Abstract: Causal models provide rich descriptions of complex systems as sets of mechanisms by which each variable is influenced by its direct causes. They support reasoning about manipulating parts of the system and thus hold promise for addressing some of the open challenges of artificial intelligence (AI), such as planning, transferring knowledge in changing environments, or robustness to distribution shi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: PhD Thesis; 190 pages, 33 figures, 6 tables

    Journal ref: University of Cambridge, 2024

  2. arXiv:2403.08335  [pdf, other

    cs.LG cs.AI stat.ML

    A Sparsity Principle for Partially Observable Causal Representation Learning

    Authors: Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane

    Abstract: Causal representation learning aims at identifying high-level causal variables from perceptual data. Most methods assume that all latent causal variables are captured in the high-dimensional observations. We instead consider a partially observed setting, in which each measurement only provides information about a subset of the underlying causal state. Prior work has studied this setting with multi… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 45 pages, 32 figures, 16 tables

  3. arXiv:2312.13438  [pdf, ps, other

    stat.ML cs.LG

    Independent Mechanism Analysis and the Manifold Hypothesis

    Authors: Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

    Abstract: Independent Mechanism Analysis (IMA) seeks to address non-identifiability in nonlinear Independent Component Analysis (ICA) by assuming that the Jacobian of the mixing function has orthogonal columns. As typical in ICA, previous work focused on the case with an equal number of latent components and observed mixtures. Here, we extend IMA to settings with a larger number of mixtures that reside on a… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 6 pages, Accepted at Neurips Causal Representation Learning 2023

  4. arXiv:2311.08815  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  5. arXiv:2311.08743  [pdf, other

    stat.ME

    Kernel-based independence tests for causal structure learning on functional data

    Authors: Felix Laumann, Julius von Kügelgen, Junhyung Park, Bernhard Schölkopf, Mauricio Barahona

    Abstract: Measurements of systems taken along a continuous functional dimension, such as time or space, are ubiquitous in many fields, from the physical and biological sciences to economics and engineering.Such measurements can be viewed as realisations of an underlying smooth process sampled over the continuum. However, traditional methods for independence testing and causal learning are not directly appli… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  6. arXiv:2311.04056  [pdf, other

    cs.LG cs.AI

    Multi-View Causal Representation Learning with Partial Observability

    Authors: Dingling Yao, Danru Xu, Sébastien Lachapelle, Sara Magliacane, Perouz Taslakian, Georg Martius, Julius von Kügelgen, Francesco Locatello

    Abstract: We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of v… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 28 pages, 10 figures, 11 tables

  7. arXiv:2310.07665  [pdf, other

    cs.AI cs.LG stat.ML

    Deep Backtracking Counterfactuals for Causally Compliant Explanations

    Authors: Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

    Abstract: Counterfactuals answer questions of what would have been observed under altered circumstances and can therefore offer valuable insights. Whereas the classical interventional interpretation of counterfactuals has been studied extensively, backtracking constitutes a less studied alternative where all causal laws are kept intact. In the present work, we introduce a practical method called deep backtr… ▽ More

    Submitted 9 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  8. arXiv:2307.09933  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

    Authors: Cian Eastwood, Shashank Singh, Andrei Liviu Nicolicioiu, Marin Vlastelica, Julius von Kügelgen, Bernhard Schölkopf

    Abstract: To avoid failures on out-of-distribution data, recent works have sought to extract features that have an invariant or stable relationship with the label across domains, discarding "spurious" or unstable features whose relationship with the label changes across domains. However, unstable features often carry complementary information that could boost performance if used correctly in the test domain… ▽ More

    Submitted 8 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 Camera-Ready

  9. arXiv:2306.06002  [pdf, other

    stat.ME cs.AI

    Causal Effect Estimation from Observational and Interventional Data Through Matrix Weighted Linear Estimators

    Authors: Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

    Abstract: We study causal effect estimation from a mixture of observational and interventional data in a confounded linear regression model with multivariate treatments. We show that the statistical efficiency in terms of expected squared error can be improved by combining estimators arising from both the observational and interventional setting. To this end, we derive methods based on matrix weighted linea… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Journal ref: UAI 2023

  10. arXiv:2306.00542  [pdf, other

    stat.ML cs.AI cs.LG

    Nonparametric Identifiability of Causal Representations from Unknown Interventions

    Authors: Julius von Kügelgen, Michel Besserve, Liang Wendong, Luigi Gresele, Armin Kekić, Elias Bareinboim, David M. Blei, Bernhard Schölkopf

    Abstract: We study causal representation learning, the task of inferring latent causal variables and their causal relations from high-dimensional mixtures of the variables. Prior work relies on weak supervision, in the form of counterfactual pre- and post-intervention views or temporal structure; places restrictive assumptions, such as linearity, on the mixing function or latent causal model; or requires pa… ▽ More

    Submitted 28 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready version; 36 pages, 4 figures

    MSC Class: 68T05 ACM Class: I.2.6

  11. arXiv:2305.17225  [pdf, other

    stat.ML cs.AI cs.LG

    Causal Component Analysis

    Authors: Liang Wendong, Armin Kekić, Julius von Kügelgen, Simon Buchholz, Michel Besserve, Luigi Gresele, Bernhard Schölkopf

    Abstract: Independent Component Analysis (ICA) aims to recover independent latent variables from observed mixtures thereof. Causal Representation Learning (CRL) aims instead to infer causally related (thus often statistically dependent) latent variables, together with the unknown graph encoding their causal relationships. We introduce an intermediate problem termed Causal Component Analysis (CauCA). CauCA c… ▽ More

    Submitted 17 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 final camera-ready version

  12. arXiv:2305.14229  [pdf, other

    cs.LG cs.CV

    Provably Learning Object-Centric Representations

    Authors: Jack Brady, Roland S. Zimmermann, Yash Sharma, Bernhard Schölkopf, Julius von Kügelgen, Wieland Brendel

    Abstract: Learning structured representations of the visual world in terms of objects promises to significantly improve the generalization abilities of current machine learning models. While recent efforts to this end have shown promising empirical progress, a theoretical account of when unsupervised object-centric representation learning is possible is still lacking. Consequently, understanding the reasons… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Oral at ICML 2023. The first two authors as well as the last two authors contributed equally. Code is available at https://brendel-group.github.io/objects-identifiability

  13. arXiv:2212.08498  [pdf, other

    stat.AP cs.AI math.DS

    Evaluating vaccine allocation strategies using simulation-assisted causal modelling

    Authors: Armin Kekić, Jonas Dehning, Luigi Gresele, Julius von Kügelgen, Viola Priesemann, Bernhard Schölkopf

    Abstract: Early on during a pandemic, vaccine availability is limited, requiring prioritisation of different population groups. Evaluating vaccine allocation is therefore a crucial element of pandemics response. In the present work, we develop a model to retrospectively evaluate age-dependent counterfactual vaccine allocation strategies against the COVID-19 pandemic. To estimate the effect of allocation on… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  14. arXiv:2211.00472  [pdf, ps, other

    cs.AI cs.LG stat.ML

    Backtracking Counterfactuals

    Authors: Julius von Kügelgen, Abdirisak Mohamed, Sander Beckers

    Abstract: Counterfactual reasoning -- envisioning hypothetical scenarios, or possible worlds, where some circumstances are different from what (f)actually occurred (counter-to-fact) -- is ubiquitous in human cognition. Conventionally, counterfactually-altered circumstances have been treated as "small miracles" that locally violate the laws of nature while sharing the same initial conditions. In Pearl's stru… ▽ More

    Submitted 30 May, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: 2nd Conference on Causal Learning and Reasoning (CLeaR 2023) (minor formatting changes from conference camera ready version)

  15. arXiv:2210.00364  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability

    Authors: Cian Eastwood, Andrei Liviu Nicolicioiu, Julius von Kügelgen, Armin Kekić, Frederik Träuble, Andrea Dittadi, Bernhard Schölkopf

    Abstract: In representation learning, a common approach is to seek representations which disentangle the underlying factors of variation. Eastwood & Williams (2018) proposed three metrics for quantifying the quality of such disentangled representations: disentanglement (D), completeness (C) and informativeness (I). In this work, we first connect this DCI framework to two common notions of linear and nonline… ▽ More

    Submitted 16 February, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  16. arXiv:2207.09944  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Probable Domain Generalization via Quantile Risk Minimization

    Authors: Cian Eastwood, Alexander Robey, Shashank Singh, Julius von Kügelgen, Hamed Hassani, George J. Pappas, Bernhard Schölkopf

    Abstract: Domain generalization (DG) seeks predictors which perform well on unseen test distributions by leveraging data drawn from multiple related training distributions or domains. To achieve this, DG is commonly formulated as an average- or worst-case problem over the set of possible domains. However, predictors that perform well on average lack robustness while predictors that perform well in the worst… ▽ More

    Submitted 22 August, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022 camera-ready (+ minor corrections)

  17. arXiv:2206.02416  [pdf, other

    stat.ML cs.AI cs.LG

    Embrace the Gap: VAEs Perform Independent Mechanism Analysis

    Authors: Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von Kügelgen, Dominik Zietlow, Bernhard Schölkopf, Georg Martius, Wieland Brendel, Michel Besserve

    Abstract: Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, sinc… ▽ More

    Submitted 27 January, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS2022 final version

  18. arXiv:2206.02063  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Active Bayesian Causal Inference

    Authors: Christian Toth, Lars Lorch, Christian Knoll, Andreas Krause, Franz Pernkopf, Robert Peharz, Julius von Kügelgen

    Abstract: Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a B… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. RP & JvK are shared last authors. 10 pages + Bibliography + Appendix (34 pages total)

  19. arXiv:2206.02013  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis

    Authors: Ronan Perry, Julius von Kügelgen, Bernhard Schölkopf

    Abstract: Machine learning approaches commonly rely on the assumption of independent and identically distributed (i.i.d.) data. In reality, however, this assumption is almost always violated due to distribution shifts between environments. Although valuable learning signals can be provided by heterogeneous data from changing distributions, it is also known that learning under arbitrary (adversarial) changes… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. JvK and BS are shared last authors. 10 pages + Bibliography + Appendix (26 pages total)

  20. arXiv:2204.00607  [pdf, other

    cs.AI cs.LG stat.ML

    From Statistical to Causal Learning

    Authors: Bernhard Schölkopf, Julius von Kügelgen

    Abstract: We describe basic ideas underlying research to build and understand artificially intelligent systems: from symbolic approaches via statistical learning to interventional models relying on concepts of causality. Some of the hard open problems of machine learning and AI are intrinsically related to causality, and progress may require advances in our understanding of how to model and infer causality… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: To appear in the Proceedings of the International Congress of Mathematicians 2022, EMS Press. Both authors contributed equally to this work; names are listed in alphabetical order. 34 pages (28 content pages + references), 12 figures, 2 tables. arXiv admin note: text overlap with arXiv:1911.10500

  21. arXiv:2202.06844  [pdf, other

    stat.ML cs.AI cs.LG

    On Pitfalls of Identifiability in Unsupervised Learning. A Note on: "Desiderata for Representation Learning: A Causal Perspective"

    Authors: Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

    Abstract: Model identifiability is a desirable property in the context of unsupervised representation learning. In absence thereof, different models may be observationally indistinguishable while yielding representations that are nontrivially related to one another, thus making the recovery of a ground truth generative model fundamentally impossible, as often shown through suitably constructed counterexampl… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 5 pages, 1 figure

  22. arXiv:2202.01300  [pdf, other

    cs.AI cs.LG

    Causal Inference Through the Structural Causal Marginal Problem

    Authors: Luigi Gresele, Julius von Kügelgen, Jonas M. Kübler, Elke Kirschbaum, Bernhard Schölkopf, Dominik Janzing

    Abstract: We introduce an approach to counterfactual inference based on merging information from multiple datasets. We consider a causal reformulation of the statistical marginal problem: given a collection of marginal structural causal models (SCMs) over distinct but overlap** sets of variables, determine the set of joint SCMs that are counterfactually consistent with the marginal ones. We formalise this… ▽ More

    Submitted 14 July, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 32 pages (9 pages main paper + bibliography and appendix), 6 figures

    Journal ref: International Conference on Machine Learning (ICML 2022), 7793-7824

  23. arXiv:2110.06562  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Object Learning via Common Fate

    Authors: Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning generative object models from unlabelled videos is a long standing problem and required for causal scene modeling. We decompose this problem into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative model… ▽ More

    Submitted 15 May, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at CLeaR 2023

  24. arXiv:2110.05304  [pdf, other

    cs.LG

    You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction

    Authors: Osama Makansi, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Dominik Janzing, Thomas Brox, Bernhard Schölkopf

    Abstract: Predicting the future trajectory of a moving agent can be easy when the past trajectory continues smoothly but is challenging when complex interactions with other agents are involved. Recent deep learning approaches for trajectory prediction show promising performance and partially attribute this to successful reasoning about agent-agent interactions. However, it remains unclear which features suc… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  25. arXiv:2110.03618  [pdf, other

    cs.CL cs.AI cs.LG

    Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP

    Authors: Zhi**g **, Julius von Kügelgen, **gwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan, Bernhard Schölkopf

    Abstract: The principle of independent causal mechanisms (ICM) states that generative processes of real world data consist of independent modules which do not influence or inform each other. While this idea has led to fruitful developments in the field of causal inference, it is not widely-known in the NLP community. In this work, we argue that the causal direction of the data collection process bears nontr… ▽ More

    Submitted 19 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 (Oral)

  26. arXiv:2107.08221  [pdf, other

    cs.LG cs.CV

    Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

    Authors: Lukas Schott, Julius von Kügelgen, Frederik Träuble, Peter Gehler, Chris Russell, Matthias Bethge, Bernhard Schölkopf, Francesco Locatello, Wieland Brendel

    Abstract: An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world. In this paper, we test whether 17 unsupervised, weakly supervised, and fully supervised representation learning approaches correctly infer the generative factors of variation in simple datasets (dSprites, Shapes3D,… ▽ More

    Submitted 12 February, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

  27. arXiv:2107.01057  [pdf, other

    cs.LG cs.AI

    Backward-Compatible Prediction Updates: A Probabilistic Approach

    Authors: Frederik Träuble, Julius von Kügelgen, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Peter Gehler

    Abstract: When machine learning systems meet real world applications, accuracy is only one of several requirements. In this paper, we assay a complementary perspective originating from the increasing availability of pre-trained and regularly improving state-of-the-art models. While new improved models develop at a fast pace, downstream tasks vary more slowly or stay constant. Assume that we have a large unl… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  28. arXiv:2106.11849  [pdf, other

    stat.ML cs.AI cs.LG

    Algorithmic Recourse in Partially and Fully Confounded Settings Through Bounding Counterfactual Effects

    Authors: Julius von Kügelgen, Nikita Agarwal, Jakob Zeitler, Afsaneh Mastouri, Bernhard Schölkopf

    Abstract: Algorithmic recourse aims to provide actionable recommendations to individuals to obtain a more favourable outcome from an automated decision-making system. As it involves reasoning about interventions performed in the physical world, recourse is fundamentally a causal problem. Existing methods compute the effect of recourse actions using a causal model learnt from data under the assumption of no… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: Preliminary workshop version; work in progress

  29. arXiv:2106.05200  [pdf, other

    stat.ML cs.AI cs.LG

    Independent mechanism analysis, a new concept?

    Authors: Luigi Gresele, Julius von Kügelgen, Vincent Stimper, Bernhard Schölkopf, Michel Besserve

    Abstract: Independent component analysis provides a principled framework for unsupervised representation learning, with solid theory on the identifiability of the latent code that generated the data, given only observations of mixtures thereof. Unfortunately, when the mixing is nonlinear, the model is provably nonidentifiable, since statistical independence alone does not sufficiently constrain the problem.… ▽ More

    Submitted 9 February, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready version

  30. arXiv:2106.04619  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

    Authors: Julius von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

    Abstract: Self-supervised representation learning has shown remarkable success in a number of domains. A common practice is to perform data augmentation via hand-crafted transformations intended to leave the semantics of the data invariant. We seek to understand the empirical success of this approach from a theoretical perspective. We formulate the augmentation process as a latent variable model by postulat… ▽ More

    Submitted 14 January, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready revision (with minor corrections)

  31. arXiv:2010.06529  [pdf, other

    cs.LG cs.AI stat.ML

    On the Fairness of Causal Algorithmic Recourse

    Authors: Julius von Kügelgen, Amir-Hossein Karimi, Umang Bhatt, Isabel Valera, Adrian Weller, Bernhard Schölkopf

    Abstract: Algorithmic fairness is typically studied from the perspective of predictions. Instead, here we investigate fairness from the perspective of recourse actions suggested to individuals to remedy an unfavourable classification. We propose two new fairness criteria at the group and individual level, which -- unlike prior work on equalising the average group-wise distance from the decision boundary --… ▽ More

    Submitted 6 March, 2022; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: AAAI 2022 extended camera-ready version with technical appendices. (9 pages main paper + references + appendices)

  32. arXiv:2010.00271  [pdf, other

    stat.ME stat.AP

    Kernel Two-Sample and Independence Tests for Non-Stationary Random Processes

    Authors: Felix Laumann, Julius von Kügelgen, Mauricio Barahona

    Abstract: Two-sample and independence tests with the kernel-based MMD and HSIC have shown remarkable results on i.i.d. data and stationary random processes. However, these statistics are not directly applicable to non-stationary random processes, a prevalent form of data in many scientific disciplines. In this work, we extend the application of MMD and HSIC to non-stationary settings by assuming access to i… ▽ More

    Submitted 4 January, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  33. arXiv:2006.06831  [pdf, other

    cs.LG cs.AI stat.ML

    Algorithmic recourse under imperfect causal knowledge: a probabilistic approach

    Authors: Amir-Hossein Karimi, Julius von Kügelgen, Bernhard Schölkopf, Isabel Valera

    Abstract: Recent work has discussed the limitations of counterfactual explanations to recommend actions for algorithmic recourse, and argued for the need of taking causal relationships between features into consideration. Unfortunately, in practice, the true underlying structural causal model is generally unknown. In this work, we first show that it is impossible to guarantee recourse without access to the… ▽ More

    Submitted 23 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Camera ready version (NeurIPS 2020 spotlight)

    Journal ref: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  34. Simpson's paradox in Covid-19 case fatality rates: a mediation analysis of age-related causal effects

    Authors: Julius von Kügelgen, Luigi Gresele, Bernhard Schölkopf

    Abstract: We point out an instantiation of Simpson's paradox in Covid-19 case fatality rates (CFRs): comparing a large-scale study from China (17 Feb) with early reports from Italy (9 Mar), we find that CFRs are lower in Italy for every age group, but higher overall. This phenomenon is explained by a stark difference in case demographic between the two countries. Using this as a motivating example, we intro… ▽ More

    Submitted 23 June, 2021; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: Journal version with full Appendix. The first two authors contributed equally to this work

    Journal ref: IEEE Transactions on Artificial Intelligence, vol. 2, no. 1, pp. 18-27, Feb. 2021

  35. arXiv:2004.12906  [pdf, other

    stat.ML cs.CV cs.LG

    Towards causal generative scene models via competition of experts

    Authors: Julius von Kügelgen, Ivan Ustyuzhaninov, Peter Gehler, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning how to model complex scenes in a modular way with recombinable components is a pre-requisite for higher-order reasoning and acting in the physical world. However, current generative models lack the ability to capture the inherently compositional and layered nature of visual scenes. While recent work has made progress towards unsupervised learning of object-based scene representations, mos… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: Presented at the ICLR 2020 workshop "Causal learning for decision making"

  36. arXiv:2004.09318  [pdf, other

    econ.EM stat.AP

    Non-linear interlinkages and key objectives amongst the Paris Agreement and the Sustainable Development Goals

    Authors: Felix Laumann, Julius von Kügelgen, Mauricio Barahona

    Abstract: The United Nations' ambitions to combat climate change and prosper human development are manifested in the Paris Agreement and the Sustainable Development Goals (SDGs), respectively. These are inherently inter-linked as progress towards some of these objectives may accelerate or hinder progress towards others. We investigate how these two agendas influence each other by defining networks of 18 nod… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

  37. arXiv:1910.03962  [pdf, other

    stat.ML cs.LG

    Optimal experimental design via Bayesian optimization: active causal structure learning for Gaussian process networks

    Authors: Julius von Kügelgen, Paul K Rubenstein, Bernhard Schölkopf, Adrian Weller

    Abstract: We study the problem of causal discovery through targeted interventions. Starting from few observational measurements, we follow a Bayesian active learning approach to perform those experiments which, in expectation with respect to the current model, are maximally informative about the underlying causal structure. Unlike previous work, we consider the setting of continuous random variables with no… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: Working paper. Accepted as a poster at the NeurIPS 2019 workshop, "Do the right thing": machine learning and causal inference for improved decision making. (6 pages + references + appendix)

  38. arXiv:1905.12081  [pdf, other

    stat.ML cs.LG stat.OT

    Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

    Authors: Julius von Kügelgen, Alexander Mey, Marco Loog, Bernhard Schölkopf

    Abstract: While the success of semi-supervised learning (SSL) is still not fully understood, Schölkopf et al. (2012) have established a link to the principle of independent causal mechanisms. They conclude that SSL should be impossible when predicting a target variable from its causes, but possible when predicting it from its effects. Since both these cases are somewhat restrictive, we extend their work by… ▽ More

    Submitted 24 June, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 36th Conference on Uncertainty in Artificial Intelligence (2020) (Previously presented at the NeurIPS 2019 workshop "Do the right thing": machine learning and causal inference for improved decision making, Vancouver, Canada.)

  39. arXiv:1807.07879  [pdf, other

    stat.ML cs.LG

    Semi-Generative Modelling: Covariate-Shift Adaptation with Cause and Effect Features

    Authors: Julius von Kügelgen, Alexander Mey, Marco Loog

    Abstract: Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this re… ▽ More

    Submitted 26 February, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha, Okinawa, Japan. (Camera-ready version)