Showing 1–18 of 18 results for author: Varando, G

  1. arXiv:2406.07378  [pdf, other]

    cs.AI cs.CL

    Large Language Models for Constrained-Based Causal Discovery

    Authors: Kai-Hendrik Cohrs, Gherardo Varando, Emiliano Diaz, Vasileios Sitokonstantinou, Gustau Camps-Valls

    Abstract: Causality is essential for understanding complex systems, such as the economy, the brain, and the climate. Constructing causal graphs often relies on either data-driven or expert-driven approaches, both fraught with challenges. The former methods, like the celebrated PC algorithm, face issues with data requirements and assumptions of causal sufficiency, while the latter demand substantial time and…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.18306  [pdf, other]

    stat.ML cs.LG

    Learning Staged Trees from Incomplete Data

    Authors: Jack Storror Carter, Manuele Leonelli, Eva Riccomagno, Gherardo Varando

    Abstract: Staged trees are probabilistic graphical models capable of representing any class of non-symmetric independence via a coloring of its vertices. Several structural learning routines have been defined and implemented to learn staged trees from data, under the frequentist or Bayesian paradigm. They assume a data set has been observed fully and, in practice, observations with missing entries are eithe…

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.18298  [pdf, other]

    stat.ML cs.LG

    Context-Specific Refinements of Bayesian Network Classifiers

    Authors: Manuele Leonelli, Gherardo Varando

    Abstract: Supervised classification is one of the most ubiquitous tasks in machine learning. Generative classifiers based on Bayesian networks are often used because of their interpretability and competitive accuracy. The widely used naive and TAN classifiers are specific instances of Bayesian network classifiers with a constrained underlying graph. This paper introduces novel classes of generative classifi…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2206.06970

  4. arXiv:2403.14228  [pdf, other]

    stat.ML cs.LG

    Recovering Latent Confounders from High-dimensional Proxy Variables

    Authors: Nathan Mankovich, Homer Durand, Emiliano Diaz, Gherardo Varando, Gustau Camps-Valls

    Abstract: Detecting latent confounders from proxy variables is an essential problem in causal effect estimation. Previous approaches are limited to low-dimensional proxies, sorted proxies, and binary treatments. We remove these assumptions and present a novel Proxy Confounder Factorization (PCF) framework for continuous treatment effect estimation when latent confounders manifest through high-dimensional, m…

    Submitted 21 March, 2024; originally announced March 2024.

  5. arXiv:2403.01865  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Improving generalisation via anchor multivariate analysis

    Authors: Homer Durand, Gherardo Varando, Nathan Mankovich, Gustau Camps-Valls

    Abstract: We introduce a causal regularisation extension to anchor regression (AR) for improved out-of-distribution (OOD) generalisation. We present anchor-compatible losses, aligning with the anchor framework to ensure robustness against distribution shifts. Various multivariate analysis (MVA) algorithms, such as (Orthonormalized) PLS, RRR, and MLR, fall within the anchor framework. We observe that simple…

    Submitted 11 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 21 pages, 15 figures

    MSC Class: 62Hxx

  6. arXiv:2402.13332  [pdf, other]

    cs.LG stat.ME

    Causal hybrid modeling with double machine learning

    Authors: Kai-Hendrik Cohrs, Gherardo Varando, Nuno Carvalhais, Markus Reichstein, Gustau Camps-Valls

    Abstract: Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach to estimating hybrid models via a causal inference framework, specifically employing Double Machine…

    Submitted 4 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  7. arXiv:2305.13341  [pdf, other]

    physics.data-an cs.AI cs.LG stat.ME

    Discovering Causal Relations and Equations from Data

    Authors: Gustau Camps-Valls, Andreas Gerhardus, Urmi Ninad, Gherardo Varando, Georg Martius, Emili Balaguer-Ballester, Ricardo Vinuesa, Emiliano Diaz, Laure Zanna, Jakob Runge

    Abstract: Physics is a field of science that has traditionally used the scientific method to answer questions about why natural phenomena occur and to make testable models that explain the phenomena. Discovering equations, laws and principles that are invariant, robust and causal explanations of the world has been fundamental in physical sciences throughout the centuries. Discoveries emerge from observing t…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 137 pages

  8. arXiv:2301.00629  [pdf, other]

    cs.AI stat.ML

    Learning and interpreting asymmetry-labeled DAGs: a case study on COVID-19 fear

    Authors: Manuele Leonelli, Gherardo Varando

    Abstract: Bayesian networks are widely used to learn and reason about the dependence structure of discrete variables. However, they are only capable of formally encoding symmetric conditional independence, which in practice is often too strict to hold. Asymmetry-labeled DAGs have been recently proposed to both extend the class of Bayesian networks by relaxing the symmetric assumption of independence and den…

    Submitted 2 January, 2023; originally announced January 2023.

  9. arXiv:2206.06970  [pdf, other]

    stat.ML cs.LG

    Highly Efficient Structural Learning of Sparse Staged Trees

    Authors: Manuele Leonelli, Gherardo Varando

    Abstract: Several structural learning algorithms for staged tree models, an asymmetric extension of Bayesian networks, have been defined. However, they do not scale efficiently as the number of variables considered increases. Here we introduce the first scalable structural learning algorithm for staged trees, which searches over a space of models where only a small number of dependencies can be imposed. A s…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.04390

  10. arXiv:2203.04390  [pdf, other]

    stat.ML cs.LG

    Structural Learning of Simple Staged Trees

    Authors: Manuele Leonelli, Gherardo Varando

    Abstract: Bayesian networks faithfully represent the symmetric conditional independences existing between the components of a random vector. Staged trees are an extension of Bayesian networks for categorical random vectors whose graph represents non-symmetric conditional independences via vertex coloring. However, since they are based on a tree representation of the sample space, the underlying graph become…

    Submitted 8 March, 2022; originally announced March 2022.

  11. arXiv:2108.01994  [pdf, other]

    stat.ML cs.AI cs.LG

    Staged trees and asymmetry-labeled DAGs

    Authors: Gherardo Varando, Federico Carli, Manuele Leonelli

    Abstract: Bayesian networks are a widely-used class of probabilistic graphical models capable of representing symmetric conditional independence between variables of interest using the topology of the underlying graph. For categorical variables, they can be seen as a special case of the much more general class of models called staged trees, which can represent any type of non-symmetric conditional independe…

    Submitted 5 October, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

  12. arXiv:2106.04416  [pdf, other]

    stat.ME cs.LG

    Context-Specific Causal Discovery for Categorical Data Using Staged Trees

    Authors: Manuele Leonelli, Gherardo Varando

    Abstract: Causal discovery algorithms aim at untangling complex causal relationships from data. Here, we study causal discovery and inference methods based on staged tree models, which can represent complex and asymmetric causal relationships between categorical variables. We provide a first graphical representation of the equivalence class of a staged tree, by looking only at a specific subset of its under…

    Submitted 28 February, 2023; v1 submitted 8 June, 2021; originally announced June 2021.

  13. arXiv:2012.13798  [pdf, other]

    cs.AI cs.LG stat.ML

    A new class of generative classifiers based on staged tree models

    Authors: Federico Carli, Manuele Leonelli, Gherardo Varando

    Abstract: Generative models for classification use the joint probability distribution of the class variable and the features to construct a decision rule. Among generative models, Bayesian networks and naive Bayes classifiers are the most commonly used and provide a clear graphical representation of the relationship among all variables. However, these have the disadvantage of highly restricting the type of…

    Submitted 4 August, 2022; v1 submitted 26 December, 2020; originally announced December 2020.

  14. arXiv:2006.03005  [pdf, other]

    stat.ML cs.LG stat.CO

    Learning DAGs without imposing acyclicity

    Authors: Gherardo Varando

    Abstract: We explore if it is possible to learn a directed acyclic graph (DAG) from data without imposing explicitly the acyclicity constraint. In particular, for Gaussian distributions, we frame structural learning as a sparse matrix factorization problem and we empirically show that solving an $\ell_1$-penalized optimization yields to good recovery of the true graph and, in general, to almost-DAG graphs.…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 16 pages, 5 figures

  15. Sparse Cholesky covariance parametrization for recovering latent structure in ordered data

    Authors: Irene Córdoba, Concha Bielza, Pedro Larrañaga, Gherardo Varando

    Abstract: The sparse Cholesky parametrization of the inverse covariance matrix can be interpreted as a Gaussian Bayesian network; however its counterpart, the covariance Cholesky factor, has received, with few notable exceptions, little attention so far, despite having a natural interpretation as a hidden variable model for ordered signal data. To fill this gap, in this paper we focus on arbitrary zero patt…

    Submitted 19 August, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 24 pages, 12 figures

    Journal ref: IEEE Access, 8: 154614-154624, 2020

  16. arXiv:2005.10483  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Graphical continuous Lyapunov models

    Authors: Gherardo Varando, Niels Richard Hansen

    Abstract: The linear Lyapunov equation of a covariance matrix parametrizes the equilibrium covariance matrix of a stochastic process. This parametrization can be interpreted as a new graphical model class, and we show how the model class behaves under marginalization and introduce a method for structure learning via $\ell_1$-penalized loss minimization. Our proposed method is demonstrated to outperform alte…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 10 pages, 5 figures

  17. arXiv:2002.09573  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Causal structure learning from time series: Large regression coefficients may predict causal links better in practice than small p-values

    Authors: Sebastian Weichwald, Martin E Jakobsen, Phillip B Mogensen, Lasse Petersen, Nikolaj Thams, Gherardo Varando

    Abstract: In this article, we describe the algorithms for causal structure learning from time series data that won the Causality 4 Climate competition at the Conference on Neural Information Processing Systems 2019 (NeurIPS). We examine how our combination of established ideas achieves competitive performance on semi-realistic and realistic time series data exhibiting common challenges in real-world Earth s…

    Submitted 2 September, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Journal ref: Proceedings of the NeurIPS 2019 Competition and Demonstration Track, Proceedings of Machine Learning Research, 123:27-36, 2020 (http://proceedings.mlr.press/v123/weichwald20a.html)

  18. arXiv:1811.04759  [pdf, ps, other]

    cs.LG stat.ML

    Markov Property in Generative Classifiers

    Authors: Gherardo Varando, Concha Bielza, Pedro Larrañaga, Eva Riccomagno

    Abstract: We show that, for generative classifiers, conditional independence corresponds to linear constraints for the induced discrimination functions. Discrimination functions of undirected Markov network classifiers can thus be characterized by sets of linear constraints. These constraints are represented by a second order finite difference operator over functions of categorical variables. As an applicat…

    Submitted 12 November, 2018; originally announced November 2018.