Skip to main content

Showing 1–7 of 7 results for author: Diaz, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.14228  [pdf, other

    stat.ML cs.LG

    Recovering Latent Confounders from High-dimensional Proxy Variables

    Authors: Nathan Mankovich, Homer Durand, Emiliano Diaz, Gherardo Varando, Gustau Camps-Valls

    Abstract: Detecting latent confounders from proxy variables is an essential problem in causal effect estimation. Previous approaches are limited to low-dimensional proxies, sorted proxies, and binary treatments. We remove these assumptions and present a novel Proxy Confounder Factorization (PCF) framework for continuous treatment effect estimation when latent confounders manifest through high-dimensional, m… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2305.13341  [pdf, other

    physics.data-an cs.AI cs.LG stat.ME

    Discovering Causal Relations and Equations from Data

    Authors: Gustau Camps-Valls, Andreas Gerhardus, Urmi Ninad, Gherardo Varando, Georg Martius, Emili Balaguer-Ballester, Ricardo Vinuesa, Emiliano Diaz, Laure Zanna, Jakob Runge

    Abstract: Physics is a field of science that has traditionally used the scientific method to answer questions about why natural phenomena occur and to make testable models that explain the phenomena. Discovering equations, laws and principles that are invariant, robust and causal explanations of the world has been fundamental in physical sciences throughout the centuries. Discoveries emerge from observing t… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 137 pages

  3. arXiv:2012.04922  [pdf, other

    stat.ME cs.LG stat.ML

    Consistent regression of biophysical parameters with kernel methods

    Authors: Emiliano Díaz, Adrián Pérez-Suay, Valero Laparra, Gustau Camps-Valls

    Abstract: This paper introduces a novel statistical regression framework that allows the incorporation of consistency constraints. A linear and nonlinear (kernel-based) formulation are introduced, and both imply closed-form analytical solutions. The models exploit all the information from a set of drivers while being maximally independent of a set of auxiliary, protected variables. We successfully illustrat… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1710.05578

  4. arXiv:1704.01932  [pdf

    stat.AP

    Estimación de la inicial de referencia utilizando simulación

    Authors: Emiliano Díaz

    Abstract: The method proposed by Bernardo and Smith [2000] to approximate reference priors by simulation was analyzed with the objective of improving the procedure in order to obtain consistent estimators and to allow the estimation of asymptotic probability intervals. In this sense, the variance of Bernardo's estimator was derived and was used to construct probability intervals that permitted the expressio… ▽ More

    Submitted 1 April, 2017; originally announced April 2017.

    Comments: in Spanish

  5. arXiv:1704.00829  [pdf, other

    stat.AP cs.CV

    Online deforestation detection

    Authors: Emiliano Diaz

    Abstract: Deforestation detection using satellite images can make an important contribution to forest management. Current approaches can be broadly divided into those that compare two images taken at similar periods of the year and those that monitor changes by using multiple images taken during the growing season. The CMFDA algorithm described in Zhu et al. (2012) is an algorithm that builds on the latter… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

  6. arXiv:1704.00588  [pdf, other

    stat.AP

    Causality and surrogate variable analysis

    Authors: Emiliano Diaz

    Abstract: Gene expression depends on thousands of factors and we usually only have access to tens or hundreds of observations of gene expression levels meaning we are in a high-dimensional setting. Additionally we don't always observe or care about all the factors. However, many different gene expression levels depend on a set of common factors. By observing the joint variance of the gene expression levels… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

  7. arXiv:1704.00575  [pdf, other

    stat.AP cs.IT

    Sparse mean localization by information theory

    Authors: Emiliano Diaz

    Abstract: Sparse feature selection is necessary when we fit statistical models, we have access to a large group of features, don't know which are relevant, but assume that most are not. Alternatively, when the number of features is larger than the available data the model becomes over parametrized and the sparse feature selection task involves selecting the most informative variables for the model. When the… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.