Skip to main content

Showing 1–50 of 87 results for author: Uhler, C

.
  1. arXiv:2406.01823  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Causal Discovery with Fewer Conditional Independence Tests

    Authors: Kirankumar Shiragur, Jiaqi Zhang, Caroline Uhler

    Abstract: Many questions in science center around the fundamental problem of understanding causal relationships. However, most constraint-based causal discovery algorithms, including the well-celebrated PC algorithm, often incur an exponential number of conditional independence (CI) tests, posing limitations in various applications. Addressing this, our work focuses on characterizing what can be learned abo… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.19225  [pdf, other

    cs.LG econ.EM stat.ME

    Synthetic Potential Outcomes for Mixtures of Treatment Effects

    Authors: Bijan Mazaheri, Chandler Squires, Caroline Uhler

    Abstract: Modern data analysis frequently relies on the use of large datasets, often constructed as amalgamations of diverse populations or data-sources. Heterogeneity across these smaller datasets constitutes two major challenges for causal inference: (1) the source of each sample can introduce latent confounding between treatment and effect, and (2) diverse populations may respond differently to the same… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2404.16907  [pdf, other

    q-bio.GN cs.LG q-bio.CB

    Season combinatorial intervention predictions with Salt & Peper

    Authors: Thomas Gaudelet, Alice Del Vecchio, Eli M Carrami, Juliana Cudini, Chantriolnt-Andreas Kapourani, Caroline Uhler, Lindsay Edwards

    Abstract: Interventions play a pivotal role in the study of complex biological systems. In drug discovery, genetic interventions (such as CRISPR base editing) have become central to both identifying potential therapeutic targets and understanding a drug's mechanism of action. With the advancement of CRISPR and the proliferation of genome-scale analyses such as transcriptomics, a new challenge is to navigate… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  4. arXiv:2403.05759  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Membership Testing in Markov Equivalence Classes via Independence Query Oracles

    Authors: Jiaqi Zhang, Kirankumar Shiragur, Caroline Uhler

    Abstract: Understanding causal relationships between variables is a fundamental problem with broad impact in numerous scientific fields. While extensive research has been dedicated to learning causal graphs from data, its complementary concept of testing causal relationships has remained largely unexplored. While learning involves the task of recovering the Markov equivalence class (MEC) of the underlying c… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2402.14777  [pdf, other

    stat.ML cs.LG

    Causal Imputation for Counterfactual SCMs: Bridging Graphs and Latent Factor Models

    Authors: Alvaro Ribot, Chandler Squires, Caroline Uhler

    Abstract: We consider the task of causal imputation, where we aim to predict the outcomes of some set of actions across a wide range of possible contexts. As a running example, we consider predicting how different drugs affect cells from different cell types. We study the index-only setting, where the actions and contexts are categorical variables with a finite number of possible values. Even in this simple… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 35 pages, 17 figures

  6. arXiv:2402.08229  [pdf, other

    cs.LG cs.DS stat.ME stat.ML

    Causal Discovery under Off-Target Interventions

    Authors: Davin Choo, Kirankumar Shiragur, Caroline Uhler

    Abstract: Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted into AISTATS 2024

  7. arXiv:2312.00718  [pdf, other

    cs.LG cs.AI q-bio.BM

    Removing Biases from Molecular Representations via Information Maximization

    Authors: Chenyu Wang, Sharut Gupta, Caroline Uhler, Tommi Jaakkola

    Abstract: High-throughput drug screening -- using cell imaging or gene expression measurements as readouts of drug effect -- is a critical tool in biotechnology to assess and understand the relationship between the chemical structure and biological activity of a drug. Since large-scale screens have to be divided into multiple experiments, a key difficulty is dealing with batch effects, which can introduce s… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  8. arXiv:2310.20075  [pdf, other

    cs.LG cs.DM stat.ME stat.ML

    Meek Separators and Their Applications in Targeted Causal Discovery

    Authors: Kirankumar Shiragur, Jiaqi Zhang, Caroline Uhler

    Abstract: Learning causal structures from interventional data is a fundamental problem with broad applications across various fields. While many previous works have focused on recovering the entire causal graph, in practice, there are scenarios where learning only part of the causal graph suffices. This is called $targeted$ causal discovery. In our work, we focus on two such well-motivated problems: subset… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  9. arXiv:2307.06250  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Identifiability Guarantees for Causal Disentanglement from Soft Interventions

    Authors: Jiaqi Zhang, Chandler Squires, Kristjan Greenewald, Akash Srivastava, Karthikeyan Shanmugam, Caroline Uhler

    Abstract: Causal disentanglement aims to uncover a representation of data using latent variables that are interrelated through a causal model. Such a representation is identifiable if the latent model that explains the data is unique. In this paper, we focus on the scenario where unpaired observational and interventional data are available, with each intervention changing the mechanism of a latent variable.… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  10. arXiv:2305.19884  [pdf, ps, other

    math.ST

    Positivity in Linear Gaussian Structural Equation Models

    Authors: Asad Lodhia, Jan-Christian Hütter, Caroline Uhler, Piotr Zwiernik

    Abstract: We study a notion of positivity of Gaussian directed acyclic graphical models corresponding to a non-negativity constraint on the coefficients of the associated structural equation model. We prove that this constraint is equivalent to the distribution being conditionally increasing in sequence (CIS), a well-known subclass of positively associated random variables. These distributions require knowl… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 22 pages, 5 figures

  11. arXiv:2302.00993  [pdf, other

    stat.ML cs.LG stat.ME

    Unpaired Multi-Domain Causal Representation Learning

    Authors: Nils Sturma, Chandler Squires, Mathias Drton, Caroline Uhler

    Abstract: The goal of causal representation learning is to find a representation of data that consists of causally related latent variables. We consider a setup where one has access to data from multiple domains that potentially share a causal representation. Crucially, observations in different domains are assumed to be unpaired, that is, we only observe the marginal distribution in each domain but not the… ▽ More

    Submitted 27 October, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  12. arXiv:2301.10814  [pdf, other

    q-bio.BM cs.LG

    Unsupervised Protein-Ligand Binding Energy Prediction via Neural Euler's Rotation Equation

    Authors: Wengong **, Siranush Sarkizova, Xun Chen, Nir Hacohen, Caroline Uhler

    Abstract: Protein-ligand binding prediction is a fundamental problem in AI-driven drug discovery. Prior work focused on supervised learning methods using a large set of binding affinity data for small molecules, but it is hard to apply the same strategy to other drug classes like antibodies as labelled data is limited. In this paper, we explore unsupervised approaches and reformulate binding energy predicti… ▽ More

    Submitted 12 December, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  13. arXiv:2211.16467  [pdf, other

    stat.ML cs.LG

    Linear Causal Disentanglement via Interventions

    Authors: Chandler Squires, Anna Seigal, Salil Bhate, Caroline Uhler

    Abstract: Causal disentanglement seeks a representation of data involving latent variables that relate to one another via a causal model. A representation is identifiable if both the latent model and the transformation from latent to observed variables are unique. In this paper, we study observed variables that are a linear transformation of a linear latent causal model. Data from interventions are necessar… ▽ More

    Submitted 11 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  14. arXiv:2211.00227  [pdf, other

    cs.LG

    Transfer Learning with Kernel Methods

    Authors: Adityanarayanan Radhakrishnan, Max Ruiz Luyten, Neha Prasad, Caroline Uhler

    Abstract: Transfer learning refers to the process of adapting a model trained on a source task to a target task. While kernel methods are conceptually and computationally simple machine learning models that are competitive on a variety of tasks, it has been unclear how to perform transfer learning for kernel methods. In this work, we propose a transfer learning framework for kernel methods by projecting and… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

  15. arXiv:2209.04744  [pdf, other

    cs.LG stat.ME

    Active Learning for Optimal Intervention Design in Causal Models

    Authors: Jiaqi Zhang, Louis Cammarata, Chandler Squires, Themistoklis P. Sapsis, Caroline Uhler

    Abstract: Sequential experimental design to discover interventions that achieve a desired outcome is a key problem in various domains including science, engineering and public policy. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effec… ▽ More

    Submitted 16 August, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

  16. arXiv:2207.01237  [pdf, other

    stat.ME

    Causal Structure Discovery between Clusters of Nodes Induced by Latent Factors

    Authors: Chandler Squires, Annie Yun, Eshaan Nichani, Raj Agrawal, Caroline Uhler

    Abstract: We consider the problem of learning the structure of a causal directed acyclic graph (DAG) model in the presence of latent variables. We define latent factor causal models (LFCMs) as a restriction on causal DAG models with latent variables, which are composed of clusters of observed variables that share the same latent parent and connections between these clusters given by edges pointing from the… ▽ More

    Submitted 5 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Causal Learning and Reasoning (CLeaR) 2022

  17. Causal Structure Learning: a Combinatorial Perspective

    Authors: Chandler Squires, Caroline Uhler

    Abstract: In this review, we discuss approaches for learning causal structure from data, also called causal discovery. In particular, we focus on approaches for learning directed acyclic graphs (DAGs) and various generalizations which allow for some variables to be unobserved in the available data. We devote special attention to two fundamental combinatorial aspects of causal structure learning. First, we d… ▽ More

    Submitted 19 December, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Foundations of Computational Mathematics, 2022

  18. Wide and Deep Neural Networks Achieve Optimality for Classification

    Authors: Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

    Abstract: While neural networks are used for classification tasks across domains, a long-standing open problem in machine learning is determining whether neural networks trained using standard procedures are optimal for classification, i.e., whether such models minimize the probability of misclassification for arbitrary data distributions. In this work, we identify and construct an explicit set of neural ne… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

  19. arXiv:2112.14872  [pdf, other

    math.OC cs.LG

    Local Quadratic Convergence of Stochastic Gradient Descent with Adaptive Step Size

    Authors: Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

    Abstract: Establishing a fast rate of convergence for optimization methods is crucial to their applicability in practice. With the increasing popularity of deep learning over the past decade, stochastic gradient descent and its adaptive variants (e.g. Adagrad, Adam, etc.) have become prominent methods of choice for machine learning practitioners. While a large number of works have demonstrated that these fi… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

    Comments: ICML 2021 Workshop on Beyond first-order methods in ML systems

  20. arXiv:2112.00816  [pdf, other

    stat.ME math.ST

    Maximum Likelihood Estimation for Brownian Motion Tree Models Based on One Sample

    Authors: Michael Truell, Jan-Christian Hütter, Chandler Squires, Piotr Zwiernik, Caroline Uhler

    Abstract: We study the problem of maximum likelihood estimation given one data sample ($n=1$) over Brownian Motion Tree Models (BMTMs), a class of Gaussian models on trees. BMTMs are often used as a null model in phylogenetics, where the one-sample regime is common. Specifically, we show that, almost surely, the one-sample BMTM maximum likelihood estimator (MLE) exists, is unique, and corresponds to a fully… ▽ More

    Submitted 24 November, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    MSC Class: 62F30; 62H12; 90C39; 62P10

  21. Simple, Fast, and Flexible Framework for Matrix Completion with Infinite Width Neural Networks

    Authors: Adityanarayanan Radhakrishnan, George Stefanakis, Mikhail Belkin, Caroline Uhler

    Abstract: Matrix completion problems arise in many applications including recommendation systems, computer vision, and genomics. Increasingly larger neural networks have been successful in many of these applications, but at considerable computational costs. Remarkably, taking the width of a neural network to infinity allows for improved computational performance. In this work, we develop an infinite width n… ▽ More

    Submitted 21 February, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

  22. arXiv:2107.01850  [pdf, other

    stat.ME cs.LG stat.ML

    Matching a Desired Causal State via Shift Interventions

    Authors: Jiaqi Zhang, Chandler Squires, Caroline Uhler

    Abstract: Transforming a causal system from a given initial state to a desired target state is an important task permeating multiple fields including control theory, biology, and materials science. In causal models, such transformations can be achieved by performing a set of interventions. In this paper, we consider the problem of identifying a shift intervention that matches the desired mean of a system th… ▽ More

    Submitted 20 October, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

  23. arXiv:2106.15456  [pdf, other

    cs.LG cs.AI

    A Mechanism for Producing Aligned Latent Spaces with Autoencoders

    Authors: Saachi Jain, Adityanarayanan Radhakrishnan, Caroline Uhler

    Abstract: Aligned latent spaces, where meaningful semantic shifts in the input space correspond to a translation in the embedding space, play an important role in the success of downstream tasks such as unsupervised clustering and data imputation. In this work, we prove that linear and nonlinear autoencoders produce aligned latent spaces by stretching along the left singular vectors of the data. We fully ch… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  24. arXiv:2105.14024  [pdf, other

    cs.LG

    Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning

    Authors: Scott Sussex, Andreas Krause, Caroline Uhler

    Abstract: Causal structure learning is a key problem in many domains. Causal structures can be learnt by performing experiments on the system of interest. We address the largely unexplored problem of designing a batch of experiments that each simultaneously intervene on multiple variables. While potentially more informative than the commonly considered single-variable interventions, selecting such intervent… ▽ More

    Submitted 24 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: 10 pages, 2 figures, appendix, to be published in 35th Conference on Neural Information Processing Systems (NeurIPS 2021), fixed typos and clarified wording

  25. arXiv:2102.07921  [pdf, other

    stat.ME

    The DeCAMFounder: Non-Linear Causal Discovery in the Presence of Hidden Variables

    Authors: Raj Agrawal, Chandler Squires, Neha Prasad, Caroline Uhler

    Abstract: Many real-world decision-making tasks require learning causal relationships between a set of variables. Traditional causal discovery methods, however, require that all variables are observed, which is often not feasible in practical scenarios. Without additional assumptions about the unobserved variables, it is not possible to recover any causal relationships from observational data. Fortunately,… ▽ More

    Submitted 25 June, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: To appear in Journal of the Royal Statistical Society Series B

  26. arXiv:2101.05336  [pdf, other

    q-bio.GN math.MG math.OC

    Identifying 3D Genome Organization in Diploid Organisms via Euclidean Distance Geometry

    Authors: Anastasiya Belyaeva, Kaie Kubjas, Lawrence J. Sun, Caroline Uhler

    Abstract: The spatial organization of the DNA in the cell nucleus plays an important role for gene regulation, DNA replication, and genomic integrity. Through the development of chromosome conformation capture experiments (such as 3C, 4C, Hi-C) it is now possible to obtain the contact frequencies of the DNA at the whole-genome level. In this paper, we study the problem of reconstructing the 3D organization… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  27. arXiv:2011.03610  [pdf, other

    stat.ME cs.LG stat.ML

    Efficient Permutation Discovery in Causal DAGs

    Authors: Chandler Squires, Joshua Amaniampong, Caroline Uhler

    Abstract: The problem of learning a directed acyclic graph (DAG) up to Markov equivalence is equivalent to the problem of finding a permutation of the variables that induces the sparsest graph. Without additional assumptions, this task is known to be NP-hard. Building on the minimum degree algorithm for sparse Cholesky decomposition, but utilizing DAG-specific problem structure, we introduce an efficient al… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  28. arXiv:2011.03127  [pdf, other

    stat.ME

    Causal Imputation via Synthetic Interventions

    Authors: Chandler Squires, Dennis Shen, Anish Agarwal, Devavrat Shah, Caroline Uhler

    Abstract: Consider the problem of determining the effect of a compound on a specific cell type. To answer this question, researchers traditionally need to run an experiment applying the drug of interest to that cell type. This approach is not scalable: given a large number of different actions (compounds) and a large number of different contexts (cell types), it is infeasible to run an experiment for every… ▽ More

    Submitted 11 June, 2023; v1 submitted 5 November, 2020; originally announced November 2020.

  29. arXiv:2010.09610  [pdf, other

    cs.LG stat.ML

    Increasing Depth Leads to U-Shaped Test Risk in Over-parameterized Convolutional Networks

    Authors: Eshaan Nichani, Adityanarayanan Radhakrishnan, Caroline Uhler

    Abstract: Recent works have demonstrated that increasing model capacity through width in over-parameterized neural networks leads to a decrease in test risk. For neural networks, however, model capacity can also be increased through depth, yet understanding the impact of increasing depth on test risk remains an open question. In this work, we demonstrate that the test risk of over-parameterized convolutiona… ▽ More

    Submitted 4 June, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 27 pages, 23 figures

  30. arXiv:2010.08120  [pdf, other

    stat.ML cs.LG cs.SI eess.SP

    Joint Inference of Multiple Graphs from Matrix Polynomials

    Authors: Madeline Navarro, Yuhao Wang, Antonio G. Marques, Caroline Uhler, Santiago Segarra

    Abstract: Inferring graph structure from observations on the nodes is an important and popular network science task. Departing from the more common inference of a single graph and motivated by social and biological networks, we study the problem of jointly inferring multiple graphs from the observation of signals at their nodes (graph signals), which are assumed to be stationary in the sought graphs. From a… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 13 pages, 2 figures

  31. arXiv:2009.08574  [pdf, other

    cs.LG stat.ML

    Linear Convergence of Generalized Mirror Descent with Time-Dependent Mirrors

    Authors: Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

    Abstract: The Polyak-Lojasiewicz (PL) inequality is a sufficient condition for establishing linear convergence of gradient descent, even in non-convex settings. While several recent works use a PL-based analysis to establish linear convergence of stochastic gradient descent methods, the question remains as to whether a similar analysis can be conducted for more general optimization methods. In this work, we… ▽ More

    Submitted 6 October, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

  32. arXiv:2007.12098  [pdf, other

    cs.LG stat.ML

    Optimal Transport using GANs for Lineage Tracing

    Authors: Neha Prasad, Karren Yang, Caroline Uhler

    Abstract: In this paper, we present Super-OT, a novel approach to computational lineage tracing that combines a supervised learning framework with optimal transport based on Generative Adversarial Networks (GANs). Unlike previous approaches to lineage tracing, Super-OT has the flexibility to integrate paired data. We benchmark Super-OT based on single-cell RNA-seq data against Waddington-OT, a popular appro… ▽ More

    Submitted 5 January, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 4 pages excluding references, 2 figures, 3 tables. Accepted at ICML 2020 Workshop on Computational Biology for Spotlight Presentation. Code can be found here: https://github.com/uhlerlab/superot

  33. arXiv:2006.13431  [pdf, other

    physics.comp-ph cs.LG nlin.CD

    Multiscale Simulations of Complex Systems by Learning their Effective Dynamics

    Authors: Pantelis R. Vlachas, Georgios Arampatzis, Caroline Uhler, Petros Koumoutsakos

    Abstract: Predictive simulations of complex systems are essential for applications ranging from weather forecasting to drug design. The veracity of these predictions hinges on their capacity to capture the effective system dynamics. Massively parallel simulations predict the system dynamics by resolving all spatiotemporal scales, often at a cost that prevents experimentation while their findings may not all… ▽ More

    Submitted 19 October, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 39 pages (Appendix included)

  34. arXiv:2006.08532  [pdf, other

    q-bio.BM cs.CV cs.LG eess.IV q-bio.QM

    Improved Conditional Flow Models for Molecule to Image Synthesis

    Authors: Karren Yang, Samuel Goldman, Wengong **, Alex Lu, Regina Barzilay, Tommi Jaakkola, Caroline Uhler

    Abstract: In this paper, we aim to synthesize cell microscopy images under different molecular interventions, motivated by practical applications to drug development. Building on the recent success of graph neural networks for learning molecular embeddings and flow-based models for image generation, we propose Mol2Image: a flow-based generative model for molecule to cell image synthesis. To generate cell fe… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    MSC Class: 92-08

  35. Causal Network Models of SARS-CoV-2 Expression and Aging to Identify Candidates for Drug Repurposing

    Authors: Anastasiya Belyaeva, Louis Cammarata, Adityanarayanan Radhakrishnan, Chandler Squires, Karren Dai Yang, G. V. Shivashankar, Caroline Uhler

    Abstract: Given the severity of the SARS-CoV-2 pandemic, a major challenge is to rapidly repurpose existing approved drugs for clinical interventions. While a number of data-driven and experimental approaches have been suggested in the context of drug repurposing, a platform that systematically integrates available transcriptomic, proteomic and structural data is missing. More importantly, given that SARS-C… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  36. arXiv:2003.06340  [pdf, other

    cs.LG stat.ML

    On Alignment in Deep Linear Neural Networks

    Authors: Adityanarayanan Radhakrishnan, Eshaan Nichani, Daniel Bernstein, Caroline Uhler

    Abstract: We study the properties of alignment, a form of implicit regularization, in linear neural networks under gradient descent. We define alignment for fully connected networks with multidimensional outputs and show that it is a natural extension of alignment in networks with 1-dimensional outputs as defined by Ji and Telgarsky, 2018. While in fully connected networks, there always exists a global mini… ▽ More

    Submitted 16 June, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  37. arXiv:2001.11940  [pdf, other

    stat.ML cs.LG

    Causal Structure Discovery from Distributions Arising from Mixtures of DAGs

    Authors: Basil Saeed, Snigdha Panigrahi, Caroline Uhler

    Abstract: We consider distributions arising from a mixture of causal models, where each model is represented by a directed acyclic graph (DAG). We provide a graphical representation of such mixture distributions and prove that this representation encodes the conditional independence relations of the mixture distribution. We then consider the problem of structure learning based on samples from such distribut… ▽ More

    Submitted 9 August, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  38. arXiv:1910.09014  [pdf, other

    math.ST cs.LG stat.ML

    Ordering-Based Causal Structure Learning in the Presence of Latent Variables

    Authors: Daniel Irving Bernstein, Basil Saeed, Chandler Squires, Caroline Uhler

    Abstract: We consider the task of learning a causal graph in the presence of latent confounders given i.i.d.~samples from the model. While current algorithms for causal structure discovery in the presence of latent confounders are constraint-based, we here propose a score-based approach. We prove that under assumptions weaker than faithfulness, any sparsest independence map (IMAP) of the distribution belong… ▽ More

    Submitted 24 March, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: To appear in AISTATS 2020

  39. arXiv:1910.09007  [pdf, other

    stat.ME

    Permutation-Based Causal Structure Learning with Unknown Intervention Targets

    Authors: Chandler Squires, Yuhao Wang, Caroline Uhler

    Abstract: We consider the problem of estimating causal DAG models from a mix of observational and interventional data, when the intervention targets are partially or completely unknown. This problem is highly relevant for example in genomics, since gene knockout technologies are known to have off-target effects. We characterize the interventional Markov equivalence class of DAGs that can be identified from… ▽ More

    Submitted 20 June, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

  40. Overparameterized Neural Networks Implement Associative Memory

    Authors: Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

    Abstract: Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. Empirically, we show that: (1) overparameterized autoencoders store train… ▽ More

    Submitted 9 September, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

  41. arXiv:1909.04222  [pdf, other

    stat.AP stat.ME

    Covariance Matrix Estimation under Total Positivity for Portfolio Selection

    Authors: Raj Agrawal, Uma Roy, Caroline Uhler

    Abstract: Selecting the optimal Markowitz porfolio depends on estimating the covariance matrix of the returns of $N$ assets from $T$ periods of historical data. Problematically, $N$ is typically of the same order as $T$, which makes the sample covariance matrix estimator perform poorly, both empirically and theoretically. While various other general purpose covariance matrix estimators have been introduced… ▽ More

    Submitted 27 December, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: 23 pages, 4 figures

  42. arXiv:1906.09537  [pdf, other

    math.ST math.AC

    Algebraic Statistics in Practice: Applications to Networks

    Authors: Marta Casanellas, Sonja Petrović, Caroline Uhler

    Abstract: Algebraic statistics uses tools from algebra (especially from multilinear algebra, commutative algebra and computational algebra), geometry and combinatorics to provide insight into knotty problems in mathematical statistics. In this survey we illustrate this on three problems related to networks, namely network models for relational data, causal structure discovery and phylogenetics. For each pro… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    MSC Class: 62-02; 13-02

  43. arXiv:1906.05159  [pdf, other

    stat.ME

    Learning High-dimensional Gaussian Graphical Models under Total Positivity without Adjustment of Tuning Parameters

    Authors: Yuhao Wang, Uma Roy, Caroline Uhler

    Abstract: We consider the problem of estimating an undirected Gaussian graphical model when the underlying distribution is multivariate totally positive of order 2 (MTP2), a strong form of positive dependence. Such distributions are relevant for example for portfolio selection, since assets are usually positively dependent. A large body of methods have been proposed for learning undirected graphical models… ▽ More

    Submitted 19 March, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

  44. arXiv:1906.00928  [pdf, other

    stat.ME stat.AP

    Anchored Causal Inference in the Presence of Measurement Error

    Authors: Basil Saeed, Anastasiya Belyaeva, Yuhao Wang, Caroline Uhler

    Abstract: We consider the problem of learning a causal graph in the presence of measurement error. This setting is for example common in genomics, where gene expression is corrupted through the measurement process. We develop a provably consistent procedure for estimating the causal structure in a linear Gaussian structural equation model from corrupted observations on its nodes, under a variety of measurem… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

  45. arXiv:1905.00516  [pdf, other

    stat.ME math.ST

    Total positivity in exponential families with application to binary variables

    Authors: Steffen Lauritzen, Caroline Uhler, Piotr Zwiernik

    Abstract: We study exponential families of distributions that are multivariate totally positive of order 2 (MTP2), show that these are convex exponential families, and derive conditions for existence of the MLE. Quadratic exponential familes of MTP2 distributions contain attractive Gaussian graphical models and ferromagnetic Ising models as special examples. We show that these are defined by intersecting th… ▽ More

    Submitted 26 July, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    MSC Class: 60E15; 62H99; 15B48

    Journal ref: Annals of Statistics 2021, Vol. 49, 1436-1459

  46. arXiv:1903.02054  [pdf, other

    stat.ML cs.AI cs.LG

    Size of Interventional Markov Equivalence Classes in Random DAG Models

    Authors: Dmitriy Katz, Karthikeyan Shanmugam, Chandler Squires, Caroline Uhler

    Abstract: Directed acyclic graph (DAG) models are popular for capturing causal relationships. From observational and interventional data, a DAG model can only be determined up to its \emph{interventional Markov equivalence class} (I-MEC). We investigate the size of MECs for random DAG models generated by uniformly sampling and ordering an Erdős-Rényi graph. For constant density, we show that the expected… ▽ More

    Submitted 5 March, 2019; originally announced March 2019.

    Comments: 19 pages, 5 figures. Accepted to AISTATS 2019

  47. arXiv:1902.10347  [pdf, other

    stat.ME

    ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery

    Authors: Raj Agrawal, Chandler Squires, Karren Yang, Karthik Shanmugam, Caroline Uhler

    Abstract: Determining the causal structure of a set of variables is critical for both scientific inquiry and decision-making. However, this is often challenging in practice due to limited interventional data. Given that randomized experiments are usually expensive to perform, we propose a general framework and theory based on optimal Bayesian experimental design to select experiments for targeted causal dis… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: To appear in AISTATS 2019

  48. arXiv:1902.09905  [pdf, other

    math.ST math.AG q-bio.PE

    Brownian motion tree models are toric

    Authors: Bernd Sturmfels, Caroline Uhler, Piotr Zwiernik

    Abstract: Felsenstein's classical model for Gaussian distributions on a phylogenetic tree is shown to be a toric variety in the space of concentration matrices. We present an exact semialgebraic characterization of this model, and we demonstrate how the toric structure leads to exact methods for maximum likelihood estimation. Our results also give new insights into the geometry of ultrametric matrices.

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 22 pages, 4 figures

  49. arXiv:1902.03515  [pdf, other

    cs.LG stat.ML

    Multi-Domain Translation by Learning Uncoupled Autoencoders

    Authors: Karren D. Yang, Caroline Uhler

    Abstract: Multi-domain translation seeks to learn a probabilistic coupling between marginal distributions that reflects the correspondence between different domains. We assume that data from different domains are generated from a shared latent representation based on a structural equation model. Under this assumption, we show that the problem of computing a probabilistic coupling between marginals is equiva… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

    MSC Class: 68T01

  50. arXiv:1810.11447  [pdf, other

    cs.LG stat.ML

    Scalable Unbalanced Optimal Transport using Generative Adversarial Networks

    Authors: Karren D. Yang, Caroline Uhler

    Abstract: Generative adversarial networks (GANs) are an expressive class of neural generative models with tremendous success in modeling high-dimensional continuous measures. In this paper, we present a scalable method for unbalanced optimal transport (OT) based on the generative-adversarial framework. We formulate unbalanced OT as a problem of simultaneously learning a transport map and a scaling factor th… ▽ More

    Submitted 3 August, 2019; v1 submitted 26 October, 2018; originally announced October 2018.

    MSC Class: 68T99