-
Hyperplane Representations of Interventional Characteristic Imset Polytopes
Authors:
Benjamin Hollering,
Joseph Johnson,
Liam Solus
Abstract:
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal discovery, which aims to infer a cause-effect structure from data. Linear optimization methods typically require a hyperplane representation of the feasible region, which has proven difficult to compute for CIM polytopes despite continued efforts. We solve this problem for CIM polytopes that are the convex hull of imsets associated to DAGs whose underlying graph of adjacencies is a tree. Our methods use the theory of toric fiber products as well as the novel notion of interventional CIM polytopes. Our solution is obtained as a corollary of a more general result for interventional CIM polytopes. The identified hyperplanes are applied to yield a linear optimization-based causal discovery algorithm for learning polytree causal networks from a combination of observational and interventional data.
Submitted 29 April, 2024;
originally announced April 2024.
-
Faithlessness in Gaussian graphical models
Authors:
Mathias Drton,
Leonard Henckel,
Benjamin Hollering,
Pratik Misra
Abstract:
The implication problem for conditional independence (CI) asks whether the fact that a probability distribution obeys a given finite set of CI relations implies that a further CI statement also holds in this distribution. This problem has a long and fascinating history, culminating in positive results about implications now known as the semigraphoid axioms as well as impossibility results about a general finite characterization of CI implications. Motivated by violation of faithfulness assumptions in causal discovery, we study the implication problem in the special setting where the CI relations are obtained from a directed acyclic graphical (DAG) model along with one additional CI statement. Focusing on the Gaussian case, we give a complete characterization of when such an implication is graphical by using algebraic techniques. Moreover, prompted by the relevance of strong faithfulness in statistical guarantees for causal discovery algorithms, we give a graphical solution for an approximate CI implication problem, in which we ask whether small values of one additional partial correlation entail small values for yet a further partial correlation.
Submitted 24 April, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
The Pfaffian Structure of CFN Phylogenetic Networks
Authors:
Joseph Cummings,
Elizabeth Gross,
Benjamin Hollering,
Samuel Martin,
Ikenna Nometa
Abstract:
Algebraic techniques in phylogenetics have historically been successful at proving identifiability results and have also led to novel reconstruction algorithms. In this paper, we study the ideal of phylogenetic invariants of the Cavender-Farris-Neyman (CFN) model on a phylogenetic network with the goal of providing a description of the invariants which is useful for network inference. It was previously shown that to characterize the invariants of any level-1 network, it suffices to understand all sunlet networks, which are those consisting of a single cycle with a leaf adjacent to each cycle vertex. We show that the parameterization of an affine open patch of the CFN sunlet model, which intersects the probability simplex, factors through the space of skew-symmetric matrices via Pfaffians. We then show that this affine patch is isomorphic to a determinantal variety and give an explicit Gröbner basis for the associated ideal, which involves only polynomially many coordinates. Lastly, we show that sunlet networks with at least 6 leaves are identifiable using only these polynomials and run extensive simulations, which show that these polynomials can be used to accurately infer the correct network from DNA sequence data.
Submitted 12 December, 2023;
originally announced December 2023.
-
Computing Implicitizations of Multi-Graded Polynomial Maps
Authors:
Joseph Cummings,
Benjamin Hollering
Abstract:
In this paper, we focus on computing the kernel of a map of polynomial rings $\varphi$. This core problem in symbolic computation is known as implicitization. While there are extremely effective Gröbner basis methods used to solve this problem, these methods can become infeasible as the number of variables increases. In the case when the map $\varphi$ is multigraded, we consider an alternative approach. We demonstrate how to quickly compute a matrix of maximal rank for which $\varphi$ has a positive multigrading. Then in each graded component we compute the minimal generators of the kernel in that multidegree with linear algebra. We have implemented our techniques in Macaulay2 and show that our implementation can compute many generators of low degree in examples where Gröbner techniques have failed. This includes several examples coming from phylogenetics where even a complete list of quadrics and cubics was unknown. When the multigrading refines total degree, our algorithm is \emph{embarrassingly parallel} and a fully parallelized version of our algorithm will be forthcoming in OSCAR.
Submitted 13 November, 2023;
originally announced November 2023.
-
Identifiability of Homoscedastic Linear Structural Equation Models using Algebraic Matroids
Authors:
Mathias Drton,
Benjamin Hollering,
Jun Wu
Abstract:
We consider structural equation models (SEMs), in which every variable is a function of a subset of the other variables and a stochastic error. Each such SEM is naturally associated with a directed graph describing the relationships between variables. When the errors are homoscedastic, recent work has proposed methods for inferring the graph from observational data under the assumption that the graph is acyclic (i.e., the SEM is recursive). In this work, we study the setting of homoscedastic errors but allow the graph to be cyclic (i.e., the SEM to be non-recursive). Using an algebraic approach that compares matroids derived from the parameterizations of the models, we derive sufficient conditions for when two simple directed graphs generate different distributions generically. Based on these conditions, we exhibit subclasses of graphs that allow for directed cycles, yet are generically identifiable. We also conjecture a strengthening of our graphical criterion which can be used to distinguish many more non-complete graphs.
Submitted 3 August, 2023;
originally announced August 2023.
-
Identifiability of the Rooted Tree Parameter under the Cavender-Farris-Neyman Model with a Molecular Clock
Authors:
Jane Ivy Coons,
Benjamin Hollering
Abstract:
Identifiability of the discrete tree parameter is a key property for phylogenetic models since it is necessary for statistically consistent estimation of the tree from sequence data. Algebraic methods have proven to be very effective at showing that tree and network parameters of phylogenetic models are identifiable, especially when the underlying models are group-based. However, since group-based models are time-reversible, only the unrooted tree topology is identifiable and the location of the root is not. In this note we show that the rooted tree parameter of the Cavender-Farris-Neyman Model with a Molecular Clock is generically identifiable by using the invariants of the model which were characterized by Coons and Sullivant.
Submitted 21 March, 2023;
originally announced March 2023.
-
Toric Fiber Products in Geometric Modeling
Authors:
Eliana Duarte,
Benjamin Hollering,
Maximilian Wiesmann
Abstract:
An important challenge in Geometric Modeling is to classify polytopes with rational linear precision. Equivalently, in Algebraic Statistics one is interested in classifying scaled toric varieties, also known as discrete exponential families, for which the maximum likelihood estimator can be written in closed form as a rational function of the data (rational MLE). The toric fiber product (TFP) of statistical models is an operation to iteratively construct new models with rational MLE from lower dimensional ones. In this paper we introduce TFPs to the Geometric Modeling setting to construct polytopes with rational linear precision and give explicit formulae for their blending functions. A special case of the TFP is taking the Cartesian product of two polytopes and their blending functions. The Horn matrix of a statistical model with rational MLE is a key player in both Geometric Modeling and Algebraic Statistics; it has proved fruitful in providing a characterisation of those polytopes having the more restrictive property of strict linear precision. We give an explicit description of the Horn matrix of a TFP.
Submitted 12 June, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Combinatorics of Correlated Equilibria
Authors:
Marie-Charlotte Brandenburg,
Benjamin Hollering,
Irem Portakal
Abstract:
We study the correlated equilibrium polytope $P_G$ of a game $G$ from a combinatorial point of view. We introduce the region of full-dimensionality for this class of polytopes and prove that it is a semialgebraic set for any game. Using a stratification via oriented matroids, we propose a structured method for describing the possible combinatorial types of $P_G$, and show that for $(2 \times n)$-games, the algebraic boundary of the stratification is a union of coordinate hyperplanes and binomial hypersurfaces. Finally, we provide a computational proof that there exists a unique combinatorial type of maximal dimension for generic $(2 \times 3)$-games.
Submitted 27 February, 2024; v1 submitted 28 September, 2022;
originally announced September 2022.
-
Toric Ideals of Characteristic Imsets via Quasi-Independence Gluing
Authors:
Benjamin Hollering,
Joseph Johnson,
Irem Portakal,
Liam Solus
Abstract:
Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber product, which we call a quasi-independence gluing, and show that under certain combinatorial homogeneity conditions, one can iteratively compute a Gröbner basis via lifting. For faces of the characteristic imset polytope associated to trees, we apply this technique to compute a Gröbner basis for the associated toric ideal. We end with a study of the characteristic ideal of the cycle and propose directions for future work.
Submitted 19 September, 2022; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Markov Equivalence of Max-Linear Bayesian Networks
Authors:
Carlos Améndola,
Ben Hollering,
Seth Sullivant,
Ngoc Tran
Abstract:
Max-linear Bayesian networks have emerged as highly applicable models for causal inference via extreme value data. However, conditional independence (CI) for max-linear Bayesian networks behaves differently than for classical Gaussian Bayesian networks. We establish the parallel between the two theories via tropicalization, and establish the surprising result that the Markov equivalence classes for max-linear Bayesian networks coincide with the ones obtained by regular CI. Our paper opens up many problems at the intersection of extreme value statistics, causal inference and tropical geometry.
Submitted 15 June, 2021;
originally announced June 2021.
-
Invariants for level-1 phylogenetic networks under the Cavendar-Farris-Neyman Model
Authors:
Joseph Cummings,
Benjamin Hollering,
Christopher Manon
Abstract:
Phylogenetic networks can model more complicated evolutionary phenomena that trees fail to capture, such as horizontal gene transfer and hybridization. The same Markov models that are used to model evolution on trees can also be extended to networks, and similar questions, such as the identifiability of the network parameter or the invariants of the model, can be asked. In this paper we focus on finding the invariants of the Cavender-Farris-Neyman (CFN) model on level-1 phylogenetic networks. We do this by reducing the problem to finding invariants of sunlet networks, which are level-1 networks consisting of a single cycle with leaves at each vertex. We then determine all quadratic invariants in the sunlet network ideal, which we conjecture generate the full ideal.
Submitted 5 February, 2021;
originally announced February 2021.
-
Discrete Max-Linear Bayesian Networks
Authors:
Benjamin Hollering,
Seth Sullivant
Abstract:
Discrete max-linear Bayesian networks are directed graphical models specified by the same recursive structural equations as max-linear models but with discrete innovations. When all of the random variables in the model are binary, these models are isomorphic to the conjunctive Bayesian network (CBN) models of Beerenwinkel, Eriksson, and Sturmfels. Many of the techniques used to study CBN models can be extended to discrete max-linear models and similar results can be obtained. In particular, we extend the fact that CBN models are toric varieties after a linear change of coordinates to all discrete max-linear models.
Submitted 5 February, 2021;
originally announced February 2021.
-
Generalized Cut Polytopes for Binary Hierarchical Models
Authors:
Jane Ivy Coons,
Joseph Cummings,
Benjamin Hollering,
Aida Maraj
Abstract:
Marginal polytopes are important geometric objects that arise in statistics as the polytopes underlying hierarchical log-linear models. These polytopes can be used to answer geometric questions about these models, such as determining the existence of maximum likelihood estimates or the normality of the associated semigroup. Cut polytopes of graphs have been useful in analyzing binary marginal polytopes in the case where the simplicial complex underlying the hierarchical model is a graph. We introduce a generalized cut polytope that is isomorphic to the binary marginal polytope of an arbitrary simplicial complex via a generalized covariance map. This polytope is full dimensional in its ambient space and has a natural switching operation among its facets that can be used to deduce symmetries between the facets of the correlation and binary marginal polytopes. We find complete H-representations of the generalized cut polytope for some important families of simplicial complexes. We also compute the volume of these polytopes in some instances.
Submitted 31 July, 2020;
originally announced August 2020.
-
Identifiability in Phylogenetics using Algebraic Matroids
Authors:
Benjamin Hollering,
Seth Sullivant
Abstract:
Identifiability is a crucial property for a statistical model since distributions in the model uniquely determine the parameters that produce them. In phylogenetics, the identifiability of the tree parameter is of particular interest since it means that phylogenetic models can be used to infer evolutionary histories from data. In this paper we introduce a new computational strategy for proving the identifiability of discrete parameters in algebraic statistical models that uses algebraic matroids naturally associated to the models. We then use this algorithm to prove that the tree parameters are generically identifiable for 2-tree CFN and K3P mixtures. We also show that the $k$-cycle phylogenetic network parameter is identifiable under the K2P and K3P models.
Submitted 30 September, 2019;
originally announced September 2019.
-
Exchangeable and Sampling Consistent Distributions on Rooted Binary Trees
Authors:
Ben Hollering,
Seth Sullivant
Abstract:
We introduce a notion of finite sampling consistency for phylogenetic trees and show that the set of finitely sampling consistent and exchangeable distributions on n leaf phylogenetic trees is a polytope. We use this polytope to show that the set of all exchangeable and infinite sampling consistent distributions on 4 leaf phylogenetic trees is exactly Aldous' beta-splitting model and give a description of some of the vertices for the polytope of distributions on 5 leaves. We also introduce a new semialgebraic set of exchangeable and sampling consistent models we call the multinomial model and use it to characterize the set of exchangeable and sampling consistent distributions.
Submitted 5 March, 2019; v1 submitted 8 February, 2019;
originally announced February 2019.