-
Hyperplane Representations of Interventional Characteristic Imset Polytopes
Authors:
Benjamin Hollering,
Joseph Johnson,
Liam Solus
Abstract:
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer…
▽ More
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer a cause-effect structure from data. Linear optimization methods typically require a hyperplane representation of the feasible region, which has proven difficult to compute for CIM polytopes despite continued efforts. We solve this problem for CIM polytopes that are the convex hull of imsets associated to DAGs whose underlying graph of adjacencies is a tree. Our methods use the theory of toric fiber products as well as the novel notion of interventional CIM polytopes. Our solution is obtained as a corollary of a more general result for interventional CIM polytopes. The identified hyperplanes are applied to yield a linear optimization-based causal discovery algorithm for learning polytree causal networks from a combination of observational and interventional data.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Colored Gaussian DAG models
Authors:
Tobias Boege,
Kaie Kubjas,
Pratik Misra,
Liam Solus
Abstract:
We study submodels of Gaussian DAG models defined by partial homogeneity constraints imposed on the model error variances and structural coefficients. We represent these models with colored DAGs and investigate their properties for use in statistical and causal inference. Local and global Markov properties are provided and shown to characterize the colored DAG model. Additional properties relevant…
▽ More
We study submodels of Gaussian DAG models defined by partial homogeneity constraints imposed on the model error variances and structural coefficients. We represent these models with colored DAGs and investigate their properties for use in statistical and causal inference. Local and global Markov properties are provided and shown to characterize the colored DAG model. Additional properties relevant to causal discovery are studied, including the existence and non-existence of faithful distributions and structural identifiability. Extending prior work of Peters and Bühlman and Wu and Drton, we prove structural identifiability under the assumption of homogeneous structural coefficients, as well as for a family of models with partially homogeneous structural coefficients. The latter models, termed BPEC-DAGs, capture additional causal insights by clustering the direct causes of each node into communities according to their effect on their common target. An analogue of the GES algorithm for learning BPEC-DAGs is given and evaluated on real and synthetic data. Regarding model geometry, we prove that these models are contractible, smooth, algebraic manifolds and compute their dimension. We also provide a proof of a conjecture of Sullivant which generalizes to colored DAG models, colored undirected graphical models and ancestral graph models. The proof yields a tool for the identification of Markov properties for rationally parameterized statistical models with globally, rationally identifiable parameters.
△ Less
Submitted 27 May, 2024; v1 submitted 5 April, 2024;
originally announced April 2024.
-
Scalable Structure Learning for Sparse Context-Specific Causal Systems
Authors:
Felix Leopoldo Rios,
Alex Markham,
Liam Solus
Abstract:
Several approaches to graphically representing context-specific relations among jointly distributed categorical variables have been proposed, along with structure learning algorithms. While existing optimization-based methods have limited scalability due to the large number of context-specific models, the constraint-based methods are more prone to error than even constraint-based DAG learning algo…
▽ More
Several approaches to graphically representing context-specific relations among jointly distributed categorical variables have been proposed, along with structure learning algorithms. While existing optimization-based methods have limited scalability due to the large number of context-specific models, the constraint-based methods are more prone to error than even constraint-based DAG learning algorithms since more relations must be tested. We present a hybrid algorithm for learning context-specific models that scales to hundreds of variables while testing no more constraints than standard DAG learning algorithms. Scalable learning is achieved through a combination of an order-based MCMC algorithm and sparsity assumptions analogous to those typically invoked for DAG models. To implement the method, we solve a special case of an open problem recently posed by Alon and Balogh. The method is shown to perform well on synthetic data and real world examples, in terms of both accuracy and scalability.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Neuro-Causal Factor Analysis
Authors:
Alex Markham,
Mingyu Liu,
Bryon Aragam,
Liam Solus
Abstract:
Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences. We revisit this classic method from the comparatively new perspective given by advancements in causal discovery and deep learnin…
▽ More
Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences. We revisit this classic method from the comparatively new perspective given by advancements in causal discovery and deep learning, introducing a framework for Neuro-Causal Factor Analysis (NCFA). Our approach is fully nonparametric: it identifies factors via latent causal discovery methods and then uses a variational autoencoder (VAE) that is constrained to abide by the Markov factorization of the distribution with respect to the learned graph. We evaluate NCFA on real and synthetic data sets, finding that it performs comparably to standard VAEs on data reconstruction tasks but with the advantages of sparser architecture, lower model complexity, and causal interpretability. Unlike traditional FA methods, our proposed NCFA method allows learning and reasoning about the latent factors underlying observed data from a justifiably causal perspective, even when the relations between factors and measurements are highly nonlinear.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Triangulations of cosmological polytopes
Authors:
Martina Juhnke-Kubitzke,
Liam Solus,
Lorenzo Venturello
Abstract:
A cosmological polytope is defined for a given Feynman diagram, and its canonical form may be used to compute the contribution of the Feynman diagram to the wavefunction of certain cosmological models. Given a subdivision of a polytope, its canonical form is obtained as a sum of the canonical forms of the facets of the subdivision. In this paper, we identify such formulas for the canonical form vi…
▽ More
A cosmological polytope is defined for a given Feynman diagram, and its canonical form may be used to compute the contribution of the Feynman diagram to the wavefunction of certain cosmological models. Given a subdivision of a polytope, its canonical form is obtained as a sum of the canonical forms of the facets of the subdivision. In this paper, we identify such formulas for the canonical form via algebraic techniques. It is shown that the toric ideal of every cosmological polytope admits a Gröbner basis with a squarefree initial ideal, yielding a regular unimodular triangulation of the polytope. In specific instances, including trees and cycles, we recover graphical characterizations of the facets of such triangulations that may be used to compute the desired canonical form. For paths and cycles, these characterizations admit simple enumeration. Hence, we obtain formulas for the normalized volume of these polytopes, extending previous observations of Kühne and Monin.
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
Combinatorial and algebraic perspectives on the marginal independence structure of Bayesian networks
Authors:
Danai Deligeorgaki,
Alex Markham,
Pratik Misra,
Liam Solus
Abstract:
We consider the problem of estimating the marginal independence structure of a Bayesian network from observational data, learning an undirected graph we call the unconditional dependence graph. We show that unconditional dependence graphs of Bayesian networks correspond to the graphs having equal independence and intersection numbers. Using this observation, a Gröbner basis for a toric ideal assoc…
▽ More
We consider the problem of estimating the marginal independence structure of a Bayesian network from observational data, learning an undirected graph we call the unconditional dependence graph. We show that unconditional dependence graphs of Bayesian networks correspond to the graphs having equal independence and intersection numbers. Using this observation, a Gröbner basis for a toric ideal associated to unconditional dependence graphs of Bayesian networks is given and then extended by additional binomial relations to connect the space of all such graphs. An MCMC method, called GrUES (Gröbner-based Unconditional Equivalence Search), is implemented based on the resulting moves and applied to synthetic Gaussian data. GrUES recovers the true marginal independence structure via a penalized maximum likelihood or MAP estimate at a higher rate than simple independence tests while also yielding an estimate of the posterior, for which the $20\%$ HPD credible sets include the true structure at a high rate for data-generating graphs with density at least $0.5$.
△ Less
Submitted 31 January, 2024; v1 submitted 3 October, 2022;
originally announced October 2022.
-
On the Edges of Characteristic Imset Polytopes
Authors:
Svante Linusson,
Petter Restadh,
Liam Solus
Abstract:
The edges of the characteristic imset polytope, $\operatorname{CIM}_p$, were recently shown to have strong connections to causal discovery as many algorithms could be interpreted as greedy restricted edge-walks, even though only a strict subset of the edges are known. To better understand the general edge structure of the polytope we describe the edge structure of faces with a clear combinatorial…
▽ More
The edges of the characteristic imset polytope, $\operatorname{CIM}_p$, were recently shown to have strong connections to causal discovery as many algorithms could be interpreted as greedy restricted edge-walks, even though only a strict subset of the edges are known. To better understand the general edge structure of the polytope we describe the edge structure of faces with a clear combinatorial interpretation: for any undirected graph $G$ we have the face $\operatorname{CIM}_G$, the convex hull of the characteristic imsets of DAGs with skeleton $G$. We give a full edge-description of $\operatorname{CIM}_G$ when $G$ is a tree, leading to interesting connections to other polytopes. In particular the well-studied stable set polytope can be recovered as a face of $\operatorname{CIM}_G$ when $G$ is a tree. Building on this connection we are also able to give a description of all edges of $\operatorname{CIM}_G$ when $G$ is a cycle, suggesting possible inroads for generalization. We then introduce an algorithm for learning directed trees from data, utilizing our newly discovered edges, that outperforms classical methods on simulated Gaussian data.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Toric Ideals of Characteristic Imsets via Quasi-Independence Gluing
Authors:
Benjamin Hollering,
Joseph Johnson,
Irem Portakal,
Liam Solus
Abstract:
Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber…
▽ More
Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber product, which we call a quasi-independence gluing, and show that under certain combinatorial homogeneity conditions, one can iteratively compute a Gröbner basis via lifting. For faces of the characteristic imset polytope associated to trees, we apply this technique to compute a Gröbner basis for the associated toric ideal. We end with a study of the characteristic ideal of the cycle and propose directions for future work.
△ Less
Submitted 19 September, 2022; v1 submitted 5 September, 2022;
originally announced September 2022.
-
A Transformational Characterization of Unconditionally Equivalent Bayesian Networks
Authors:
Alex Markham,
Danai Deligeorgaki,
Pratik Misra,
Liam Solus
Abstract:
We consider the problem of characterizing Bayesian networks up to unconditional equivalence, i.e., when directed acyclic graphs (DAGs) have the same set of unconditional $d$-separation statements. Each unconditional equivalence class (UEC) is uniquely represented with an undirected graph whose clique structure encodes the members of the class. Via this structure, we provide a transformational char…
▽ More
We consider the problem of characterizing Bayesian networks up to unconditional equivalence, i.e., when directed acyclic graphs (DAGs) have the same set of unconditional $d$-separation statements. Each unconditional equivalence class (UEC) is uniquely represented with an undirected graph whose clique structure encodes the members of the class. Via this structure, we provide a transformational characterization of unconditional equivalence; i.e., we show that two DAGs are in the same UEC if and only if one can be transformed into the other via a finite sequence of specified moves. We also extend this characterization to the essential graphs representing the Markov equivalence classes (MECs) in the UEC. UECs partition the space of MECs and are easily estimable from marginal independence tests. Thus, a characterization of unconditional equivalence has applications in methods that involve searching the space of MECs of Bayesian networks.
△ Less
Submitted 10 August, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
A new characterization of discrete decomposable models
Authors:
Eliana Duarte,
Liam Solus
Abstract:
Decomposable graphical models, also known as perfect DAG models, play a fundamental role in standard approaches to probabilistic inference via graph representations in modern machine learning and statistics. However, such models are limited by the assumption that the data-generating distribution does not entail strictly context-specific conditional independence relations. The family of staged tree…
▽ More
Decomposable graphical models, also known as perfect DAG models, play a fundamental role in standard approaches to probabilistic inference via graph representations in modern machine learning and statistics. However, such models are limited by the assumption that the data-generating distribution does not entail strictly context-specific conditional independence relations. The family of staged tree models generalizes DAG models so as to accommodate context-specific knowledge. We provide a new characterization of perfect discrete DAG models in terms of their staged tree representations. This characterization identifies the family of balanced staged trees as the natural generalization of discrete decomposable models to the context-specific setting.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
The Integer Decomposition Property and Weighted Projective Space Simplices
Authors:
Benjamin Braun,
Robert Davis,
Derek Hanely,
Morgan Lane,
Liam Solus
Abstract:
Reflexive lattice polytopes play a key role in combinatorics, algebraic geometry, physics, and other areas. One important class of lattice polytopes are lattice simplices defining weighted projective spaces. We investigate the question of when a reflexive weighted projective space simplex has the integer decomposition property. We provide a complete classification of reflexive weighted projective…
▽ More
Reflexive lattice polytopes play a key role in combinatorics, algebraic geometry, physics, and other areas. One important class of lattice polytopes are lattice simplices defining weighted projective spaces. We investigate the question of when a reflexive weighted projective space simplex has the integer decomposition property. We provide a complete classification of reflexive weighted projective space simplices having the integer decomposition property for the case when there are at most three distinct non-unit weights, and conjecture a general classification for an arbitrary number of distinct non-unit weights. Further, for any weighted projective space simplex and $m\geq 1$, we define the $m$-th reflexive stabilization, a reflexive weighted projective space simplex. We prove that when $m$ is $2$ or greater, reflexive stabilizations do not have the integer decomposition property. We also prove that the Ehrhart $h^\ast$-polynomial of any sufficiently large reflexive stabilization is not unimodal and has only $1$ and $2$ as coefficients. We use this construction to generate interesting examples of reflexive weighted projective space simplices that are near the boundary of both $h^*$-unimodality and the integer decomposition property.
△ Less
Submitted 21 November, 2022; v1 submitted 31 March, 2021;
originally announced March 2021.
-
Greedy Causal Discovery is Geometric
Authors:
Svante Linusson,
Petter Restadh,
Liam Solus
Abstract:
Finding a directed acyclic graph (DAG) that best encodes the conditional independence statements observable from data is a central question within causality. Algorithms that greedily transform one candidate DAG into another given a fixed set of moves have been particularly successful, for example the GES, GIES, and MMHC algorithms. In 2010, Studený, Hemmecke and Lindner introduced the characterist…
▽ More
Finding a directed acyclic graph (DAG) that best encodes the conditional independence statements observable from data is a central question within causality. Algorithms that greedily transform one candidate DAG into another given a fixed set of moves have been particularly successful, for example the GES, GIES, and MMHC algorithms. In 2010, Studený, Hemmecke and Lindner introduced the characteristic imset polytope, $\operatorname{CIM}_p$, whose vertices correspond to Markov equivalence classes, as a way of transforming causal discovery into a linear optimization problem. We show that the moves of the aforementioned algorithms are included within classes of edges of $\operatorname{CIM}_p$ and that restrictions placed on the skeleton of the candidate DAGs correspond to faces of $\operatorname{CIM}_p$. Thus, we observe that GES, GIES, and MMHC all have geometric realizations as greedy edge-walks along $\operatorname{CIM}_p$. Furthermore, the identified edges of $\operatorname{CIM}_p$ strictly generalize the moves of these algorithms. Exploiting this generalization, we introduce a greedy simplex-type algorithm called \emph{greedy CIM}, and a hybrid variant, \emph{skeletal greedy CIM}, that outperforms current competitors among hybrid and constraint-based algorithms.
△ Less
Submitted 1 September, 2022; v1 submitted 5 March, 2021;
originally announced March 2021.
-
Representation of Context-Specific Causal Models with Observational and Interventional Data
Authors:
Eliana Duarte,
Liam Solus
Abstract:
We consider the problem of representing causal models that encode context-specific information for discrete data using a proper subclass of staged tree models which we call CStrees. We show that the context-specific information encoded by a CStree can be equivalently expressed via a collection of DAGs. As not all staged tree models admit this property, CStrees are a subclass that provides a transp…
▽ More
We consider the problem of representing causal models that encode context-specific information for discrete data using a proper subclass of staged tree models which we call CStrees. We show that the context-specific information encoded by a CStree can be equivalently expressed via a collection of DAGs. As not all staged tree models admit this property, CStrees are a subclass that provides a transparent, intuitive and compact representation of context-specific causal information. We prove that CStrees admit a global Markov property which yields a graphical criterion for model equivalence generalizing that of Verma and Pearl for DAG models. These results extend to the general interventional model setting, making CStrees the first family of context-specific models admitting a characterization of interventional model equivalence. We also provide a closed-form formula for the maximum likelihood estimator of a CStree and use it to show that the Bayesian information criterion is a locally consistent score function for this model class. The performance of CStrees is analyzed on both simulated and real data, where we see that modeling with CStrees instead of general staged trees does not result in a significant loss of predictive accuracy, while affording DAG representations of context-specific causal information.
△ Less
Submitted 12 January, 2022; v1 submitted 22 January, 2021;
originally announced January 2021.
-
Algebraic geometry of discrete interventional models
Authors:
Eliana Duarte,
Liam Solus
Abstract:
We investigate the algebra and geometry of general interventions in discrete DAG models. To this end, we introduce a theory for modeling soft interventions in the more general family of staged tree models and develop the formalism to study these models as parametrized subvarieties of a product of probability simplices. We then consider the problem of finding their defining equations, and we derive…
▽ More
We investigate the algebra and geometry of general interventions in discrete DAG models. To this end, we introduce a theory for modeling soft interventions in the more general family of staged tree models and develop the formalism to study these models as parametrized subvarieties of a product of probability simplices. We then consider the problem of finding their defining equations, and we derive a combinatorial criterion for identifying interventional staged tree models for which the defining ideal is toric. We apply these results to the class of discrete interventional DAG models and establish a criteria to determine when these models are toric varieties.
△ Less
Submitted 16 October, 2023; v1 submitted 7 December, 2020;
originally announced December 2020.
-
Subdivisions of Shellable Complexes
Authors:
Max Hlavacek,
Liam Solus
Abstract:
In geometric, algebraic, and topological combinatorics, the unimodality of combinatorial generating polynomials is frequently studied. Unimodality follows when the polynomial is (real) stable, a property often deduced via the theory of interlacing polynomials. Many of the open questions on stability and unimodality of polynomials pertain to the enumeration of faces of cell complexes.
In this pap…
▽ More
In geometric, algebraic, and topological combinatorics, the unimodality of combinatorial generating polynomials is frequently studied. Unimodality follows when the polynomial is (real) stable, a property often deduced via the theory of interlacing polynomials. Many of the open questions on stability and unimodality of polynomials pertain to the enumeration of faces of cell complexes.
In this paper, we relate the theory of interlacing polynomials to the shellability of cell complexes. We first derive a sufficient condition for stability of the $h$-polynomial of a subdivision of a shellable complex. To apply it, we generalize the notion of reciprocal domains for convex embeddings of polytopes to abstract polytopes and use this generalization to define the family of stable shellings of a polytopal complex. We characterize the stable shellings of cubical and simplicial complexes, and apply this theory to answer a question of Brenti and Welker on barycentric subdivisions for the well-known cubical polytopes. We also give a positive solution to a problem of Mohammadi and Welker on edgewise subdivisions of cell complexes. We end by relating the family of stable line shellings to the combinatorics of hyperplane arrangements. We pose related questions, answers to which would resolve some long-standing problems while strengthening ties between the theory of interlacing polynomials and the combinatorics of hyperplane arrangements.
△ Less
Submitted 25 June, 2020; v1 submitted 16 March, 2020;
originally announced March 2020.
-
Some Algebraic Properties of Lecture Hall Polytopes
Authors:
Petter Brändén,
Liam Solus
Abstract:
In this note, we investigate some of the fundamental algebraic and geometric properties of $s$-lecture hall simplices and their generalizations. We show that all $s$-lecture hall order polytopes, which simultaneously generalize $s$-lecture hall simplices and order polytopes, satisfy a property which implies the integer decomposition property. This answers one conjecture of Hibi, Olsen and Tsuchiya…
▽ More
In this note, we investigate some of the fundamental algebraic and geometric properties of $s$-lecture hall simplices and their generalizations. We show that all $s$-lecture hall order polytopes, which simultaneously generalize $s$-lecture hall simplices and order polytopes, satisfy a property which implies the integer decomposition property. This answers one conjecture of Hibi, Olsen and Tsuchiya. By relating $s$-lecture hall polytopes to alcoved polytopes, we then use this property to show that families of $s$-lecture hall simplices admit a quadratic Gröbner basis with a square-free initial ideal. Consequently, we find that all $s$-lecture hall simplices for which the first order difference sequence of $s$ is a $0,1$-sequence have a regular and unimodular triangulation. This answers a second conjecture of Hibi, Olsen and Tsuchiya, and it gives a partial answer to a conjecture of Beck, Braun, Köppe, Savage and Zafeirakopoulos.
△ Less
Submitted 27 November, 2019;
originally announced November 2019.
-
Distributional Invariances and Interventional Markov Equivalence for Mixed Graph Models
Authors:
Liam Solus
Abstract:
The invariance properties of interventional distributions relative to the observational distribution, and how these properties allow us to refine Markov equivalence classes (MECs) of DAGs, is central to causal DAG discovery algorithms that use both interventional and observational data. Here, we show how the invariance properties of interventional DAG models, and the corresponding refinement of ME…
▽ More
The invariance properties of interventional distributions relative to the observational distribution, and how these properties allow us to refine Markov equivalence classes (MECs) of DAGs, is central to causal DAG discovery algorithms that use both interventional and observational data. Here, we show how the invariance properties of interventional DAG models, and the corresponding refinement of MECs into interventional MECs, can be generalized to mixed graphical models that allow for latent cofounders and selection variables. We first generalize interventional Markov equivalence to all formal independence models associated to loopless mixed graphs. For ancestral graphs, we prove the resulting interventional MECs admit a graphical characterization generalizing that of DAGs. We then define interventional distributions for acyclic directed mixed graph models, and prove that this generalization aligns with the graphical generalization of interventional Markov equivalence given for the formal independence models. This provides a framework for causal model discovery via observational and interventional data in the presence of latent confounders that applies even when the interventions are uncontrolled.
△ Less
Submitted 7 June, 2020; v1 submitted 22 November, 2019;
originally announced November 2019.
-
Symmetric decompositions and real-rootedness
Authors:
Petter Brändén,
Liam Solus
Abstract:
In algebraic, topological, and geometric combinatorics inequalities among the coefficients of combinatorial polynomials are frequently studied. Recently a notion called the alternatingly increasing property, which is stronger than unimodality, was introduced. In this paper, we relate the alternatingly increasing property to real-rootedness of the symmetric decomposition of a polynomial to develop…
▽ More
In algebraic, topological, and geometric combinatorics inequalities among the coefficients of combinatorial polynomials are frequently studied. Recently a notion called the alternatingly increasing property, which is stronger than unimodality, was introduced. In this paper, we relate the alternatingly increasing property to real-rootedness of the symmetric decomposition of a polynomial to develop a systematic approach for proving the alternatingly increasing property for several classes of polynomials. We apply our results to strengthen and generalize real-rootedness, unimodality, and alternatingly increasing results pertaining to colored Eulerian and derangement polynomials, Ehrhart $h^\ast$-polynomials for lattice zonotopes, $h$-polynomials of barycentric subdivisions of doubly Cohen-Macaulay level simplicial complexes, and certain local $h$-polynomials for subdivisions of simplices. In particular, we prove two conjectures of Athanasiadis.
△ Less
Submitted 12 January, 2020; v1 submitted 13 August, 2018;
originally announced August 2018.
-
Local $h^*$-Polynomials of Some Weighted Projective Spaces
Authors:
Liam Solus
Abstract:
There is currently a growing interest in understanding which lattice simplices have unimodal local $h^\ast$-polynomials (sometimes called box polynomials); specifically in light of their potential applications to unimodality questions for Ehrhart $h^\ast$-polynomials. In this note, we compute a general form for the local $h^\ast$-polynomial of a well-studied family of lattice simplices whose assoc…
▽ More
There is currently a growing interest in understanding which lattice simplices have unimodal local $h^\ast$-polynomials (sometimes called box polynomials); specifically in light of their potential applications to unimodality questions for Ehrhart $h^\ast$-polynomials. In this note, we compute a general form for the local $h^\ast$-polynomial of a well-studied family of lattice simplices whose associated toric varieties are weighted projective spaces. We then apply this formula to prove that certain such lattice simplices, whose combinatorics are naturally encoded using common systems of numeration, all have real-rooted, and thus unimodal, local $h^\ast$-polynomials. As a consequence, we discover a new restricted Eulerian polynomial that is real-rooted, symmetric, and admits intriguing number theoretic properties.
△ Less
Submitted 11 January, 2020; v1 submitted 21 July, 2018;
originally announced July 2018.
-
Derangements, Ehrhart Theory, and Local h-polynomials
Authors:
Nils Gustafsson,
Liam Solus
Abstract:
The Eulerian polynomials and derangement polynomials are two well-studied generating functions that frequently arise in combinatorics, algebra, and geometry. When one makes an appearance, the other often does so as well, and their corresponding generalizations are similarly linked. This is this case in the theory of subdivisions of simplicial complexes, where the Eulerian polynomial is an $h$-poly…
▽ More
The Eulerian polynomials and derangement polynomials are two well-studied generating functions that frequently arise in combinatorics, algebra, and geometry. When one makes an appearance, the other often does so as well, and their corresponding generalizations are similarly linked. This is this case in the theory of subdivisions of simplicial complexes, where the Eulerian polynomial is an $h$-polynomial and the derangement polynomial is its local $h$-polynomial. Separately, in Ehrhart theory the Eulerian polynomials are generalized by the $h^\ast$-polynomials of $s$-lecture hall simplices. Here, we show that derangement polynomials are analogously generalized by the box polynomials, or local $h^\ast$-polynomials, of the $s$-lecture hall simplices, and that these polynomials are all real-rooted. We then connect the two theories by showing that the local $h$-polynomials of common subdivisions in algebra and topology are realized as local $h^\ast$-polynomials of $s$-lecture hall simplices. We use this connection to address some open questions on real-rootedness and unimodality of generating polynomials, some from each side of the story.
△ Less
Submitted 13 April, 2020; v1 submitted 13 July, 2018;
originally announced July 2018.
-
On the Relationship Between Ehrhart Unimodality and Ehrhart Positivity
Authors:
Fu Liu,
Liam Solus
Abstract:
For a given lattice polytope, two fundamental problems within the field of Ehrhart theory are to (1) determine if its (Ehrhart) $h^\ast$-polynomial is unimodal and (2) to determine if its Ehrhart polynomial has only positive coefficients. The former property of a lattice polytope is known as Ehrhart unimodality and the latter property is known as Ehrhart positivity. These two properties are often…
▽ More
For a given lattice polytope, two fundamental problems within the field of Ehrhart theory are to (1) determine if its (Ehrhart) $h^\ast$-polynomial is unimodal and (2) to determine if its Ehrhart polynomial has only positive coefficients. The former property of a lattice polytope is known as Ehrhart unimodality and the latter property is known as Ehrhart positivity. These two properties are often simultaneously conjectured to hold for interesting families of lattice polytopes, yet they are typically studied in parallel. As to answer a question posed at the 2017 Introductory Workshop to the MSRI Semester on Geometric and Topological Combinatorics, the purpose of this note is to show that there is no general implication between these two properties in any dimension greater than two. To do so, we investigate these two properties for families of well-studied lattice polytopes, assessing one property where previously only the other had been considered. Consequently, new examples of each phenomena are developed, some of which provide an answer to an open problem in the literature. The well-studied families of lattice polytopes considered include zonotopes, matroid polytopes, simplices of weighted projective spaces, empty lattice simplices, smooth polytopes, and $s$-lecture hall simplices.
△ Less
Submitted 23 April, 2018;
originally announced April 2018.
-
Geometry of Discrete Copulas
Authors:
Elisa Perrone,
Liam Solus,
Caroline Uhler
Abstract:
Multivariate distributions are fundamental to modeling. Discrete copulas can be used to construct diverse multivariate joint distributions over random variables from estimated univariate marginals. The space of discrete copulas admits a representation as a convex polytope which can be exploited in entropy-copula methods relevant to hydrology and climatology. To allow for an extensive use of such m…
▽ More
Multivariate distributions are fundamental to modeling. Discrete copulas can be used to construct diverse multivariate joint distributions over random variables from estimated univariate marginals. The space of discrete copulas admits a representation as a convex polytope which can be exploited in entropy-copula methods relevant to hydrology and climatology. To allow for an extensive use of such methods in a wide range of applied fields, it is important to have a geometric representation of discrete copulas with desirable stochastic properties. In this paper, we show that the families of ultramodular discrete copulas and their generalization to convex discrete quasi-copulas admit representations as polytopes. We draw connections to the prominent Birkhoff polytope, alternating sign matrix polytope, and their most extensive generalizations in the discrete geometry literature. In doing so, we generalize some well-known results on these polytopes from both the statistics literature and the discrete geometry literature.
△ Less
Submitted 30 May, 2018; v1 submitted 19 February, 2018;
originally announced February 2018.
-
Counting Markov Equivalence Classes for DAG models on Trees
Authors:
Adityanarayanan Radhakrishnan,
Liam Solus,
Caroline Uhler
Abstract:
DAG models are statistical models satisfying a collection of conditional independence relations encoded by the nonedges of a directed acyclic graph (DAG) $\mathcal{G}$. Such models are used to model complex cause-effect systems across a variety of research fields. From observational data alone, a DAG model $\mathcal{G}$ is only recoverable up to Markov equivalence. Combinatorially, two DAGs are Ma…
▽ More
DAG models are statistical models satisfying a collection of conditional independence relations encoded by the nonedges of a directed acyclic graph (DAG) $\mathcal{G}$. Such models are used to model complex cause-effect systems across a variety of research fields. From observational data alone, a DAG model $\mathcal{G}$ is only recoverable up to Markov equivalence. Combinatorially, two DAGs are Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of the induced subDAGs $i\to j \leftarrow k$, known as immoralities. Hence it is of interest to study the number and size of Markov equivalence classes (MECs). In a recent paper, the authors introduced a pair of generating functions that enumerate the number of MECs on a fixed skeleton by number of immoralities and by class size, and they studied the complexity of computing these functions. In this paper, we lay the foundation for studying these generating functions by analyzing their structure for trees and other closely related graphs. We describe these polynomials for some important families of graphs including paths, stars, cycles, spider graphs, caterpillars, and complete binary trees. In doing so, we recover important connections to independence polynomials, and extend some classical identities that hold for Fibonacci numbers. We also provide tight lower and upper bounds for the number and size of MECs on any tree. Finally, we use computational methods to show that the number and distribution of high degree nodes in a triangle-free graph dictates the number and size of MECs.
△ Less
Submitted 17 June, 2017;
originally announced June 2017.
-
Simplices for Numeral Systems
Authors:
Liam Solus
Abstract:
The family of lattice simplices in $\mathbb{R}^n$ formed by the convex hull of the standard basis vectors together with a weakly decreasing vector of negative integers include simplices that play a central role in problems in enumerative algebraic geometry and mirror symmetry. From this perspective, it is useful to have formulae for their discrete volumes via Ehrhart $h^\ast$-polynomials. Here we…
▽ More
The family of lattice simplices in $\mathbb{R}^n$ formed by the convex hull of the standard basis vectors together with a weakly decreasing vector of negative integers include simplices that play a central role in problems in enumerative algebraic geometry and mirror symmetry. From this perspective, it is useful to have formulae for their discrete volumes via Ehrhart $h^\ast$-polynomials. Here we show, via an association with numeral systems, that such simplices yield $h^\ast$-polynomials with properties that are also desirable from a combinatorial perspective. First, we identify $n$-simplices in this family that associate via their normalized volume to the $n^{th}$ place value of a positional numeral system. We then observe that their $h^\ast$-polynomials admit combinatorial formula via descent-like statistics on the numeral strings encoding the nonnegative integers within the system. With these methods, we recover ubiquitous $h^\ast$-polynomials including the Eulerian polynomials and the binomial coefficients arising from the factoradic and binary numeral systems, respectively. We generalize the binary case to base-$r$ numeral systems for all $r\geq2$, and prove that the associated $h^\ast$-polynomials are real-rooted and unimodal for $r\geq2$ and $n\geq1$.
△ Less
Submitted 4 October, 2017; v1 submitted 1 June, 2017;
originally announced June 2017.
-
Permutation-based Causal Inference Algorithms with Interventions
Authors:
Yuhao Wang,
Liam Solus,
Karren Dai Yang,
Caroline Uhler
Abstract:
Learning directed acyclic graphs using both observational and interventional data is now a fundamentally important problem due to recent technological developments in genomics that generate such single-cell gene expression data at a very large scale. In order to utilize this data for learning gene regulatory networks, efficient and reliable causal inference algorithms are needed that can make use…
▽ More
Learning directed acyclic graphs using both observational and interventional data is now a fundamentally important problem due to recent technological developments in genomics that generate such single-cell gene expression data at a very large scale. In order to utilize this data for learning gene regulatory networks, efficient and reliable causal inference algorithms are needed that can make use of both observational and interventional data. In this paper, we present two algorithms of this type and prove that both are consistent under the faithfulness assumption. These algorithms are interventional adaptations of the Greedy SP algorithm and are the first algorithms using both observational and interventional data with consistency guarantees. Moreover, these algorithms have the advantage that they are nonparametric, which makes them useful also for analyzing non-Gaussian data. In this paper, we present these two algorithms and their consistency guarantees, and we analyze their performance on simulated data, protein signaling data, and single-cell gene expression data.
△ Less
Submitted 4 November, 2017; v1 submitted 29 May, 2017;
originally announced May 2017.
-
Consistency Guarantees for Greedy Permutation-Based Causal Inference Algorithms
Authors:
Liam Solus,
Yuhao Wang,
Caroline Uhler
Abstract:
Directed acyclic graphical models, or DAG models, are widely used to represent complex causal systems. Since the basic task of learning such a model from data is NP-hard, a standard approach is greedy search over the space of directed acyclic graphs or Markov equivalence classes of directed acyclic graphs. As the space of directed acyclic graphs on $p$ nodes and the associated space of Markov equi…
▽ More
Directed acyclic graphical models, or DAG models, are widely used to represent complex causal systems. Since the basic task of learning such a model from data is NP-hard, a standard approach is greedy search over the space of directed acyclic graphs or Markov equivalence classes of directed acyclic graphs. As the space of directed acyclic graphs on $p$ nodes and the associated space of Markov equivalence classes are both much larger than the space of permutations, it is desirable to consider permutation-based greedy searches. Here, we provide the first consistency guarantees, both uniform and high-dimensional, of a greedy permutation-based search. This search corresponds to a simplex-like algorithm operating over the edge-graph of a sub-polytope of the permutohedron, called a DAG associahedron. Every vertex in this polytope is associated with a directed acyclic graph, and hence with a collection of permutations that are consistent with the directed acyclic graph ordering. A walk is performed on the edges of the polytope maximizing the sparsity of the associated directed acyclic graphs. We show via simulated and real data that this permutation search is competitive with current approaches.
△ Less
Submitted 8 June, 2021; v1 submitted 12 February, 2017;
originally announced February 2017.
-
Monte Carlo goodness-of-fit tests for degree corrected and related stochastic blockmodels
Authors:
Vishesh Karwa,
Debdeep Pati,
Sonja Petrović,
Liam Solus,
Nikita Alexeev,
Mateja Raič,
Dane Wilburne,
Robert Williams,
Bowei Yan
Abstract:
We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goo…
▽ More
We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goodness-of-fit in log-linear models. We describe Markov bases and marginal polytopes of the variants of the stochastic blockmodel, and discuss how both facilitate the development of goodness-of-fit tests and understanding of model behavior.
The general testing methodology developed here extends to any finite mixture of log-linear models on discrete data, and as such is the first application of the algebraic statistics machinery for latent-variable models.
△ Less
Submitted 6 March, 2024; v1 submitted 18 December, 2016;
originally announced December 2016.
-
Counting Markov Equivalence Classes by Number of Immoralities
Authors:
Adityanarayanan Radhakrishnan,
Liam Solus,
Caroline Uhler
Abstract:
Two directed acyclic graphs (DAGs) are called Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of immoralities. Using observational data, a DAG model can only be determined up to Markov equivalence, and so it is desirable to understand the size and number of Markov equivalence classes (MECs) combinatorially. In this paper, we address…
▽ More
Two directed acyclic graphs (DAGs) are called Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of immoralities. Using observational data, a DAG model can only be determined up to Markov equivalence, and so it is desirable to understand the size and number of Markov equivalence classes (MECs) combinatorially. In this paper, we address this enumerative question using a pair of generating functions that encode the number and size of MECs on a skeleton $G$, and in doing so we connect this problem to classical problems in combinatorial optimization. The first is a graph polynomial that counts the number of MECs on $G$ by their number of immoralities. Using connections to the independent set problem, we show that computing a DAG on $G$ with the maximum possible number of immoralities is NP-hard. The second generating function counts the MECs on $G$ according to their size. Via computer enumeration, we show that this generating function is distinct for every connected graph on $p$ nodes for all $p\leq 10$.
△ Less
Submitted 17 June, 2017; v1 submitted 22 November, 2016;
originally announced November 2016.
-
Detecting the Integer Decomposition Property and Ehrhart Unimodality in Reflexive Simplices
Authors:
Benjamin Braun,
Robert Davis,
Liam Solus
Abstract:
A long-standing open conjecture in combinatorics asserts that a Gorenstein lattice polytope with the integer decomposition property (IDP) has a unimodal (Ehrhart) $h^\ast$-polynomial. This conjecture can be viewed as a strengthening of a previously disproved conjecture which stated that any Gorenstein lattice polytope has a unimodal $h^\ast$-polynomial. The first counterexamples to unimodality for…
▽ More
A long-standing open conjecture in combinatorics asserts that a Gorenstein lattice polytope with the integer decomposition property (IDP) has a unimodal (Ehrhart) $h^\ast$-polynomial. This conjecture can be viewed as a strengthening of a previously disproved conjecture which stated that any Gorenstein lattice polytope has a unimodal $h^\ast$-polynomial. The first counterexamples to unimodality for Gorenstein lattice polytopes were given in even dimensions greater than five by Musta{ţ}{ǎ} and Payne, and this was extended to all dimensions greater than five by Payne. While there exist numerous examples in support of the conjecture that IDP reflexives are $h^\ast$-unimodal, its validity has not yet been considered for families of reflexive lattice simplices that closely generalize Payne's counterexamples. The main purpose of this work is to prove that the former conjecture does indeed hold for a natural generalization of Payne's examples. The second purpose of this work is to extend this investigation to a broader class of lattice simplices, for which we present new results and open problems.
△ Less
Submitted 1 June, 2018; v1 submitted 4 August, 2016;
originally announced August 2016.
-
Extremal Positive Semidefinite Matrices for Graphs without $K_5$ Minors
Authors:
Liam Solus,
Caroline Uhler,
Ruriko Yoshida
Abstract:
For a graph $G$ with $p$ vertices the closed convex cone $\mathbb{S}^p_{\succeq0}(G)$ consists of all real positive semidefinite $p\times p$ matrices with zeros in the off-diagonal entries corresponding to nonedges of $G$. The extremal rays of this cone and their associated ranks have applications to matrix completion problems, maximum likelihood estimation in Gaussian graphical models in statisti…
▽ More
For a graph $G$ with $p$ vertices the closed convex cone $\mathbb{S}^p_{\succeq0}(G)$ consists of all real positive semidefinite $p\times p$ matrices with zeros in the off-diagonal entries corresponding to nonedges of $G$. The extremal rays of this cone and their associated ranks have applications to matrix completion problems, maximum likelihood estimation in Gaussian graphical models in statistics, and Gauss elimination for sparse matrices. For a graph $G$ without $K_5$ minors, we show that the normal vectors to the facets of the $(\pm1)$-cut polytope of $G$ specify the off-diagonal entries of extremal matrices in $\mathbb{S}^p_{\succeq0}(G)$. We also prove that the constant term of the linear equation of each facet-supporting hyperplane is the rank of its corresponding extremal matrix in $\mathbb{S}^p_{\succeq0}(G)$. Furthermore, we show that if $G$ is series-parallel then this gives a complete characterization of all possible extremal ranks of $\mathbb{S}^p_{\succeq0}(G)$, consequently solving the sparsity order problem for series-parallel graphs.
△ Less
Submitted 21 September, 2015; v1 submitted 22 June, 2015;
originally announced June 2015.
-
Facets of the r-stable n,k-hypersimplex
Authors:
Takayuki Hibi,
Liam Solus
Abstract:
Let $k, n$ and $r$ be positive integers with $k < n$ and $r\leq\lfloor\frac{n}{k}\rfloor$. We determine the facets of the $r$-stable $n,k$-hypersimplex. As a result, it turns out that the $r$-stable $n,k$-hypersimplex has exactly $2n$ facets for every $r<\lfloor\frac{n}{k}\rfloor$. We then utilize the equations of the facets to study when the $r$-stable hypersimplex is Gorenstein. For every $k>0$…
▽ More
Let $k, n$ and $r$ be positive integers with $k < n$ and $r\leq\lfloor\frac{n}{k}\rfloor$. We determine the facets of the $r$-stable $n,k$-hypersimplex. As a result, it turns out that the $r$-stable $n,k$-hypersimplex has exactly $2n$ facets for every $r<\lfloor\frac{n}{k}\rfloor$. We then utilize the equations of the facets to study when the $r$-stable hypersimplex is Gorenstein. For every $k>0$ we identify an infinite collection of Gorenstein $r$-stable hypersimplices, consequently expanding the collection of $r$-stable hypersimplices known to have unimodal Ehrhart $δ$-vectors.
△ Less
Submitted 21 September, 2015; v1 submitted 25 August, 2014;
originally announced August 2014.
-
Shellability, Ehrhart Theory, and $r$-stable Hypersimplices
Authors:
Benjamin Braun,
Liam Solus
Abstract:
Hypersimplices are well-studied objects in combinatorics, optimization, and representation theory. For each hypersimplex, we define a new family of subpolytopes, called r-stable hypersimplices, and show that a well-known regular unimodular triangulation of the hypersimplex restricts to a triangulation of each r-stable hypersimplex. For the case of the second hypersimplex defined by the two-element…
▽ More
Hypersimplices are well-studied objects in combinatorics, optimization, and representation theory. For each hypersimplex, we define a new family of subpolytopes, called r-stable hypersimplices, and show that a well-known regular unimodular triangulation of the hypersimplex restricts to a triangulation of each r-stable hypersimplex. For the case of the second hypersimplex defined by the two-element subsets of an n-set, we provide a shelling of this triangulation that sequentially shells each r-stable sub-hypersimplex. In this case, we utilize the shelling to compute the Ehrhart h*-polynomials of these polytopes, and the hypersimplex, via independence polynomials of graphs. For one such r-stable hypersimplex, this computation yields a connection to CR map**s of Lens spaces via Ehrhart-MacDonald reciprocity.
△ Less
Submitted 16 March, 2016; v1 submitted 20 August, 2014;
originally announced August 2014.
-
Borromean rays and hyperplanes
Authors:
Jack S. Calcut,
Jules R. Metcalf-Burton,
Taylor J. Richard,
Liam T. Solus
Abstract:
Three disjoint rays in euclidean 3-space form Borromean rays provided their union is knotted, but the union of any two components is unknotted. We construct infinitely many Borromean rays, uncountably many of which are pairwise inequivalent. We obtain uncountably many Borromean hyperplanes.
Three disjoint rays in euclidean 3-space form Borromean rays provided their union is knotted, but the union of any two components is unknotted. We construct infinitely many Borromean rays, uncountably many of which are pairwise inequivalent. We obtain uncountably many Borromean hyperplanes.
△ Less
Submitted 27 November, 2012;
originally announced November 2012.