Search | arXiv e-print repository

Hyperplane Representations of Interventional Characteristic Imset Polytopes

Authors: Benjamin Hollering, Joseph Johnson, Liam Solus

Abstract: Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer… ▽ More Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer a cause-effect structure from data. Linear optimization methods typically require a hyperplane representation of the feasible region, which has proven difficult to compute for CIM polytopes despite continued efforts. We solve this problem for CIM polytopes that are the convex hull of imsets associated to DAGs whose underlying graph of adjacencies is a tree. Our methods use the theory of toric fiber products as well as the novel notion of interventional CIM polytopes. Our solution is obtained as a corollary of a more general result for interventional CIM polytopes. The identified hyperplanes are applied to yield a linear optimization-based causal discovery algorithm for learning polytree causal networks from a combination of observational and interventional data. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 36 pages, 6 figures

MSC Class: 62E10; 62H22; 62D20; 62R01; 13P25; 13P10

arXiv:2404.04024 [pdf, other]

Colored Gaussian DAG models

Authors: Tobias Boege, Kaie Kubjas, Pratik Misra, Liam Solus

Abstract: We study submodels of Gaussian DAG models defined by partial homogeneity constraints imposed on the model error variances and structural coefficients. We represent these models with colored DAGs and investigate their properties for use in statistical and causal inference. Local and global Markov properties are provided and shown to characterize the colored DAG model. Additional properties relevant… ▽ More We study submodels of Gaussian DAG models defined by partial homogeneity constraints imposed on the model error variances and structural coefficients. We represent these models with colored DAGs and investigate their properties for use in statistical and causal inference. Local and global Markov properties are provided and shown to characterize the colored DAG model. Additional properties relevant to causal discovery are studied, including the existence and non-existence of faithful distributions and structural identifiability. Extending prior work of Peters and Bühlman and Wu and Drton, we prove structural identifiability under the assumption of homogeneous structural coefficients, as well as for a family of models with partially homogeneous structural coefficients. The latter models, termed BPEC-DAGs, capture additional causal insights by clustering the direct causes of each node into communities according to their effect on their common target. An analogue of the GES algorithm for learning BPEC-DAGs is given and evaluated on real and synthetic data. Regarding model geometry, we prove that these models are contractible, smooth, algebraic manifolds and compute their dimension. We also provide a proof of a conjecture of Sullivant which generalizes to colored DAG models, colored undirected graphical models and ancestral graph models. The proof yields a tool for the identification of Markov properties for rationally parameterized statistical models with globally, rationally identifiable parameters. △ Less

Submitted 27 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: 40 pages; v2: major revision

MSC Class: 62H22 (primary) 62R01; 62D20; 13C70; 13P25 (secondary)

arXiv:2402.07762 [pdf, other]

Scalable Structure Learning for Sparse Context-Specific Causal Systems

Authors: Felix Leopoldo Rios, Alex Markham, Liam Solus

Abstract: Several approaches to graphically representing context-specific relations among jointly distributed categorical variables have been proposed, along with structure learning algorithms. While existing optimization-based methods have limited scalability due to the large number of context-specific models, the constraint-based methods are more prone to error than even constraint-based DAG learning algo… ▽ More Several approaches to graphically representing context-specific relations among jointly distributed categorical variables have been proposed, along with structure learning algorithms. While existing optimization-based methods have limited scalability due to the large number of context-specific models, the constraint-based methods are more prone to error than even constraint-based DAG learning algorithms since more relations must be tested. We present a hybrid algorithm for learning context-specific models that scales to hundreds of variables while testing no more constraints than standard DAG learning algorithms. Scalable learning is achieved through a combination of an order-based MCMC algorithm and sparsity assumptions analogous to those typically invoked for DAG models. To implement the method, we solve a special case of an open problem recently posed by Alon and Balogh. The method is shown to perform well on synthetic data and real world examples, in terms of both accuracy and scalability. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 23 pages, 6 figures

arXiv:2305.19802 [pdf, other]

Neuro-Causal Factor Analysis

Authors: Alex Markham, Mingyu Liu, Bryon Aragam, Liam Solus

Abstract: Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences. We revisit this classic method from the comparatively new perspective given by advancements in causal discovery and deep learnin… ▽ More Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences. We revisit this classic method from the comparatively new perspective given by advancements in causal discovery and deep learning, introducing a framework for Neuro-Causal Factor Analysis (NCFA). Our approach is fully nonparametric: it identifies factors via latent causal discovery methods and then uses a variational autoencoder (VAE) that is constrained to abide by the Markov factorization of the distribution with respect to the learned graph. We evaluate NCFA on real and synthetic data sets, finding that it performs comparably to standard VAEs on data reconstruction tasks but with the advantages of sparser architecture, lower model complexity, and causal interpretability. Unlike traditional FA methods, our proposed NCFA method allows learning and reasoning about the latent factors underlying observed data from a justifiably causal perspective, even when the relations between factors and measurements are highly nonlinear. △ Less

Submitted 31 May, 2023; originally announced May 2023.

Comments: 23 pages, 13 figures

arXiv:2303.05876 [pdf, other]

Triangulations of cosmological polytopes

Authors: Martina Juhnke-Kubitzke, Liam Solus, Lorenzo Venturello

Abstract: A cosmological polytope is defined for a given Feynman diagram, and its canonical form may be used to compute the contribution of the Feynman diagram to the wavefunction of certain cosmological models. Given a subdivision of a polytope, its canonical form is obtained as a sum of the canonical forms of the facets of the subdivision. In this paper, we identify such formulas for the canonical form vi… ▽ More A cosmological polytope is defined for a given Feynman diagram, and its canonical form may be used to compute the contribution of the Feynman diagram to the wavefunction of certain cosmological models. Given a subdivision of a polytope, its canonical form is obtained as a sum of the canonical forms of the facets of the subdivision. In this paper, we identify such formulas for the canonical form via algebraic techniques. It is shown that the toric ideal of every cosmological polytope admits a Gröbner basis with a squarefree initial ideal, yielding a regular unimodular triangulation of the polytope. In specific instances, including trees and cycles, we recover graphical characterizations of the facets of such triangulations that may be used to compute the desired canonical form. For paths and cycles, these characterizations admit simple enumeration. Hence, we obtain formulas for the normalized volume of these polytopes, extending previous observations of Kühne and Monin. △ Less

Submitted 10 March, 2023; originally announced March 2023.

arXiv:2210.00822 [pdf, other]

doi 10.2140/astat.2023.14.233

Combinatorial and algebraic perspectives on the marginal independence structure of Bayesian networks

Authors: Danai Deligeorgaki, Alex Markham, Pratik Misra, Liam Solus

Abstract: We consider the problem of estimating the marginal independence structure of a Bayesian network from observational data, learning an undirected graph we call the unconditional dependence graph. We show that unconditional dependence graphs of Bayesian networks correspond to the graphs having equal independence and intersection numbers. Using this observation, a Gröbner basis for a toric ideal assoc… ▽ More We consider the problem of estimating the marginal independence structure of a Bayesian network from observational data, learning an undirected graph we call the unconditional dependence graph. We show that unconditional dependence graphs of Bayesian networks correspond to the graphs having equal independence and intersection numbers. Using this observation, a Gröbner basis for a toric ideal associated to unconditional dependence graphs of Bayesian networks is given and then extended by additional binomial relations to connect the space of all such graphs. An MCMC method, called GrUES (Gröbner-based Unconditional Equivalence Search), is implemented based on the resulting moves and applied to synthetic Gaussian data. GrUES recovers the true marginal independence structure via a penalized maximum likelihood or MAP estimate at a higher rate than simple independence tests while also yielding an estimate of the posterior, for which the $20\%$ HPD credible sets include the true structure at a high rate for data-generating graphs with density at least $0.5$. △ Less

Submitted 31 January, 2024; v1 submitted 3 October, 2022; originally announced October 2022.

Comments: 54 pages, 13 figures, 3 tables

MSC Class: 62R01; 62H22; 60J22; 13F65; 62D20; 05C75

Journal ref: Alg. Stat. 14 (2023) 233-286

arXiv:2209.07579 [pdf, other]

On the Edges of Characteristic Imset Polytopes

Authors: Svante Linusson, Petter Restadh, Liam Solus

Abstract: The edges of the characteristic imset polytope, $\operatorname{CIM}_p$, were recently shown to have strong connections to causal discovery as many algorithms could be interpreted as greedy restricted edge-walks, even though only a strict subset of the edges are known. To better understand the general edge structure of the polytope we describe the edge structure of faces with a clear combinatorial… ▽ More The edges of the characteristic imset polytope, $\operatorname{CIM}_p$, were recently shown to have strong connections to causal discovery as many algorithms could be interpreted as greedy restricted edge-walks, even though only a strict subset of the edges are known. To better understand the general edge structure of the polytope we describe the edge structure of faces with a clear combinatorial interpretation: for any undirected graph $G$ we have the face $\operatorname{CIM}_G$, the convex hull of the characteristic imsets of DAGs with skeleton $G$. We give a full edge-description of $\operatorname{CIM}_G$ when $G$ is a tree, leading to interesting connections to other polytopes. In particular the well-studied stable set polytope can be recovered as a face of $\operatorname{CIM}_G$ when $G$ is a tree. Building on this connection we are also able to give a description of all edges of $\operatorname{CIM}_G$ when $G$ is a cycle, suggesting possible inroads for generalization. We then introduce an algorithm for learning directed trees from data, utilizing our newly discovered edges, that outperforms classical methods on simulated Gaussian data. △ Less

Submitted 15 September, 2022; originally announced September 2022.

Comments: 36 pages, 5 figures

arXiv:2209.01834 [pdf, ps, other]

doi 10.2140/astat.2023.14.109

Toric Ideals of Characteristic Imsets via Quasi-Independence Gluing

Authors: Benjamin Hollering, Joseph Johnson, Irem Portakal, Liam Solus

Abstract: Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber… ▽ More Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber product, which we call a quasi-independence gluing, and show that under certain combinatorial homogeneity conditions, one can iteratively compute a Gröbner basis via lifting. For faces of the characteristic imset polytope associated to trees, we apply this technique to compute a Gröbner basis for the associated toric ideal. We end with a study of the characteristic ideal of the cycle and propose directions for future work. △ Less

Submitted 19 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

Comments: 19 pages, 7 figures

MSC Class: 62R01; 13P10; 62A09; 13P25

Journal ref: Alg. Stat. 14 (2023) 109-131

arXiv:2203.00521 [pdf, ps, other]

A Transformational Characterization of Unconditionally Equivalent Bayesian Networks

Authors: Alex Markham, Danai Deligeorgaki, Pratik Misra, Liam Solus

Abstract: We consider the problem of characterizing Bayesian networks up to unconditional equivalence, i.e., when directed acyclic graphs (DAGs) have the same set of unconditional $d$-separation statements. Each unconditional equivalence class (UEC) is uniquely represented with an undirected graph whose clique structure encodes the members of the class. Via this structure, we provide a transformational char… ▽ More We consider the problem of characterizing Bayesian networks up to unconditional equivalence, i.e., when directed acyclic graphs (DAGs) have the same set of unconditional $d$-separation statements. Each unconditional equivalence class (UEC) is uniquely represented with an undirected graph whose clique structure encodes the members of the class. Via this structure, we provide a transformational characterization of unconditional equivalence; i.e., we show that two DAGs are in the same UEC if and only if one can be transformed into the other via a finite sequence of specified moves. We also extend this characterization to the essential graphs representing the Markov equivalence classes (MECs) in the UEC. UECs partition the space of MECs and are easily estimable from marginal independence tests. Thus, a characterization of unconditional equivalence has applications in methods that involve searching the space of MECs of Bayesian networks. △ Less

Submitted 10 August, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: 12 pages, 1 figure. Accepted for publication at the 11th International Conference on Probabilistic Graphical Models (PGM 2022)

arXiv:2105.05907 [pdf, ps, other]

A new characterization of discrete decomposable models

Authors: Eliana Duarte, Liam Solus

Abstract: Decomposable graphical models, also known as perfect DAG models, play a fundamental role in standard approaches to probabilistic inference via graph representations in modern machine learning and statistics. However, such models are limited by the assumption that the data-generating distribution does not entail strictly context-specific conditional independence relations. The family of staged tree… ▽ More Decomposable graphical models, also known as perfect DAG models, play a fundamental role in standard approaches to probabilistic inference via graph representations in modern machine learning and statistics. However, such models are limited by the assumption that the data-generating distribution does not entail strictly context-specific conditional independence relations. The family of staged tree models generalizes DAG models so as to accommodate context-specific knowledge. We provide a new characterization of perfect discrete DAG models in terms of their staged tree representations. This characterization identifies the family of balanced staged trees as the natural generalization of discrete decomposable models to the context-specific setting. △ Less

Submitted 12 May, 2021; originally announced May 2021.

Comments: 13 pages, 2 figures. The original version of paper 2012.03593 as been broken into two papers. This is one of them

MSC Class: 62R01; 62A09; 13P10; 13P25

arXiv:2103.17156 [pdf, ps, other]

The Integer Decomposition Property and Weighted Projective Space Simplices

Authors: Benjamin Braun, Robert Davis, Derek Hanely, Morgan Lane, Liam Solus

Abstract: Reflexive lattice polytopes play a key role in combinatorics, algebraic geometry, physics, and other areas. One important class of lattice polytopes are lattice simplices defining weighted projective spaces. We investigate the question of when a reflexive weighted projective space simplex has the integer decomposition property. We provide a complete classification of reflexive weighted projective… ▽ More Reflexive lattice polytopes play a key role in combinatorics, algebraic geometry, physics, and other areas. One important class of lattice polytopes are lattice simplices defining weighted projective spaces. We investigate the question of when a reflexive weighted projective space simplex has the integer decomposition property. We provide a complete classification of reflexive weighted projective space simplices having the integer decomposition property for the case when there are at most three distinct non-unit weights, and conjecture a general classification for an arbitrary number of distinct non-unit weights. Further, for any weighted projective space simplex and $m\geq 1$, we define the $m$-th reflexive stabilization, a reflexive weighted projective space simplex. We prove that when $m$ is $2$ or greater, reflexive stabilizations do not have the integer decomposition property. We also prove that the Ehrhart $h^\ast$-polynomial of any sufficiently large reflexive stabilization is not unimodal and has only $1$ and $2$ as coefficients. We use this construction to generate interesting examples of reflexive weighted projective space simplices that are near the boundary of both $h^*$-unimodality and the integer decomposition property. △ Less

Submitted 21 November, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

arXiv:2103.03771 [pdf, other]

Greedy Causal Discovery is Geometric

Authors: Svante Linusson, Petter Restadh, Liam Solus

Abstract: Finding a directed acyclic graph (DAG) that best encodes the conditional independence statements observable from data is a central question within causality. Algorithms that greedily transform one candidate DAG into another given a fixed set of moves have been particularly successful, for example the GES, GIES, and MMHC algorithms. In 2010, Studený, Hemmecke and Lindner introduced the characterist… ▽ More Finding a directed acyclic graph (DAG) that best encodes the conditional independence statements observable from data is a central question within causality. Algorithms that greedily transform one candidate DAG into another given a fixed set of moves have been particularly successful, for example the GES, GIES, and MMHC algorithms. In 2010, Studený, Hemmecke and Lindner introduced the characteristic imset polytope, $\operatorname{CIM}_p$, whose vertices correspond to Markov equivalence classes, as a way of transforming causal discovery into a linear optimization problem. We show that the moves of the aforementioned algorithms are included within classes of edges of $\operatorname{CIM}_p$ and that restrictions placed on the skeleton of the candidate DAGs correspond to faces of $\operatorname{CIM}_p$. Thus, we observe that GES, GIES, and MMHC all have geometric realizations as greedy edge-walks along $\operatorname{CIM}_p$. Furthermore, the identified edges of $\operatorname{CIM}_p$ strictly generalize the moves of these algorithms. Exploiting this generalization, we introduce a greedy simplex-type algorithm called \emph{greedy CIM}, and a hybrid variant, \emph{skeletal greedy CIM}, that outperforms current competitors among hybrid and constraint-based algorithms. △ Less

Submitted 1 September, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

Comments: 21 pages

arXiv:2101.09271 [pdf, other]

Representation of Context-Specific Causal Models with Observational and Interventional Data

Authors: Eliana Duarte, Liam Solus

Abstract: We consider the problem of representing causal models that encode context-specific information for discrete data using a proper subclass of staged tree models which we call CStrees. We show that the context-specific information encoded by a CStree can be equivalently expressed via a collection of DAGs. As not all staged tree models admit this property, CStrees are a subclass that provides a transp… ▽ More We consider the problem of representing causal models that encode context-specific information for discrete data using a proper subclass of staged tree models which we call CStrees. We show that the context-specific information encoded by a CStree can be equivalently expressed via a collection of DAGs. As not all staged tree models admit this property, CStrees are a subclass that provides a transparent, intuitive and compact representation of context-specific causal information. We prove that CStrees admit a global Markov property which yields a graphical criterion for model equivalence generalizing that of Verma and Pearl for DAG models. These results extend to the general interventional model setting, making CStrees the first family of context-specific models admitting a characterization of interventional model equivalence. We also provide a closed-form formula for the maximum likelihood estimator of a CStree and use it to show that the Bayesian information criterion is a locally consistent score function for this model class. The performance of CStrees is analyzed on both simulated and real data, where we see that modeling with CStrees instead of general staged trees does not result in a significant loss of predictive accuracy, while affording DAG representations of context-specific causal information. △ Less

Submitted 12 January, 2022; v1 submitted 22 January, 2021; originally announced January 2021.

Comments: 28 pages, supplementary material 15 pages

MSC Class: 2020: 62E10; 62H22; 62D20; 62R01

arXiv:2012.03593 [pdf, ps, other]

Algebraic geometry of discrete interventional models

Authors: Eliana Duarte, Liam Solus

Abstract: We investigate the algebra and geometry of general interventions in discrete DAG models. To this end, we introduce a theory for modeling soft interventions in the more general family of staged tree models and develop the formalism to study these models as parametrized subvarieties of a product of probability simplices. We then consider the problem of finding their defining equations, and we derive… ▽ More We investigate the algebra and geometry of general interventions in discrete DAG models. To this end, we introduce a theory for modeling soft interventions in the more general family of staged tree models and develop the formalism to study these models as parametrized subvarieties of a product of probability simplices. We then consider the problem of finding their defining equations, and we derive a combinatorial criterion for identifying interventional staged tree models for which the defining ideal is toric. We apply these results to the class of discrete interventional DAG models and establish a criteria to determine when these models are toric varieties. △ Less

Submitted 16 October, 2023; v1 submitted 7 December, 2020; originally announced December 2020.

Comments: This version includes some minors revision to examples, intro and statistical/algebraic outlook

MSC Class: 62R01; 62A09; 13P10; 13P25

arXiv:2003.07328 [pdf, ps, other]

Subdivisions of Shellable Complexes

Authors: Max Hlavacek, Liam Solus

Abstract: In geometric, algebraic, and topological combinatorics, the unimodality of combinatorial generating polynomials is frequently studied. Unimodality follows when the polynomial is (real) stable, a property often deduced via the theory of interlacing polynomials. Many of the open questions on stability and unimodality of polynomials pertain to the enumeration of faces of cell complexes. In this pap… ▽ More In geometric, algebraic, and topological combinatorics, the unimodality of combinatorial generating polynomials is frequently studied. Unimodality follows when the polynomial is (real) stable, a property often deduced via the theory of interlacing polynomials. Many of the open questions on stability and unimodality of polynomials pertain to the enumeration of faces of cell complexes. In this paper, we relate the theory of interlacing polynomials to the shellability of cell complexes. We first derive a sufficient condition for stability of the $h$-polynomial of a subdivision of a shellable complex. To apply it, we generalize the notion of reciprocal domains for convex embeddings of polytopes to abstract polytopes and use this generalization to define the family of stable shellings of a polytopal complex. We characterize the stable shellings of cubical and simplicial complexes, and apply this theory to answer a question of Brenti and Welker on barycentric subdivisions for the well-known cubical polytopes. We also give a positive solution to a problem of Mohammadi and Welker on edgewise subdivisions of cell complexes. We end by relating the family of stable line shellings to the combinatorics of hyperplane arrangements. We pose related questions, answers to which would resolve some long-standing problems while strengthening ties between the theory of interlacing polynomials and the combinatorics of hyperplane arrangements. △ Less

Submitted 25 June, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

arXiv:1911.12459 [pdf, ps, other]

Some Algebraic Properties of Lecture Hall Polytopes

Authors: Petter Brändén, Liam Solus

Abstract: In this note, we investigate some of the fundamental algebraic and geometric properties of $s$-lecture hall simplices and their generalizations. We show that all $s$-lecture hall order polytopes, which simultaneously generalize $s$-lecture hall simplices and order polytopes, satisfy a property which implies the integer decomposition property. This answers one conjecture of Hibi, Olsen and Tsuchiya… ▽ More In this note, we investigate some of the fundamental algebraic and geometric properties of $s$-lecture hall simplices and their generalizations. We show that all $s$-lecture hall order polytopes, which simultaneously generalize $s$-lecture hall simplices and order polytopes, satisfy a property which implies the integer decomposition property. This answers one conjecture of Hibi, Olsen and Tsuchiya. By relating $s$-lecture hall polytopes to alcoved polytopes, we then use this property to show that families of $s$-lecture hall simplices admit a quadratic Gröbner basis with a square-free initial ideal. Consequently, we find that all $s$-lecture hall simplices for which the first order difference sequence of $s$ is a $0,1$-sequence have a regular and unimodular triangulation. This answers a second conjecture of Hibi, Olsen and Tsuchiya, and it gives a partial answer to a conjecture of Beck, Braun, Köppe, Savage and Zafeirakopoulos. △ Less

Submitted 27 November, 2019; originally announced November 2019.

arXiv:1911.10114 [pdf, ps, other]

Distributional Invariances and Interventional Markov Equivalence for Mixed Graph Models

Authors: Liam Solus

Abstract: The invariance properties of interventional distributions relative to the observational distribution, and how these properties allow us to refine Markov equivalence classes (MECs) of DAGs, is central to causal DAG discovery algorithms that use both interventional and observational data. Here, we show how the invariance properties of interventional DAG models, and the corresponding refinement of ME… ▽ More The invariance properties of interventional distributions relative to the observational distribution, and how these properties allow us to refine Markov equivalence classes (MECs) of DAGs, is central to causal DAG discovery algorithms that use both interventional and observational data. Here, we show how the invariance properties of interventional DAG models, and the corresponding refinement of MECs into interventional MECs, can be generalized to mixed graphical models that allow for latent cofounders and selection variables. We first generalize interventional Markov equivalence to all formal independence models associated to loopless mixed graphs. For ancestral graphs, we prove the resulting interventional MECs admit a graphical characterization generalizing that of DAGs. We then define interventional distributions for acyclic directed mixed graph models, and prove that this generalization aligns with the graphical generalization of interventional Markov equivalence given for the formal independence models. This provides a framework for causal model discovery via observational and interventional data in the presence of latent confounders that applies even when the interventions are uncontrolled. △ Less

Submitted 7 June, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

arXiv:1808.04141 [pdf, ps, other]

Symmetric decompositions and real-rootedness

Authors: Petter Brändén, Liam Solus

Abstract: In algebraic, topological, and geometric combinatorics inequalities among the coefficients of combinatorial polynomials are frequently studied. Recently a notion called the alternatingly increasing property, which is stronger than unimodality, was introduced. In this paper, we relate the alternatingly increasing property to real-rootedness of the symmetric decomposition of a polynomial to develop… ▽ More In algebraic, topological, and geometric combinatorics inequalities among the coefficients of combinatorial polynomials are frequently studied. Recently a notion called the alternatingly increasing property, which is stronger than unimodality, was introduced. In this paper, we relate the alternatingly increasing property to real-rootedness of the symmetric decomposition of a polynomial to develop a systematic approach for proving the alternatingly increasing property for several classes of polynomials. We apply our results to strengthen and generalize real-rootedness, unimodality, and alternatingly increasing results pertaining to colored Eulerian and derangement polynomials, Ehrhart $h^\ast$-polynomials for lattice zonotopes, $h$-polynomials of barycentric subdivisions of doubly Cohen-Macaulay level simplicial complexes, and certain local $h$-polynomials for subdivisions of simplices. In particular, we prove two conjectures of Athanasiadis. △ Less

Submitted 12 January, 2020; v1 submitted 13 August, 2018; originally announced August 2018.

arXiv:1807.08223 [pdf, ps, other]

Local $h^*$-Polynomials of Some Weighted Projective Spaces

Authors: Liam Solus

Abstract: There is currently a growing interest in understanding which lattice simplices have unimodal local $h^\ast$-polynomials (sometimes called box polynomials); specifically in light of their potential applications to unimodality questions for Ehrhart $h^\ast$-polynomials. In this note, we compute a general form for the local $h^\ast$-polynomial of a well-studied family of lattice simplices whose assoc… ▽ More There is currently a growing interest in understanding which lattice simplices have unimodal local $h^\ast$-polynomials (sometimes called box polynomials); specifically in light of their potential applications to unimodality questions for Ehrhart $h^\ast$-polynomials. In this note, we compute a general form for the local $h^\ast$-polynomial of a well-studied family of lattice simplices whose associated toric varieties are weighted projective spaces. We then apply this formula to prove that certain such lattice simplices, whose combinatorics are naturally encoded using common systems of numeration, all have real-rooted, and thus unimodal, local $h^\ast$-polynomials. As a consequence, we discover a new restricted Eulerian polynomial that is real-rooted, symmetric, and admits intriguing number theoretic properties. △ Less

Submitted 11 January, 2020; v1 submitted 21 July, 2018; originally announced July 2018.

Comments: 16 pages; In the proceedings of the 2018 Summer Workshop on Lattice Polytopes at Osaka University

arXiv:1807.05246 [pdf, ps, other]

Derangements, Ehrhart Theory, and Local h-polynomials

Authors: Nils Gustafsson, Liam Solus

Abstract: The Eulerian polynomials and derangement polynomials are two well-studied generating functions that frequently arise in combinatorics, algebra, and geometry. When one makes an appearance, the other often does so as well, and their corresponding generalizations are similarly linked. This is this case in the theory of subdivisions of simplicial complexes, where the Eulerian polynomial is an $h$-poly… ▽ More The Eulerian polynomials and derangement polynomials are two well-studied generating functions that frequently arise in combinatorics, algebra, and geometry. When one makes an appearance, the other often does so as well, and their corresponding generalizations are similarly linked. This is this case in the theory of subdivisions of simplicial complexes, where the Eulerian polynomial is an $h$-polynomial and the derangement polynomial is its local $h$-polynomial. Separately, in Ehrhart theory the Eulerian polynomials are generalized by the $h^\ast$-polynomials of $s$-lecture hall simplices. Here, we show that derangement polynomials are analogously generalized by the box polynomials, or local $h^\ast$-polynomials, of the $s$-lecture hall simplices, and that these polynomials are all real-rooted. We then connect the two theories by showing that the local $h$-polynomials of common subdivisions in algebra and topology are realized as local $h^\ast$-polynomials of $s$-lecture hall simplices. We use this connection to address some open questions on real-rootedness and unimodality of generating polynomials, some from each side of the story. △ Less

Submitted 13 April, 2020; v1 submitted 13 July, 2018; originally announced July 2018.

Comments: 29 pages, 2 figures

arXiv:1804.08258 [pdf, other]

On the Relationship Between Ehrhart Unimodality and Ehrhart Positivity

Authors: Fu Liu, Liam Solus

Abstract: For a given lattice polytope, two fundamental problems within the field of Ehrhart theory are to (1) determine if its (Ehrhart) $h^\ast$-polynomial is unimodal and (2) to determine if its Ehrhart polynomial has only positive coefficients. The former property of a lattice polytope is known as Ehrhart unimodality and the latter property is known as Ehrhart positivity. These two properties are often… ▽ More For a given lattice polytope, two fundamental problems within the field of Ehrhart theory are to (1) determine if its (Ehrhart) $h^\ast$-polynomial is unimodal and (2) to determine if its Ehrhart polynomial has only positive coefficients. The former property of a lattice polytope is known as Ehrhart unimodality and the latter property is known as Ehrhart positivity. These two properties are often simultaneously conjectured to hold for interesting families of lattice polytopes, yet they are typically studied in parallel. As to answer a question posed at the 2017 Introductory Workshop to the MSRI Semester on Geometric and Topological Combinatorics, the purpose of this note is to show that there is no general implication between these two properties in any dimension greater than two. To do so, we investigate these two properties for families of well-studied lattice polytopes, assessing one property where previously only the other had been considered. Consequently, new examples of each phenomena are developed, some of which provide an answer to an open problem in the literature. The well-studied families of lattice polytopes considered include zonotopes, matroid polytopes, simplices of weighted projective spaces, empty lattice simplices, smooth polytopes, and $s$-lecture hall simplices. △ Less

Submitted 23 April, 2018; originally announced April 2018.

Comments: 15 pages, 2 figures

MSC Class: 52B20; 05A20; 05A15

arXiv:1802.06969 [pdf, other]

Geometry of Discrete Copulas

Authors: Elisa Perrone, Liam Solus, Caroline Uhler

Abstract: Multivariate distributions are fundamental to modeling. Discrete copulas can be used to construct diverse multivariate joint distributions over random variables from estimated univariate marginals. The space of discrete copulas admits a representation as a convex polytope which can be exploited in entropy-copula methods relevant to hydrology and climatology. To allow for an extensive use of such m… ▽ More Multivariate distributions are fundamental to modeling. Discrete copulas can be used to construct diverse multivariate joint distributions over random variables from estimated univariate marginals. The space of discrete copulas admits a representation as a convex polytope which can be exploited in entropy-copula methods relevant to hydrology and climatology. To allow for an extensive use of such methods in a wide range of applied fields, it is important to have a geometric representation of discrete copulas with desirable stochastic properties. In this paper, we show that the families of ultramodular discrete copulas and their generalization to convex discrete quasi-copulas admit representations as polytopes. We draw connections to the prominent Birkhoff polytope, alternating sign matrix polytope, and their most extensive generalizations in the discrete geometry literature. In doing so, we generalize some well-known results on these polytopes from both the statistics literature and the discrete geometry literature. △ Less

Submitted 30 May, 2018; v1 submitted 19 February, 2018; originally announced February 2018.

arXiv:1706.06091 [pdf, other]

Counting Markov Equivalence Classes for DAG models on Trees

Authors: Adityanarayanan Radhakrishnan, Liam Solus, Caroline Uhler

Abstract: DAG models are statistical models satisfying a collection of conditional independence relations encoded by the nonedges of a directed acyclic graph (DAG) $\mathcal{G}$. Such models are used to model complex cause-effect systems across a variety of research fields. From observational data alone, a DAG model $\mathcal{G}$ is only recoverable up to Markov equivalence. Combinatorially, two DAGs are Ma… ▽ More DAG models are statistical models satisfying a collection of conditional independence relations encoded by the nonedges of a directed acyclic graph (DAG) $\mathcal{G}$. Such models are used to model complex cause-effect systems across a variety of research fields. From observational data alone, a DAG model $\mathcal{G}$ is only recoverable up to Markov equivalence. Combinatorially, two DAGs are Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of the induced subDAGs $i\to j \leftarrow k$, known as immoralities. Hence it is of interest to study the number and size of Markov equivalence classes (MECs). In a recent paper, the authors introduced a pair of generating functions that enumerate the number of MECs on a fixed skeleton by number of immoralities and by class size, and they studied the complexity of computing these functions. In this paper, we lay the foundation for studying these generating functions by analyzing their structure for trees and other closely related graphs. We describe these polynomials for some important families of graphs including paths, stars, cycles, spider graphs, caterpillars, and complete binary trees. In doing so, we recover important connections to independence polynomials, and extend some classical identities that hold for Fibonacci numbers. We also provide tight lower and upper bounds for the number and size of MECs on any tree. Finally, we use computational methods to show that the number and distribution of high degree nodes in a triangle-free graph dictates the number and size of MECs. △ Less

Submitted 17 June, 2017; originally announced June 2017.

Comments: 31 Pages, 25 Figures, 1 Table

arXiv:1706.00480 [pdf, ps, other]

Simplices for Numeral Systems

Authors: Liam Solus

Abstract: The family of lattice simplices in $\mathbb{R}^n$ formed by the convex hull of the standard basis vectors together with a weakly decreasing vector of negative integers include simplices that play a central role in problems in enumerative algebraic geometry and mirror symmetry. From this perspective, it is useful to have formulae for their discrete volumes via Ehrhart $h^\ast$-polynomials. Here we… ▽ More The family of lattice simplices in $\mathbb{R}^n$ formed by the convex hull of the standard basis vectors together with a weakly decreasing vector of negative integers include simplices that play a central role in problems in enumerative algebraic geometry and mirror symmetry. From this perspective, it is useful to have formulae for their discrete volumes via Ehrhart $h^\ast$-polynomials. Here we show, via an association with numeral systems, that such simplices yield $h^\ast$-polynomials with properties that are also desirable from a combinatorial perspective. First, we identify $n$-simplices in this family that associate via their normalized volume to the $n^{th}$ place value of a positional numeral system. We then observe that their $h^\ast$-polynomials admit combinatorial formula via descent-like statistics on the numeral strings encoding the nonnegative integers within the system. With these methods, we recover ubiquitous $h^\ast$-polynomials including the Eulerian polynomials and the binomial coefficients arising from the factoradic and binary numeral systems, respectively. We generalize the binary case to base-$r$ numeral systems for all $r\geq2$, and prove that the associated $h^\ast$-polynomials are real-rooted and unimodal for $r\geq2$ and $n\geq1$. △ Less

Submitted 4 October, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

Comments: 15 pages; To appear in Transactions of the AMS

arXiv:1705.10220 [pdf, other]

Permutation-based Causal Inference Algorithms with Interventions

Authors: Yuhao Wang, Liam Solus, Karren Dai Yang, Caroline Uhler

Abstract: Learning directed acyclic graphs using both observational and interventional data is now a fundamentally important problem due to recent technological developments in genomics that generate such single-cell gene expression data at a very large scale. In order to utilize this data for learning gene regulatory networks, efficient and reliable causal inference algorithms are needed that can make use… ▽ More Learning directed acyclic graphs using both observational and interventional data is now a fundamentally important problem due to recent technological developments in genomics that generate such single-cell gene expression data at a very large scale. In order to utilize this data for learning gene regulatory networks, efficient and reliable causal inference algorithms are needed that can make use of both observational and interventional data. In this paper, we present two algorithms of this type and prove that both are consistent under the faithfulness assumption. These algorithms are interventional adaptations of the Greedy SP algorithm and are the first algorithms using both observational and interventional data with consistency guarantees. Moreover, these algorithms have the advantage that they are nonparametric, which makes them useful also for analyzing non-Gaussian data. In this paper, we present these two algorithms and their consistency guarantees, and we analyze their performance on simulated data, protein signaling data, and single-cell gene expression data. △ Less

Submitted 4 November, 2017; v1 submitted 29 May, 2017; originally announced May 2017.

Journal ref: Advances in Neural Information Processing Systems, 2017

arXiv:1702.03530 [pdf, other]

Consistency Guarantees for Greedy Permutation-Based Causal Inference Algorithms

Authors: Liam Solus, Yuhao Wang, Caroline Uhler

Abstract: Directed acyclic graphical models, or DAG models, are widely used to represent complex causal systems. Since the basic task of learning such a model from data is NP-hard, a standard approach is greedy search over the space of directed acyclic graphs or Markov equivalence classes of directed acyclic graphs. As the space of directed acyclic graphs on $p$ nodes and the associated space of Markov equi… ▽ More Directed acyclic graphical models, or DAG models, are widely used to represent complex causal systems. Since the basic task of learning such a model from data is NP-hard, a standard approach is greedy search over the space of directed acyclic graphs or Markov equivalence classes of directed acyclic graphs. As the space of directed acyclic graphs on $p$ nodes and the associated space of Markov equivalence classes are both much larger than the space of permutations, it is desirable to consider permutation-based greedy searches. Here, we provide the first consistency guarantees, both uniform and high-dimensional, of a greedy permutation-based search. This search corresponds to a simplex-like algorithm operating over the edge-graph of a sub-polytope of the permutohedron, called a DAG associahedron. Every vertex in this polytope is associated with a directed acyclic graph, and hence with a collection of permutations that are consistent with the directed acyclic graph ordering. A walk is performed on the edges of the polytope maximizing the sparsity of the associated directed acyclic graphs. We show via simulated and real data that this permutation search is competitive with current approaches. △ Less

Submitted 8 June, 2021; v1 submitted 12 February, 2017; originally announced February 2017.

Comments: 37 pages, 15 Figures

arXiv:1612.06040 [pdf, other]

Monte Carlo goodness-of-fit tests for degree corrected and related stochastic blockmodels

Authors: Vishesh Karwa, Debdeep Pati, Sonja Petrović, Liam Solus, Nikita Alexeev, Mateja Raič, Dane Wilburne, Robert Williams, Bowei Yan

Abstract: We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goo… ▽ More We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goodness-of-fit in log-linear models. We describe Markov bases and marginal polytopes of the variants of the stochastic blockmodel, and discuss how both facilitate the development of goodness-of-fit tests and understanding of model behavior. The general testing methodology developed here extends to any finite mixture of log-linear models on discrete data, and as such is the first application of the algebraic statistics machinery for latent-variable models. △ Less

Submitted 6 March, 2024; v1 submitted 18 December, 2016; originally announced December 2016.

Comments: substantial revision from v3, updated simulations and theoretical discussions

MSC Class: 62R01; 05C82

Journal ref: Journal of the Royal Statistical Society Series B: Statistical Methodology, Volume 86, Issue 1, February 2024, Pages 90-121

arXiv:1611.07493 [pdf, other]

Counting Markov Equivalence Classes by Number of Immoralities

Authors: Adityanarayanan Radhakrishnan, Liam Solus, Caroline Uhler

Abstract: Two directed acyclic graphs (DAGs) are called Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of immoralities. Using observational data, a DAG model can only be determined up to Markov equivalence, and so it is desirable to understand the size and number of Markov equivalence classes (MECs) combinatorially. In this paper, we address… ▽ More Two directed acyclic graphs (DAGs) are called Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of immoralities. Using observational data, a DAG model can only be determined up to Markov equivalence, and so it is desirable to understand the size and number of Markov equivalence classes (MECs) combinatorially. In this paper, we address this enumerative question using a pair of generating functions that encode the number and size of MECs on a skeleton $G$, and in doing so we connect this problem to classical problems in combinatorial optimization. The first is a graph polynomial that counts the number of MECs on $G$ by their number of immoralities. Using connections to the independent set problem, we show that computing a DAG on $G$ with the maximum possible number of immoralities is NP-hard. The second generating function counts the MECs on $G$ according to their size. Via computer enumeration, we show that this generating function is distinct for every connected graph on $p$ nodes for all $p\leq 10$. △ Less

Submitted 17 June, 2017; v1 submitted 22 November, 2016; originally announced November 2016.

Comments: 10 pages, 3 Figures, 1 Table

arXiv:1608.01614 [pdf, ps, other]

Detecting the Integer Decomposition Property and Ehrhart Unimodality in Reflexive Simplices

Authors: Benjamin Braun, Robert Davis, Liam Solus

Abstract: A long-standing open conjecture in combinatorics asserts that a Gorenstein lattice polytope with the integer decomposition property (IDP) has a unimodal (Ehrhart) $h^\ast$-polynomial. This conjecture can be viewed as a strengthening of a previously disproved conjecture which stated that any Gorenstein lattice polytope has a unimodal $h^\ast$-polynomial. The first counterexamples to unimodality for… ▽ More A long-standing open conjecture in combinatorics asserts that a Gorenstein lattice polytope with the integer decomposition property (IDP) has a unimodal (Ehrhart) $h^\ast$-polynomial. This conjecture can be viewed as a strengthening of a previously disproved conjecture which stated that any Gorenstein lattice polytope has a unimodal $h^\ast$-polynomial. The first counterexamples to unimodality for Gorenstein lattice polytopes were given in even dimensions greater than five by Musta{ţ}{ǎ} and Payne, and this was extended to all dimensions greater than five by Payne. While there exist numerous examples in support of the conjecture that IDP reflexives are $h^\ast$-unimodal, its validity has not yet been considered for families of reflexive lattice simplices that closely generalize Payne's counterexamples. The main purpose of this work is to prove that the former conjecture does indeed hold for a natural generalization of Payne's examples. The second purpose of this work is to extend this investigation to a broader class of lattice simplices, for which we present new results and open problems. △ Less

Submitted 1 June, 2018; v1 submitted 4 August, 2016; originally announced August 2016.

Comments: 15 pages

MSC Class: 52B20; 05E40; 05A20; 05A15

arXiv:1506.06702 [pdf, other]

Extremal Positive Semidefinite Matrices for Graphs without $K_5$ Minors

Authors: Liam Solus, Caroline Uhler, Ruriko Yoshida

Abstract: For a graph $G$ with $p$ vertices the closed convex cone $\mathbb{S}^p_{\succeq0}(G)$ consists of all real positive semidefinite $p\times p$ matrices with zeros in the off-diagonal entries corresponding to nonedges of $G$. The extremal rays of this cone and their associated ranks have applications to matrix completion problems, maximum likelihood estimation in Gaussian graphical models in statisti… ▽ More For a graph $G$ with $p$ vertices the closed convex cone $\mathbb{S}^p_{\succeq0}(G)$ consists of all real positive semidefinite $p\times p$ matrices with zeros in the off-diagonal entries corresponding to nonedges of $G$. The extremal rays of this cone and their associated ranks have applications to matrix completion problems, maximum likelihood estimation in Gaussian graphical models in statistics, and Gauss elimination for sparse matrices. For a graph $G$ without $K_5$ minors, we show that the normal vectors to the facets of the $(\pm1)$-cut polytope of $G$ specify the off-diagonal entries of extremal matrices in $\mathbb{S}^p_{\succeq0}(G)$. We also prove that the constant term of the linear equation of each facet-supporting hyperplane is the rank of its corresponding extremal matrix in $\mathbb{S}^p_{\succeq0}(G)$. Furthermore, we show that if $G$ is series-parallel then this gives a complete characterization of all possible extremal ranks of $\mathbb{S}^p_{\succeq0}(G)$, consequently solving the sparsity order problem for series-parallel graphs. △ Less

Submitted 21 September, 2015; v1 submitted 22 June, 2015; originally announced June 2015.

Comments: 20 pages, 8 figures

arXiv:1408.5932 [pdf, ps, other]

Facets of the r-stable n,k-hypersimplex

Authors: Takayuki Hibi, Liam Solus

Abstract: Let $k, n$ and $r$ be positive integers with $k < n$ and $r\leq\lfloor\frac{n}{k}\rfloor$. We determine the facets of the $r$-stable $n,k$-hypersimplex. As a result, it turns out that the $r$-stable $n,k$-hypersimplex has exactly $2n$ facets for every $r<\lfloor\frac{n}{k}\rfloor$. We then utilize the equations of the facets to study when the $r$-stable hypersimplex is Gorenstein. For every $k>0$… ▽ More Let $k, n$ and $r$ be positive integers with $k < n$ and $r\leq\lfloor\frac{n}{k}\rfloor$. We determine the facets of the $r$-stable $n,k$-hypersimplex. As a result, it turns out that the $r$-stable $n,k$-hypersimplex has exactly $2n$ facets for every $r<\lfloor\frac{n}{k}\rfloor$. We then utilize the equations of the facets to study when the $r$-stable hypersimplex is Gorenstein. For every $k>0$ we identify an infinite collection of Gorenstein $r$-stable hypersimplices, consequently expanding the collection of $r$-stable hypersimplices known to have unimodal Ehrhart $δ$-vectors. △ Less

Submitted 21 September, 2015; v1 submitted 25 August, 2014; originally announced August 2014.

Comments: 12 pages, 2 figures

arXiv:1408.4713 [pdf, ps, other]

Shellability, Ehrhart Theory, and $r$-stable Hypersimplices

Authors: Benjamin Braun, Liam Solus

Abstract: Hypersimplices are well-studied objects in combinatorics, optimization, and representation theory. For each hypersimplex, we define a new family of subpolytopes, called r-stable hypersimplices, and show that a well-known regular unimodular triangulation of the hypersimplex restricts to a triangulation of each r-stable hypersimplex. For the case of the second hypersimplex defined by the two-element… ▽ More Hypersimplices are well-studied objects in combinatorics, optimization, and representation theory. For each hypersimplex, we define a new family of subpolytopes, called r-stable hypersimplices, and show that a well-known regular unimodular triangulation of the hypersimplex restricts to a triangulation of each r-stable hypersimplex. For the case of the second hypersimplex defined by the two-element subsets of an n-set, we provide a shelling of this triangulation that sequentially shells each r-stable sub-hypersimplex. In this case, we utilize the shelling to compute the Ehrhart h*-polynomials of these polytopes, and the hypersimplex, via independence polynomials of graphs. For one such r-stable hypersimplex, this computation yields a connection to CR map**s of Lens spaces via Ehrhart-MacDonald reciprocity. △ Less

Submitted 16 March, 2016; v1 submitted 20 August, 2014; originally announced August 2014.

Comments: 35 pages, 23 figures

arXiv:1211.6465 [pdf, ps, other]

Borromean rays and hyperplanes

Authors: Jack S. Calcut, Jules R. Metcalf-Burton, Taylor J. Richard, Liam T. Solus

Abstract: Three disjoint rays in euclidean 3-space form Borromean rays provided their union is knotted, but the union of any two components is unknotted. We construct infinitely many Borromean rays, uncountably many of which are pairwise inequivalent. We obtain uncountably many Borromean hyperplanes. Three disjoint rays in euclidean 3-space form Borromean rays provided their union is knotted, but the union of any two components is unknotted. We construct infinitely many Borromean rays, uncountably many of which are pairwise inequivalent. We obtain uncountably many Borromean hyperplanes. △ Less

Submitted 27 November, 2012; originally announced November 2012.

Comments: 41 pages, 30 figures (19 with captions, 11 inline)

MSC Class: 57M30; 57R52; 57M05

Showing 1–33 of 33 results for author: Solus, L