-
New directions in algebraic statistics: Three challenges from 2023
Authors:
Yulia Alexandr,
Miles Bakenhus,
Mark Curiel,
Sameer K. Deshpande,
Elizabeth Gross,
Yuqi Gu,
Max Hill,
Joseph Johnson,
Bryson Kagy,
Vishesh Karwa,
Jiayi Li,
Hanbaek Lyu,
Sonja Petrović,
Jose Israel Rodriguez
Abstract:
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally…
▽ More
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally, new connections continue to be made with other areas of mathematics and statistics. This paper outlines three such connections: to statistical models used in educational testing, to a classification problem for a family of nonparametric regression models, and to phase transition phenomena under uniform sampling of contingency tables. We illustrate the motivating problems, each of which is for algebraic statistics a new direction, and demonstrate an enhancement of related methodologies.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Absolute concentration robustness: Algebra and geometry
Authors:
Luis David García Puente,
Elizabeth Gross,
Heather A Harrington,
Matthew Johnston,
Nicolette Meshkat,
Mercedes Pérez Millán,
Anne Shiu
Abstract:
Motivated by the question of how biological systems maintain homeostasis in changing environments, Shinar and Feinberg introduced in 2010 the concept of absolute concentration robustness (ACR). A biochemical system exhibits ACR in some species if the steady-state value of that species does not depend on initial conditions. Thus, a system with ACR can maintain a constant level of one species even a…
▽ More
Motivated by the question of how biological systems maintain homeostasis in changing environments, Shinar and Feinberg introduced in 2010 the concept of absolute concentration robustness (ACR). A biochemical system exhibits ACR in some species if the steady-state value of that species does not depend on initial conditions. Thus, a system with ACR can maintain a constant level of one species even as the environment changes. Despite a great deal of interest in ACR in recent years, the following basic question remains open: How can we determine quickly whether a given biochemical system has ACR? Although various approaches to this problem have been proposed, we show that they are incomplete. Accordingly, we present new methods for deciding ACR, which harness computational algebra. We illustrate our results on several biochemical signaling networks.
△ Less
Submitted 29 December, 2023;
originally announced January 2024.
-
The Pfaffian Structure of CFN Phylogenetic Networks
Authors:
Joseph Cummings,
Elizabeth Gross,
Benjamin Hollering,
Samuel Martin,
Ikenna Nometa
Abstract:
Algebraic techniques in phylogenetics have historically been successful at proving identifiability results and have also led to novel reconstruction algorithms. In this paper, we study the ideal of phylogenetic invariants of the Cavender-Farris-Neyman (CFN) model on a phylogenetic network with the goal of providing a description of the invariants which is useful for network inference. It was previ…
▽ More
Algebraic techniques in phylogenetics have historically been successful at proving identifiability results and have also led to novel reconstruction algorithms. In this paper, we study the ideal of phylogenetic invariants of the Cavender-Farris-Neyman (CFN) model on a phylogenetic network with the goal of providing a description of the invariants which is useful for network inference. It was previously shown that to characterize the invariants of any level-1 network, it suffices to understand all sunlet networks, which are those consisting of a single cycle with a leaf adjacent to each cycle vertex. We show that the parameterization of an affine open patch of the CFN sunlet model, which intersects the probability simplex factors through the space of skew-symmetric matrices via Pfaffians. We then show that this affine patch is isomorphic to a determinantal variety and give an explicit Gr{ö}bner basis for the associated ideal, which involves only polynomially many coordinates. Lastly, we show that sunlet networks with at least 6 leaves are identifiable using only these polynomials and run extensive simulations, which show that these polynomials can be used to accurately infer the correct network from DNA sequence data.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Mixed volumes of networks with binomial steady-states
Authors:
Jane Ivy Coons,
Mark Curiel,
Elizabeth Gross
Abstract:
The steady-state degree of a chemical reaction network is the number of complex steady-states for generic rate constants and initial conditions. One way to bound the steady-state degree is through the mixed volume of the steady-state system or an equivalent system. In this work, we show that for partionable binomial networks, whose resulting steady-state systems are given by a set of binomials and…
▽ More
The steady-state degree of a chemical reaction network is the number of complex steady-states for generic rate constants and initial conditions. One way to bound the steady-state degree is through the mixed volume of the steady-state system or an equivalent system. In this work, we show that for partionable binomial networks, whose resulting steady-state systems are given by a set of binomials and a set of linear (not necessarily binomial) conservation equations, computing the mixed volume is equivalent to finding the volume of a single mixed cell that is the translate of a parallelotope. We then turn our attention to identifying cycles with binomial steady-state ideals. To this end, we give a coloring condition on directed cycles that guarantees the network has a binomial steady-state ideal. We highlight both of these theorems using a class of networks referred to as species-overlap** networks and give a formula for the mixed volume of these networks.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
One-connection rule for structural equation models
Authors:
Bibhas Adhikari,
Elizabeth Gross,
Marc Härkönen,
Elias Tsigaridas
Abstract:
Linear structural equation models are multivariate statistical models encoded by mixed graphs. In particular, the set of covariance matrices for distributions belonging to a linear structural equation model for a fixed mixed graph $G=(V, D,B)$ is parameterized by a rational function with parameters for each vertex and edge in $G$. This rational parametrization naturally allows for the study of the…
▽ More
Linear structural equation models are multivariate statistical models encoded by mixed graphs. In particular, the set of covariance matrices for distributions belonging to a linear structural equation model for a fixed mixed graph $G=(V, D,B)$ is parameterized by a rational function with parameters for each vertex and edge in $G$. This rational parametrization naturally allows for the study of these models from an algebraic and combinatorial point of view. Indeed, this point of view has led to a collection of results in the literature, mainly focusing on questions related to identifiability and determining relationships between covariances (i.e., finding polynomials in the Gaussian vanishing ideal). So far, a large proportion of these results has focused on the case when $D$, the directed part of the mixed graph $G$, is acyclic. This is due to the fact that in the acyclic case, the parametrization becomes polynomial and there is a description of the entries of the covariance matrices in terms of a finite sum. We move beyond the acyclic case and give a closed form expression for the entries of the covariance matrices in terms of the one-connections in a graph obtained from $D$ through some small operations. This closed form expression then allows us to show that if $G$ is simple, then the parametrization map is generically finite-to-one. Finally, having a closed form expression for the covariance matrices allows for the development of an algorithm for systematically exploring possible polynomials in the Gaussian vanishing ideal.
△ Less
Submitted 1 October, 2022;
originally announced October 2022.
-
Broken Bracelets and Kostant's Partition Function
Authors:
Mark Curiel,
Elizabeth Gross,
Pamela E. Harris
Abstract:
Inspired by the work of Amdeberhan, Can, and Moll on broken necklaces, we define a broken bracelet as a linear arrangement of marked and unmarked vertices and introduce a generalization called $n$-stars, which is a collection of $n$ broken bracelets whose final (unmarked) vertices are identified. Through these combinatorial objects, we provide a new framework for the study of Kostant's partition f…
▽ More
Inspired by the work of Amdeberhan, Can, and Moll on broken necklaces, we define a broken bracelet as a linear arrangement of marked and unmarked vertices and introduce a generalization called $n$-stars, which is a collection of $n$ broken bracelets whose final (unmarked) vertices are identified. Through these combinatorial objects, we provide a new framework for the study of Kostant's partition function, which counts the number of ways to express a vector as a nonnegative integer linear combination of the positive roots of a Lie algebra. Our main result establishes that (up to reflection) the number of broken bracelets with a fixed number of unmarked vertices with nonconsecutive marked vertices gives an upper bound for the value of Kostant's partition function for multiples of the highest root of a Lie algebra of type $A$. We connect this work to multiplex juggling sequences, as studied by Benedetti, Hanusa, Harris, Morales, and Simpson, by providing a correspondence to an equivalence relation on $n$-stars.
△ Less
Submitted 3 February, 2022;
originally announced February 2022.
-
Identifiability of linear compartmental tree models and a general formula for input-output equations
Authors:
Cashous Bortner,
Elizabeth Gross,
Nicolette Meshkat,
Anne Shiu,
Seth Sullivant
Abstract:
A foundational question in the theory of linear compartmental models is how to assess whether a model is structurally identifiable -- that is, whether parameter values can be inferred from noiseless data -- directly from the combinatorics of the model. Our main result completely answers this question for models (with one input and one output) in which the underlying graph is a bidirectional tree;…
▽ More
A foundational question in the theory of linear compartmental models is how to assess whether a model is structurally identifiable -- that is, whether parameter values can be inferred from noiseless data -- directly from the combinatorics of the model. Our main result completely answers this question for models (with one input and one output) in which the underlying graph is a bidirectional tree; moreover, identifiability of such models can be verified visually}. Models of this structure include two families of models often appearing in biological applications: catenary and mammillary models. Our analysis of such models is enabled by two supporting results, which are significant in their own right. One result gives the first general formula for the coefficients of input-output equations (certain equations that can be used to determine identifiability) that allows for input and output to be in distinct compartments}. In another supporting result, we prove that identifiability is preserved when a model is enlarged and altered in specific ways involving adding a new compartment with a bidirected edge to an existing compartment.
△ Less
Submitted 15 December, 2022; v1 submitted 15 June, 2021;
originally announced June 2021.
-
What are higher-order networks?
Authors:
Christian Bick,
Elizabeth Gross,
Heather A. Harrington,
Michael T. Schaub
Abstract:
Network-based modeling of complex systems and data using the language of graphs has become an essential topic across a range of different disciplines. Arguably, this graph-based perspective derives its success from the relative simplicity of graphs: A graph consists of nothing more than a set of vertices and a set of edges, describing relationships between pairs of such vertices. This simple combi…
▽ More
Network-based modeling of complex systems and data using the language of graphs has become an essential topic across a range of different disciplines. Arguably, this graph-based perspective derives its success from the relative simplicity of graphs: A graph consists of nothing more than a set of vertices and a set of edges, describing relationships between pairs of such vertices. This simple combinatorial structure makes graphs interpretable and flexible modeling tools. The simplicity of graphs as system models, however, has been scrutinized in the literature recently. Specifically, it has been argued from a variety of different angles that there is a need for higher-order networks, which go beyond the paradigm of modeling pairwise relationships, as encapsulated by graphs. In this survey article we take stock of these recent developments. Our goals are to clarify (i) what higher-order networks are, (ii) why these are interesting objects of study, and (iii) how they can be used in applications.
△ Less
Submitted 4 July, 2022; v1 submitted 20 April, 2021;
originally announced April 2021.
-
Goodness of fit for log-linear ERGMs
Authors:
Elizabeth Gross,
Sonja Petrović,
Despina Stasi
Abstract:
Many popular models from the networks literature can be viewed through a common lens of contingency tables on network dyads, resulting in \emph{log-linear ERGMs}: exponential family models for random graphs whose sufficient statistics are linear on the dyads. We propose a new model in this family, the \emph{$p_1$-SBM}, which combines node and group effects common in network formation mechanisms. I…
▽ More
Many popular models from the networks literature can be viewed through a common lens of contingency tables on network dyads, resulting in \emph{log-linear ERGMs}: exponential family models for random graphs whose sufficient statistics are linear on the dyads. We propose a new model in this family, the \emph{$p_1$-SBM}, which combines node and group effects common in network formation mechanisms. In particular, it is a generalization of several well-known ERGMs including the stochastic blockmodel for undirected graphs with known block assignment, the degree-corrected version of it, and the directed $p_1$ model without group structure.
We frame the problem of testing model fit for the log-linear ERGM class through an exact conditional test whose $p$-value can be approximated efficiently in networks of both small and moderately large sizes. The sampling methods we build rely on a dynamic adaptation of Markov bases. We use quick estimation algorithms adapted from the contingency table literature and effective sampling methods rooted in graph theory and algebraic statistics.
The performance and scalability of the method is demonstrated on two data sets from biology: the connectome of \emph{C. elegans} and the interactome of \emph{Arabidopsis thaliana}. These two networks -- a network and a protein-protein interaction network -- have been popular examples in the network science literature. Our work provides a model-based approach to studying them.
△ Less
Submitted 3 March, 2024; v1 submitted 7 April, 2021;
originally announced April 2021.
-
When do two networks have the same steady-state ideal?
Authors:
Mark Curiel,
Elizabeth Gross,
Carlos Munoz
Abstract:
Chemical reaction networks are often used to model and understand biological processes such as cell signaling. Under the framework of chemical reaction network theory, a process is modeled with a directed graph and a choice of kinetics, which together give rise to a dynamical system. Under the assumption of mass action kinetics, the dynamical system is polynomial. In this paper, we consider the id…
▽ More
Chemical reaction networks are often used to model and understand biological processes such as cell signaling. Under the framework of chemical reaction network theory, a process is modeled with a directed graph and a choice of kinetics, which together give rise to a dynamical system. Under the assumption of mass action kinetics, the dynamical system is polynomial. In this paper, we consider the ideals generated by the these polynomials, which are called steady-state ideals. Steady-state ideals appear in multiple contexts within the chemical reaction network literature, however they have yet to be systematically studied. To begin such a study, we ask and partially answer the following question: when do two reaction networks give rise to the same steady-state ideal? In particular, our main results describe three operations on the reaction graph that preserve the steady-state ideal. Furthermore, since the motivation for this work is the classification of steady-state ideals, monomials play a primary role. To this end, combinatorial conditions are given to identify monomials in a steady-state ideal, and we give a sufficient condition for a steady-state ideal to be monomial.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Binomial ideals of domino tilings
Authors:
Elizabeth Gross,
Nicole Yamzon
Abstract:
In this paper, we consider the set of all domino tilings of a cubiculated region. The primary question we explore is: How can we move from one tiling to another? Tiling spaces can be viewed as spaces of subgraphs of a fixed graph with a fixed degree sequence. Moves to connect such spaces have been explored in algebraic statistics. Thus, we approach this question from an applied algebra viewpoint,…
▽ More
In this paper, we consider the set of all domino tilings of a cubiculated region. The primary question we explore is: How can we move from one tiling to another? Tiling spaces can be viewed as spaces of subgraphs of a fixed graph with a fixed degree sequence. Moves to connect such spaces have been explored in algebraic statistics. Thus, we approach this question from an applied algebra viewpoint, making new connections between domino tilings, algebraic statistics, and toric algebra. Using results from toric ideals of graphs, we are able to describe moves that connect the tiling space of a given cubiculated region of any dimension. This is done by studying binomials that arise from two distinct domino tilings of the same region. Additionally, we introduce tiling ideals and flip ideals and use these ideals to restate what it means for a tiling space to be flip connected. Finally, we show that if $R$ is a $2$-dimensional simply connected cubiculated region, any binomial arising from two distinct tilings of $R$ can be written in terms of quadratic binomials. As a corollary to our main result, we obtain an alternative proof to the fact that the set of domino tilings of a $2$-dimensional simply connected region is connected by flips.
△ Less
Submitted 5 February, 2021; v1 submitted 6 August, 2020;
originally announced August 2020.
-
Algebraic statistics, tables, and networks: The Fienberg advantage
Authors:
Elizabeth Gross,
Vishesh Karwa,
Sonja Petrović
Abstract:
Stephen Fienberg's affinity for contingency table problems and reinterpreting models with a fresh look gave rise to a new approach for hypothesis testing of network models that are linear exponential families. We outline his vision and influence in this fundamental problem, as well as generalizations to multigraphs and hypergraphs.
Stephen Fienberg's affinity for contingency table problems and reinterpreting models with a fresh look gave rise to a new approach for hypothesis testing of network models that are linear exponential families. We outline his vision and influence in this fundamental problem, as well as generalizations to multigraphs and hypergraphs.
△ Less
Submitted 3 October, 2019;
originally announced October 2019.
-
The steady-state degree and mixed volume of a chemical reaction network
Authors:
Elizabeth Gross,
Cvetelina Hill
Abstract:
The steady-state degree of a chemical reaction network is the number of complex steady-states, which is a measure of the algebraic complexity of solving the steady-state system. In general, the steady-state degree may be difficult to compute. Here, we give an upper bound to the steady-state degree of a reaction network by utilizing the underlying polyhedral geometry associated with the correspondi…
▽ More
The steady-state degree of a chemical reaction network is the number of complex steady-states, which is a measure of the algebraic complexity of solving the steady-state system. In general, the steady-state degree may be difficult to compute. Here, we give an upper bound to the steady-state degree of a reaction network by utilizing the underlying polyhedral geometry associated with the corresponding polynomial system. We focus on three case studies of infinite families of networks, each generated by joining smaller networks to create larger ones. For each family, we give a formula for the steady-state degree and the mixed volume of the corresponding polynomial system.
△ Less
Submitted 23 April, 2020; v1 submitted 14 September, 2019;
originally announced September 2019.
-
Joining and decomposing reaction networks
Authors:
Elizabeth Gross,
Heather A Harrington,
Nicolette Meshkat,
Anne Shiu
Abstract:
In systems and synthetic biology, much research has focused on the behavior and design of single pathways, while, more recently, experimental efforts have focused on how cross-talk (coupling two or more pathways) or inhibiting molecular function (isolating one part of the pathway) affects systems-level behavior. However, the theory for tackling these larger systems in general has lagged behind. He…
▽ More
In systems and synthetic biology, much research has focused on the behavior and design of single pathways, while, more recently, experimental efforts have focused on how cross-talk (coupling two or more pathways) or inhibiting molecular function (isolating one part of the pathway) affects systems-level behavior. However, the theory for tackling these larger systems in general has lagged behind. Here, we analyze how joining networks (e.g., cross-talk) or decomposing networks (e.g., inhibition or knock-outs) affects three properties that reaction networks may possess---identifiability (recoverability of parameter values from data), steady-state invariants (relationships among species concentrations at steady state, used in model selection), and multistationarity (capacity for multiple steady states, which correspond to multiple cell decisions). Specifically, we prove results that clarify, for a network obtained by joining two smaller networks, how properties of the smaller networks can be inferred from or can imply similar properties of the original network. Our proofs use techniques from computational algebraic geometry, including elimination theory and differential algebra.
△ Less
Submitted 14 August, 2019; v1 submitted 12 October, 2018;
originally announced October 2018.
-
Linear compartmental models: input-output equations and operations that preserve identifiability
Authors:
Elizabeth Gross,
Heather A. Harrington,
Nicolette Meshkat,
Anne Shiu
Abstract:
This work focuses on the question of how identifiability of a mathematical model, that is, whether parameters can be recovered from data, is related to identifiability of its submodels. We look specifically at linear compartmental models and investigate when identifiability is preserved after adding or removing model components. In particular, we examine whether identifiability is preserved when a…
▽ More
This work focuses on the question of how identifiability of a mathematical model, that is, whether parameters can be recovered from data, is related to identifiability of its submodels. We look specifically at linear compartmental models and investigate when identifiability is preserved after adding or removing model components. In particular, we examine whether identifiability is preserved when an input, output, edge, or leak is added or deleted. Our approach, via differential algebra, is to analyze specific input-output equations of a model and the Jacobian of the associated coefficient map. We clarify a prior determinantal formula for these equations, and then use it to prove that, under some hypotheses, a model's input-output equations can be understood in terms of certain submodels we call "output-reachable". Our proofs use algebraic and combinatorial techniques.
△ Less
Submitted 24 May, 2019; v1 submitted 1 August, 2018;
originally announced August 2018.
-
Algebraic signatures of convex and non-convex codes
Authors:
Carina Curto,
Elizabeth Gross,
Jack Jeffries,
Katherine Morrison,
Zvi Rosen,
Anne Shiu,
Nora Youngs
Abstract:
A convex code is a binary code generated by the pattern of intersections of a collection of open convex sets in some Euclidean space. Convex codes are relevant to neuroscience as they arise from the activity of neurons that have convex receptive fields. In this paper, we use algebraic methods to determine if a code is convex. Specifically, we use the neural ideal of a code, which is a generalizati…
▽ More
A convex code is a binary code generated by the pattern of intersections of a collection of open convex sets in some Euclidean space. Convex codes are relevant to neuroscience as they arise from the activity of neurons that have convex receptive fields. In this paper, we use algebraic methods to determine if a code is convex. Specifically, we use the neural ideal of a code, which is a generalization of the Stanley-Reisner ideal. Using the neural ideal together with its standard generating set, the canonical form, we provide algebraic signatures of certain families of codes that are non-convex. We connect these signatures to the precise conditions on the arrangement of sets that prevent the codes from being convex. Finally, we also provide algebraic signatures for some families of codes that are convex, including the class of intersection-complete codes. These results allow us to detect convexity and non-convexity in a variety of situations, and point to some interesting open questions.
△ Less
Submitted 7 July, 2018;
originally announced July 2018.
-
Dimensions of Group-based Phylogenetic Mixtures
Authors:
Hector Baños,
Nathaniel Bushek,
Ruth Davidson,
Elizabeth Gross,
Pamela E. Harris,
Robert Krone,
Colby Long,
Allen Stewart,
Robert Walker
Abstract:
In this paper we study group-based Markov models of evolution and their mixtures. In the algebreo-geometric setting, group-based phylogenetic tree models correspond to toric varieties, while their mixtures correspond to secant and join varieties. Determining properties of these secant and join varieties can aid both in model selection and establishing parameter identifiability. Here we explore the…
▽ More
In this paper we study group-based Markov models of evolution and their mixtures. In the algebreo-geometric setting, group-based phylogenetic tree models correspond to toric varieties, while their mixtures correspond to secant and join varieties. Determining properties of these secant and join varieties can aid both in model selection and establishing parameter identifiability. Here we explore the first natural geometric property of these varieties: their dimension. The expected projective dimension of the join variety of a set of varieties is one more than the sum of their dimensions. A join variety that realizes the expected dimension is nondefective. Nondefectiveness is not only interesting from a geometric point-of-view, but has been used to establish combinatorial identifiability for several classes of phylogenetic mixture models. In this paper, we focus on group-based models where the equivalence classes of identified parameters are orbits of a subgroup of the automorphism group of the group defining the model. In particular, we show that, for these group-based models, the variety corresponding to the mixture of $r$ trees with $n$ leaves is nondefective when $n \geq 2r+5$. We also give improved bounds for claw trees and give computational evidence that 2-tree and 3-tree mixtures are nondefective for small~$n$.
△ Less
Submitted 23 November, 2017;
originally announced November 2017.
-
Identifiability of linear compartmental models: the singular locus
Authors:
Elizabeth Gross,
Nicolette Meshkat,
Anne Shiu
Abstract:
This work addresses the problem of identifiability, that is, the question of whether parameters can be recovered from data, for linear compartmental models. Using standard differential algebra techniques, the question of whether a given model is generically locally identifiable is equivalent to asking whether the Jacobian matrix of a certain coefficient map, arising from input-output equations, is…
▽ More
This work addresses the problem of identifiability, that is, the question of whether parameters can be recovered from data, for linear compartmental models. Using standard differential algebra techniques, the question of whether a given model is generically locally identifiable is equivalent to asking whether the Jacobian matrix of a certain coefficient map, arising from input-output equations, is generically full rank. A natural next step is to study the set of parameter values where the Jacobian matrix drops in rank, which we refer to as the locus of non-identifiable parameter values, or, for short, the singular locus. In this work, we give a formula for coefficient maps in terms of acyclic subgraphs of the model's underlying directed graph and, then, study the case when the singular locus is defined by a single equation, the singular-locus equation. We prove that the singular-locus equation can be used to determine when submodels are generically locally identifiable. We also determine the singular-locus equation for two families of linear compartmental models, cycle and mammillary (star) models with input and output in a single compartment. We also state a conjecture for the corresponding equation for a third family: catenary (path) models. Finally, we introduce the identifiability degree, which is the number of parameter values that map to a generic input-output data vector. This degree was previously computed for mammillary and catenary models, and here we determine this degree for cycle models.
△ Less
Submitted 28 June, 2021; v1 submitted 28 September, 2017;
originally announced September 2017.
-
Distinguishing Phylogenetic Networks
Authors:
Elizabeth Gross,
Colby Long
Abstract:
Phylogenetic networks are becoming increasingly popular in phylogenetics since they have the ability to describe a wider range of evolutionary events than their tree counterparts. In this paper, we study Markov models on phylogenetic networks and their associated geometry. We restrict our attention to large-cycle networks, networks with a single undirected cycle of length at least four. Using tool…
▽ More
Phylogenetic networks are becoming increasingly popular in phylogenetics since they have the ability to describe a wider range of evolutionary events than their tree counterparts. In this paper, we study Markov models on phylogenetic networks and their associated geometry. We restrict our attention to large-cycle networks, networks with a single undirected cycle of length at least four. Using tools from computational algebraic geometry, we show that the semi-directed network topology is generically identifiable for Jukes-Cantor large-cycle network models.
△ Less
Submitted 9 June, 2017;
originally announced June 2017.
-
The Multiple Roots Phenomenon in Maximum Likelihood Estimation for Factor Analysis
Authors:
Elizabeth Gross,
Sonja Petrović,
Donald Richards,
Despina Stasi
Abstract:
Multiple root estimation problems in statistical inference arise in many contexts in the literature. In the context of maximum likelihood estimation, the existence of multiple roots causes uncertainty in the computation of maximum likelihood estimators using hill-climbing algorithms, and consequent difficulties in the resulting statistical inference.
In this paper, we study the multiple roots ph…
▽ More
Multiple root estimation problems in statistical inference arise in many contexts in the literature. In the context of maximum likelihood estimation, the existence of multiple roots causes uncertainty in the computation of maximum likelihood estimators using hill-climbing algorithms, and consequent difficulties in the resulting statistical inference.
In this paper, we study the multiple roots phenomenon in maximum likelihood estimation for factor analysis. We prove that the corresponding likelihood equations have uncountably many feasible solutions even in the simplest cases. For the case in which the observed data are two-dimensional and the unobserved factor scores are one-dimensional, we prove that the solutions to the likelihood equations form a one-dimensional real curve.
△ Less
Submitted 15 February, 2017;
originally announced February 2017.
-
Phylogenetic trees
Authors:
Hector Baños,
Nathaniel Bushek,
Ruth Davidson,
Elizabeth Gross,
Pamela E. Harris,
Robert Krone,
Colby Long,
Allen Stewart,
Robert Walker
Abstract:
We introduce the package PhylogeneticTrees for Macaulay2 which allows users to compute phylogenetic invariants for group-based tree models. We provide some background information on phylogenetic algebraic geometry and show how the package PhylogeneticTrees can be used to calculate a generating set for a phylogenetic ideal as well as a lower bound for its dimension. Finally, we show how methods wit…
▽ More
We introduce the package PhylogeneticTrees for Macaulay2 which allows users to compute phylogenetic invariants for group-based tree models. We provide some background information on phylogenetic algebraic geometry and show how the package PhylogeneticTrees can be used to calculate a generating set for a phylogenetic ideal as well as a lower bound for its dimension. Finally, we show how methods within the package can be used to compute a generating set for the join of any two ideals.
△ Less
Submitted 17 November, 2016;
originally announced November 2016.
-
Neural ideals and stimulus space visualization
Authors:
Elizabeth Gross,
Nida Kazi Obatake,
Nora Youngs
Abstract:
A neural code $\mathcal{C}$ is a collection of binary vectors of a given length n that record the co-firing patterns of a set of neurons. Our focus is on neural codes arising from place cells, neurons that respond to geographic stimulus. In this setting, the stimulus space can be visualized as subset of $\mathbb{R}^2$ covered by a collection $\mathcal{U}$ of convex sets such that the arrangement…
▽ More
A neural code $\mathcal{C}$ is a collection of binary vectors of a given length n that record the co-firing patterns of a set of neurons. Our focus is on neural codes arising from place cells, neurons that respond to geographic stimulus. In this setting, the stimulus space can be visualized as subset of $\mathbb{R}^2$ covered by a collection $\mathcal{U}$ of convex sets such that the arrangement $\mathcal{U}$ forms an Euler diagram for $\mathcal{C}$. There are some methods to determine whether such a convex realization $\mathcal{U}$ exists; however, these methods do not describe how to draw a realization. In this work, we look at the problem of algorithmically drawing Euler diagrams for neural codes using two polynomial ideals: the neural ideal, a pseudo-monomial ideal; and the neural toric ideal, a binomial ideal. In particular, we study how these objects are related to the theory of piercings in information visualization, and we show how minimal generating sets of the ideals reveal whether or not a code is $0$, $1$, or $2$-inductively pierced.
△ Less
Submitted 3 July, 2016;
originally announced July 2016.
-
What makes a neural code convex?
Authors:
Carina Curto,
Elizabeth Gross,
Jack Jeffries,
Katherine Morrison,
Mohamed Omar,
Zvi Rosen,
Anne Shiu,
Nora Youngs
Abstract:
Neural codes allow the brain to represent, process, and store information about the world. Combinatorial codes, comprised of binary patterns of neural activity, encode information via the collective behavior of populations of neurons. A code is called convex if its codewords correspond to regions defined by an arrangement of convex open sets in Euclidean space. Convex codes have been observed expe…
▽ More
Neural codes allow the brain to represent, process, and store information about the world. Combinatorial codes, comprised of binary patterns of neural activity, encode information via the collective behavior of populations of neurons. A code is called convex if its codewords correspond to regions defined by an arrangement of convex open sets in Euclidean space. Convex codes have been observed experimentally in many brain areas, including sensory cortices and the hippocampus, where neurons exhibit convex receptive fields. What makes a neural code convex? That is, how can we tell from the intrinsic structure of a code if there exists a corresponding arrangement of convex open sets? In this work, we provide a complete characterization of local obstructions to convexity. This motivates us to define max intersection-complete codes, a family guaranteed to have no local obstructions. We then show how our characterization enables one to use free resolutions of Stanley-Reisner ideals in order to detect violations of convexity. Taken together, these results provide a significant advance in understanding the intrinsic combinatorial properties of convex codes.
△ Less
Submitted 21 December, 2016; v1 submitted 1 August, 2015;
originally announced August 2015.
-
Numerical algebraic geometry for model selection and its application to the life sciences
Authors:
Elizabeth Gross,
Brent Davis,
Kenneth L. Ho,
Daniel J. Bates,
Heather A. Harrington
Abstract:
Researchers working with mathematical models are often confronted by the related problems of parameter estimation, model validation, and model selection. These are all optimization problems, well-known to be challenging due to non-linearity, non-convexity and multiple local optima. Furthermore, the challenges are compounded when only partial data is available. Here, we consider polynomial models (…
▽ More
Researchers working with mathematical models are often confronted by the related problems of parameter estimation, model validation, and model selection. These are all optimization problems, well-known to be challenging due to non-linearity, non-convexity and multiple local optima. Furthermore, the challenges are compounded when only partial data is available. Here, we consider polynomial models (e.g., mass-action chemical reaction networks at steady state) and describe a framework for their analysis based on optimization using numerical algebraic geometry. Specifically, we use probability-one polynomial homotopy continuation methods to compute all critical points of the objective function, then filter to recover the global optima. Our approach exploits the geometric structures relating models and data, and we demonstrate its utility on examples from cell signaling, synthetic biology, and epidemiology.
△ Less
Submitted 1 April, 2016; v1 submitted 15 July, 2015;
originally announced July 2015.
-
Algebraic Systems Biology: A Case Study for the Wnt Pathway
Authors:
Elizabeth Gross,
Heather A. Harrington,
Zvi Rosen,
Bernd Sturmfels
Abstract:
Steady state analysis of dynamical systems for biological networks give rise to algebraic varieties in high-dimensional spaces whose study is of interest in their own right. We demonstrate this for the shuttle model of the Wnt signaling pathway. Here the variety is described by a polynomial system in 19 unknowns and 36 parameters. Current methods from computational algebraic geometry and combinato…
▽ More
Steady state analysis of dynamical systems for biological networks give rise to algebraic varieties in high-dimensional spaces whose study is of interest in their own right. We demonstrate this for the shuttle model of the Wnt signaling pathway. Here the variety is described by a polynomial system in 19 unknowns and 36 parameters. Current methods from computational algebraic geometry and combinatorics are applied to analyze this model.
△ Less
Submitted 10 February, 2015;
originally announced February 2015.
-
The Maximum Likelihood Threshold of a Graph
Authors:
Elizabeth Gross,
Seth Sullivant
Abstract:
The maximum likelihood threshold of a graph is the smallest number of data points that guarantees that maximum likelihood estimates exist almost surely in the Gaussian graphical model associated to the graph. We show that this graph parameter is connected to the theory of combinatorial rigidity. In particular, if the edge set of a graph $G$ is an independent set in the $n-1$-dimensional generic ri…
▽ More
The maximum likelihood threshold of a graph is the smallest number of data points that guarantees that maximum likelihood estimates exist almost surely in the Gaussian graphical model associated to the graph. We show that this graph parameter is connected to the theory of combinatorial rigidity. In particular, if the edge set of a graph $G$ is an independent set in the $n-1$-dimensional generic rigidity matroid, then the maximum likelihood threshold of $G$ is less than or equal to $n$. This connection allows us to prove many results about the maximum likelihood threshold.
△ Less
Submitted 15 September, 2015; v1 submitted 28 April, 2014;
originally announced April 2014.
-
Goodness-of-fit for log-linear network models: Dynamic Markov bases using hypergraphs
Authors:
Elizabeth Gross,
Sonja Petrović,
Despina Stasi
Abstract:
Social networks and other large sparse data sets pose significant challenges for statistical inference, as many standard statistical methods for testing model fit are not applicable in such settings. Algebraic statistics offers a theoretically justified approach to goodness-of-fit testing that relies on the theory of Markov bases and is intimately connected with the geometry of the model as descri…
▽ More
Social networks and other large sparse data sets pose significant challenges for statistical inference, as many standard statistical methods for testing model fit are not applicable in such settings. Algebraic statistics offers a theoretically justified approach to goodness-of-fit testing that relies on the theory of Markov bases and is intimately connected with the geometry of the model as described by its fibers.
Most current practices require the computation of the entire basis, which is infeasible in many practical settings. We present a dynamic approach to explore the fiber of a model, which bypasses this issue, and is based on the combinatorics of hypergraphs arising from the toric algebra structure of log-linear models.
We demonstrate the approach on the Holland-Leinhardt $p_1$ model for random directed graphs that allows for reciprocated edges.
△ Less
Submitted 20 January, 2014;
originally announced January 2014.
-
Maximum likelihood geometry in the presence of data zeros
Authors:
Elizabeth Gross,
Jose Israel Rodriguez
Abstract:
Given a statistical model, the maximum likelihood degree is the number of complex solutions to the likelihood equations for generic data. We consider discrete algebraic statistical models and study the solutions to the likelihood equations when the data contain zeros and are no longer generic. Focusing on sampling and model zeros, we show that, in these cases, the solutions to the likelihood equat…
▽ More
Given a statistical model, the maximum likelihood degree is the number of complex solutions to the likelihood equations for generic data. We consider discrete algebraic statistical models and study the solutions to the likelihood equations when the data contain zeros and are no longer generic. Focusing on sampling and model zeros, we show that, in these cases, the solutions to the likelihood equations are contained in a previously studied variety, the likelihood correspondence. The number of these solutions give a lower bound on the ML degree, and the problem of finding critical points to the likelihood function can be partitioned into smaller and computationally easier problems involving sampling and model zeros. We use this technique to compute a lower bound on the ML degree for $2 \times 2 \times 2 \times 2$ tensors of border rank $\leq 2$ and $3 \times n$ tables of rank $\leq 2$ for $n=11, 12, 13, 14$, the first four values of $n$ for which the ML degree was previously unknown.
△ Less
Submitted 5 May, 2014; v1 submitted 15 October, 2013;
originally announced October 2013.
-
Bertini for Macaulay2
Authors:
Daniel J. Bates,
Elizabeth Gross,
Anton Leykin,
Jose Israel Rodriguez
Abstract:
Numerical algebraic geometry is the field of computational mathematics concerning the numerical solution of polynomial systems of equations. Bertini, a popular software package for computational applications of this field, includes implementations of a variety of algorithms based on polynomial homotopy continuation. The Macaulay2 package Bertini.m2 provides an interface to Bertini, making it possi…
▽ More
Numerical algebraic geometry is the field of computational mathematics concerning the numerical solution of polynomial systems of equations. Bertini, a popular software package for computational applications of this field, includes implementations of a variety of algorithms based on polynomial homotopy continuation. The Macaulay2 package Bertini.m2 provides an interface to Bertini, making it possible to access the core run modes of Bertini in Macaulay2. With these run modes, users can find approximate solutions to zero-dimensional systems and positive-dimensional systems, test numerically whether a point lies on a variety, sample numerically from a variety, and perform parameter homotopy runs.
△ Less
Submitted 11 October, 2013;
originally announced October 2013.
-
Combinatorial degree bound for toric ideals of hypergraphs
Authors:
Elizabeth Gross,
Sonja Petrović
Abstract:
Associated to any hypergraph is a toric ideal encoding the algebraic relations among its edges. We study these ideals and the combinatorics of their minimal generators, and derive general degree bounds for both uniform and non-uniform hypergraphs in terms of balanced hypergraph bicolorings, separators, and splitting sets. In turn, this provides complexity bounds for algebraic statistical models as…
▽ More
Associated to any hypergraph is a toric ideal encoding the algebraic relations among its edges. We study these ideals and the combinatorics of their minimal generators, and derive general degree bounds for both uniform and non-uniform hypergraphs in terms of balanced hypergraph bicolorings, separators, and splitting sets. In turn, this provides complexity bounds for algebraic statistical models associated to hypergraphs. As two main applications, we recover a well-known complexity result for Markov bases of arbitrary 3-way tables, and we show that the defining ideal of the tangential variety is generated by quadratics and cubics in cumulant coordinates.
△ Less
Submitted 21 December, 2012; v1 submitted 12 June, 2012;
originally announced June 2012.
-
Maximum likelihood degree of variance component models
Authors:
Elizabeth Gross,
Mathias Drton,
Sonja Petrović
Abstract:
Most statistical software packages implement numerical strategies for computation of maximum likelihood estimates in random effects models. Little is known, however, about the algebraic complexity of this problem. For the one-way layout with random effects and unbalanced group sizes, we give formulas for the algebraic degree of the likelihood equations as well as the equations for restricted maxim…
▽ More
Most statistical software packages implement numerical strategies for computation of maximum likelihood estimates in random effects models. Little is known, however, about the algebraic complexity of this problem. For the one-way layout with random effects and unbalanced group sizes, we give formulas for the algebraic degree of the likelihood equations as well as the equations for restricted maximum likelihood estimation. In particular, the latter approach is shown to be algebraically less complex. The formulas are obtained by studying a univariate rational equation whose solutions correspond to the solutions of the likelihood equations. Applying techniques from computational algebra, we also show that balanced two-way layouts with or without interaction have likelihood equations of degree four. Our work suggests that algebraic methods allow one to reliably find global optima of likelihood functions of linear mixed models with a small number of variance components.
△ Less
Submitted 14 November, 2011;
originally announced November 2011.
-
PHCpack in Macaulay2
Authors:
Elizabeth Gross,
Sonja Petrović,
Jan Verschelde
Abstract:
The Macaulay2 package PHCpack.m2 provides an interface to PHCpack, a general-purpose polynomial system solver that uses homotopy continuation. The main method is a numerical blackbox solver which is implemented for all Laurent systems. The package also provides a fast mixed volume computation, the ability to filter solutions, homotopy path tracking, and a numerical irreducible decomposition method…
▽ More
The Macaulay2 package PHCpack.m2 provides an interface to PHCpack, a general-purpose polynomial system solver that uses homotopy continuation. The main method is a numerical blackbox solver which is implemented for all Laurent systems. The package also provides a fast mixed volume computation, the ability to filter solutions, homotopy path tracking, and a numerical irreducible decomposition method. As the size of many problems in applied algebraic geometry often surpasses the capabilities of symbolic software, this package will be of interest to those working on problems involving large polynomial systems.
△ Less
Submitted 10 October, 2012; v1 submitted 24 May, 2011;
originally announced May 2011.
-
A proof of the set-theoretic version of the salmon conjecture
Authors:
Shmuel Friedland,
Elizabeth Gross
Abstract:
We show that the irreducible variety of 4 x 4 x 4 complex valued tensors of border rank at most 4 is the zero set of polynomial equations of degree 5 (the Strassen commutative conditions), of degree 6 (the Landsberg-Manivel polynomials), and of degree 9 (the symmetrization conditions).
We show that the irreducible variety of 4 x 4 x 4 complex valued tensors of border rank at most 4 is the zero set of polynomial equations of degree 5 (the Strassen commutative conditions), of degree 6 (the Landsberg-Manivel polynomials), and of degree 9 (the symmetrization conditions).
△ Less
Submitted 29 April, 2011; v1 submitted 10 April, 2011;
originally announced April 2011.